Skip to content

Commit

Permalink
Pass pad_token_id as model compilation parameters
Browse files Browse the repository at this point in the history
  • Loading branch information
AsyaPronina committed Jan 15, 2025
1 parent ef1f0f6 commit cf94047
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions src/cpp/src/llm_pipeline_static.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -754,6 +754,7 @@ void StatefulLLMPipeline::updateStatefulConfig(
KVAxesPosition axes = get_kv_axes(model_desc.type);
update_config(pipeline_config, {"NPUW_LLM_BATCH_DIM", axes.batch});
update_config(pipeline_config, {"NPUW_LLM_SEQ_LEN_DIM", axes.seq_len});
pipeline_config["NPUW_LLM_PAD_TOKEN_ID"] = m_tokenizer.get_pad_token_id();

update_config(pipeline_config, {"NPUW_LLM_MAX_PROMPT_LEN", kMaxPromptLen});
update_config(pipeline_config, {"NPUW_LLM_MIN_RESPONSE_LEN", kMinResponseLen});
Expand Down

0 comments on commit cf94047

Please sign in to comment.