fill prompt for sampler analysis with real tokens in VLM pipeline #1247
Conversation
sbalandi commented on Nov 22, 2024 (edited)
- add the missed token if the previous generation finished because the maximum length was reached (see the sketch below)
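A minimal sketch of that fix, written against plain std::vector rather than the pipeline's actual types; all names here are placeholders, not the real identifiers:

```cpp
#include <cstdint>
#include <vector>

// Sketch of the description above: if the previous generation stopped because
// the length limit was hit, the last sampled token never made it into the
// stored history, so append it before the next turn starts.
void append_missed_token(std::vector<int64_t>& tokenized_history,
                         int64_t last_sampled_token,
                         bool prev_stopped_on_max_length) {
    if (prev_stopped_on_max_length) {
        tokenized_history.push_back(last_sampled_token);
    }
}
```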
The review thread below is on this hunk in the VLM pipeline:

```cpp
std::fill_n(prompt_ids.data<int64_t>(), prompt_ids.get_size(), 0);

auto chat_history = m_inputs_embedder->get_tokenized_chat_history();
size_t chat_history_size = std::max(chat_history.get_shape().at(1), history_size + inputs_embeds_size);
```
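For illustration, a hedged sketch of what the hunk above amounts to, using plain std::vector in place of ov::Tensor; this is not the PR's literal code:

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch of "filling the prompt with real tokens": instead of zero-filling the
// buffer handed to the sampler (the old std::fill_n call), copy the most
// recent token ids from the tokenized chat history into it.
void fill_prompt_with_real_tokens(const std::vector<int64_t>& tokenized_chat_history,
                                  std::vector<int64_t>& prompt_ids) {
    const size_t n = std::min(prompt_ids.size(), tokenized_chat_history.size());
    std::copy(tokenized_chat_history.end() - static_cast<std::ptrdiff_t>(n),
              tokenized_chat_history.end(),
              prompt_ids.begin());
}
```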
It looks like we have the same case as for LLMs, where decode(encode(X)) yields a smaller value than X? In that case we need to partially re-compute the history.
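A rough sketch of the check being described, with illustrative names only; the decode/encode round trip is assumed to have already produced `retokenized_history`:

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch of the "partially re-compute the history" idea: compare the stored
// token ids with the ids obtained from the decode/encode round trip. Tokens
// past the common prefix are the part of the history whose KV-cache entries
// would have to be recomputed.
size_t common_prefix_length(const std::vector<int64_t>& stored_history,
                            const std::vector<int64_t>& retokenized_history) {
    const size_t limit = std::min(stored_history.size(), retokenized_history.size());
    size_t i = 0;
    while (i < limit && stored_history[i] == retokenized_history[i]) {
        ++i;
    }
    return i;  // everything from index i onward needs re-computation
}
```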
In general, I would consider merging the VLM and LLM pipelines' generate functions to keep all this history magic in one place. Or at least creating a helper function similar to get_lm_encoded_results.
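Purely as an illustration of that suggestion, a hypothetical shared helper that both generate() paths could call; the name, struct, and parameters below are invented for this sketch and are not the repository's get_lm_encoded_results:

```cpp
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Invented outline: one place that applies the history fix-ups (missed-token
// append plus building the real-token prompt) for both the LLM and VLM paths.
struct SamplerPromptState {
    std::vector<int64_t> prompt_ids;  // real tokens handed to the sampler
    size_t history_size = 0;          // tokens already covered by the KV cache
};

SamplerPromptState prepare_sampler_prompt(std::vector<int64_t> tokenized_history,
                                          bool prev_stopped_on_max_length,
                                          int64_t last_sampled_token,
                                          size_t history_size) {
    if (prev_stopped_on_max_length) {
        tokenized_history.push_back(last_sampled_token);
    }
    return SamplerPromptState{std::move(tokenized_history), history_size};
}
```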
I'll try to merge some parts of this after #1215.
Force-pushed from 15fdc3c to 2b26160
rebased on #1254
Force-pushed from c6b1907 to 53cd2f7
Force-pushed from 22101ad to a8b866c
Force-pushed from 71a7cd2 to 8a8e513