Whisper pipeline: use Sampler #1615

as-suvorov · 2025-01-22T09:59:29Z

Ticket: 152889
Closes #1164

src/cpp/src/sampler.cpp

src/cpp/src/whisper/models/decoder.cpp

Wovchena · 2025-01-24T08:19:25Z

src/cpp/src/whisper/whisper.cpp

+    }
+}
+
+std::pair<ov::genai::EncodedResults, bool> decode(std::shared_ptr<ov::genai::WhisperDecoder> decoder,


Suggested change

-std::pair<ov::genai::EncodedResults, bool> decode(std::shared_ptr<ov::genai::WhisperDecoder> decoder,
+std::pair<ov::genai::EncodedResults, bool> decode(const std::shared_ptr<ov::genai::WhisperDecoder>& decoder,
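For context on the suggestion above, here is a minimal sketch of what passing a `std::shared_ptr` by value costs compared with passing it by const reference. `Decoder` and both helper functions are hypothetical stand-ins for illustration, not code from this PR.

```cpp
#include <cassert>
#include <memory>

// Hypothetical stand-in for ov::genai::WhisperDecoder; not the real class.
struct Decoder {};

// By value: the parameter is a new owner, so the reference count is
// incremented on entry and decremented again when the function returns.
long use_count_by_value(std::shared_ptr<Decoder> decoder) {
    return decoder.use_count();
}

// By const reference: no new owner is created, so the count is untouched.
long use_count_by_const_ref(const std::shared_ptr<Decoder>& decoder) {
    return decoder.use_count();
}
```

When the callee only uses the object and does not keep its own copy of the pointer, the const reference avoids the extra (typically atomic) count update.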

Wovchena · 2025-01-24T08:19:39Z

src/cpp/src/whisper/whisper.cpp

+                                                  const ov::Tensor& encoder_hidden_state,
+                                                  const std::shared_ptr<ov::genai::StreamerBase> streamer_ptr,
+                                                  ov::genai::Sampler& sampler,
+                                                  ov::genai::SequenceGroup::Ptr sequence_group,


Suggested change

-ov::genai::SequenceGroup::Ptr sequence_group,
+const ov::genai::SequenceGroup::Ptr& sequence_group,

Wovchena · 2025-01-24T08:28:27Z

src/cpp/src/whisper/whisper_utils.cpp

+    const float* logits_data = logits.data<const float>() + batch_offset + sequence_offset;
+
+    int64_t out_token = std::max_element(logits_data, logits_data + vocab_size) - logits_data;
+    float max_logit = logits_data[out_token];


Suggested change

-float max_logit = logits_data[out_token];

Not used
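The greedy selection in the snippet above needs only the index of the largest logit, which is why the value itself can go. A minimal sketch follows, assuming a contiguous slice of `vocab_size` floats; `argmax_token` is a hypothetical helper name, not a function from this PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: index of the largest logit in one vocabulary-sized slice.
// `logits` points at the first logit of the slice; `vocab_size` must be > 0.
int64_t argmax_token(const float* logits, size_t vocab_size) {
    // std::max_element returns an iterator to the first largest element,
    // so subtracting the start pointer yields the token id.
    return std::max_element(logits, logits + vocab_size) - logits;
}
```

On ties, `std::max_element` returns the first maximum, so the lowest token id wins.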

Wovchena · 2025-01-24T08:40:04Z

src/python/py_whisper_pipeline.cpp

+    top_p:              if set to float < 1, only the smallest set of most probable tokens with probabilities that add up to top_p or higher are kept for generation.
+    top_k:              the number of highest probability vocabulary tokens to keep for top-k-filtering.
+    do_sample:          whether or not to use multinomial random sampling that add up to `top_p` or higher are kept.
+    num_return_sequences: the number of sequences to generate from a single prompt.


rng_seed is missing, although that's not Whisper's fault because it's missing in the parent config as well. Can you add it, possibly in a separate PR?

Sure, will do
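To illustrate how the documented parameters interact, here is a rough sketch of top-k and top-p filtering followed by a seeded multinomial draw. It is not the `ov::genai::Sampler` implementation: `sample_token` is a hypothetical helper, and using `rng_seed` to seed a `std::mt19937` is an assumption made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <random>
#include <vector>

// Hypothetical helper, not the ov::genai::Sampler implementation.
// Keeps the top_k most probable tokens, then the smallest prefix of them whose
// probabilities sum to at least top_p, and draws one token from what remains.
// `logits` must be non-empty.
int64_t sample_token(const std::vector<float>& logits, size_t top_k, float top_p, std::mt19937& rng) {
    // Softmax with the maximum subtracted for numerical stability.
    const float max_logit = *std::max_element(logits.begin(), logits.end());
    std::vector<double> probs(logits.size());
    double sum = 0.0;
    for (size_t i = 0; i < logits.size(); ++i) {
        probs[i] = std::exp(static_cast<double>(logits[i] - max_logit));
        sum += probs[i];
    }
    for (double& p : probs) p /= sum;

    // Token ids ordered by descending probability.
    std::vector<int64_t> ids(logits.size());
    std::iota(ids.begin(), ids.end(), int64_t{0});
    std::stable_sort(ids.begin(), ids.end(),
                     [&](int64_t a, int64_t b) { return probs[a] > probs[b]; });

    // top_k: keep at most top_k candidates (and at least one).
    const size_t keep = std::min(std::max<size_t>(top_k, 1), ids.size());

    // top_p: keep the smallest prefix whose cumulative probability reaches top_p.
    double cumulative = 0.0;
    size_t nucleus = 0;
    while (nucleus < keep) {
        cumulative += probs[ids[nucleus++]];
        if (cumulative >= top_p) break;
    }

    // Multinomial draw; discrete_distribution renormalizes the kept weights.
    std::vector<double> weights(nucleus);
    for (size_t i = 0; i < nucleus; ++i) weights[i] = probs[ids[i]];
    std::discrete_distribution<int> pick(weights.begin(), weights.end());
    return ids[static_cast<size_t>(pick(rng))];
}
```

Two generators constructed from the same seed produce the same token for the same inputs, which is the reproducibility a documented `rng_seed` would give users.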

as-suvorov added 23 commits January 3, 2025 15:23

use decoder interface

68d3e48

Merge remote-tracking branch 'upstream/master' into as/statefull_whisper

ade7313

remove reshape

b2df4a6

use stateful seq2seq barnch

17e3ea7

Address review comments

806b01a

Rename

e041a33

Use commit

9502d9b

Set tests reqs

6c30fa4

Merge remote-tracking branch 'upstream/master' into as/statefull_whisper

7600072

Merge remote-tracking branch 'upstream/master' into as/statefull_whisper

f870a4c

Add with_past model tests

e38cf5c

remove comment

acc656f

Merge remote-tracking branch 'upstream/master' into as/statefull_whisper

aa0f742

bump tokenizers

3728884

Fix typo

445ce5a

Add deprecation message

5bdd695

Use sampler for whisper pipeline

7f2a153

Add with past decoder

c368401

Refactor with past decoder

2e061aa

Do not copy encoder_hidden_states if not needed

4eaa9a7

Merge remote-tracking branch 'upstream/master' into as/sampler_for_whisper

3139a43

Add stubs

50fb829

Remove comment

5021742

as-suvorov added this to the 2025.1 milestone Jan 22, 2025

as-suvorov added do_not_merge do_not_review labels Jan 22, 2025

github-actions bot added the category: continuous batching, category: LLM, category: whisper, and category: sampling labels Jan 22, 2025

as-suvorov added 4 commits January 22, 2025 17:35

Apply review comments

56bf11c

move set_encoder_states to base class

50eb509

Merge remote-tracking branch 'upstream/master' into as/sampler_for_whisper

e802584

Move detect_language to base decoder

abde309

github-actions bot added the category: samples label Jan 23, 2025

revert sample

3348ad5

github-actions bot removed the category: samples label Jan 23, 2025

as-suvorov assigned ilya-lavrenov and Wovchena Jan 23, 2025

as-suvorov requested a review from Wovchena January 23, 2025 10:08

as-suvorov marked this pull request as ready for review January 23, 2025 10:09

as-suvorov removed do_not_merge do_not_review labels Jan 23, 2025

ilya-lavrenov approved these changes Jan 23, 2025

src/cpp/src/sampler.cpp (outdated, resolved)

src/cpp/src/sampler.cpp (outdated, resolved)

src/cpp/src/whisper/models/decoder.cpp (resolved)

as-suvorov added 3 commits January 23, 2025 14:20

Move whisper utils

d63e700

Add get_max_new_tokens for sequence group

6411b17

Use sg get_max_new_tokens

f9cb461

Wovchena approved these changes Jan 24, 2025

Wovchena added this pull request to the merge queue Jan 24, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 24, 2025

ilya-lavrenov added this pull request to the merge queue Jan 24, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 24, 2025

ilya-lavrenov added this pull request to the merge queue Jan 24, 2025

Merge remote-tracking branch 'upstream/master' into as/sampler_for_whisper

f7e2044

github-merge-queue bot removed this pull request from the merge queue due to a conflict with the base branch Jan 24, 2025

Merge remote-tracking branch 'upstream/master' into as/sampler_for_whisper

267fc15

as-suvorov enabled auto-merge January 24, 2025 13:09

as-suvorov added this pull request to the merge queue Jan 24, 2025

Merged via the queue into openvinotoolkit:master with commit 42b16e5 Jan 24, 2025
60 checks passed
