Related work: Prompt lookup decoding #45
https://github.com/apoorvumang/prompt-lookup-decoding

This method was recently merged into Hugging Face transformers and also uses n-grams (found in the input prompt) to accelerate decoding.

Comments
Interesting. Could you point to the merged PR? Does it support batching? This method has a similar idea (copy from the input, no Jacobi): https://github.com/alipay/PainlessInferenceAcceleration
Here's the PR: huggingface/transformers#27775. From a cursory glance at the PR, it seems like it supports batching.
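For context, here is a rough usage sketch of the merged feature, assuming a transformers release that includes that PR. The model choice, prompt, and token counts are arbitrary placeholders, and `prompt_lookup_num_tokens` is my understanding of the knob the PR adds to `generate()` (worth double-checking against the PR itself):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model choice is an arbitrary placeholder; any causal LM should do.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Repetitive prompts are where prompt lookup shines: draft tokens are
# copied from earlier n-gram matches in the input itself.
prompt = "def hello():\n    print('hello')\n\ndef hello_world():\n    print('hello"
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(
    **inputs,
    max_new_tokens=40,
    prompt_lookup_num_tokens=10,  # enables prompt lookup decoding (per the PR)
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```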
I have also noticed these two methods.
Lookahead decoding takes its n-grams from prior lookahead decoding steps (the Jacobi trajectories); prompt lookup decoding takes its n-grams from the prompt.
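To make the copy-from-input idea concrete, here is a minimal sketch of a prompt-lookup candidate step. The function name, signature, and back-off strategy are illustrative, not taken from either codebase:

```python
import torch

def find_candidate_tokens(input_ids: torch.Tensor,
                          max_ngram_size: int = 3,
                          num_pred_tokens: int = 10) -> torch.Tensor:
    """Match the trailing n-gram of input_ids against earlier positions in the
    sequence and return the tokens that followed the match as draft candidates."""
    seq_len = input_ids.size(0)
    # Prefer longer n-grams, backing off to shorter ones if nothing matches.
    for ngram_size in range(min(max_ngram_size, seq_len - 1), 0, -1):
        ngram = input_ids[-ngram_size:]
        # Scan earlier positions from most recent to oldest, excluding the
        # trailing n-gram itself.
        for start in range(seq_len - ngram_size - 1, -1, -1):
            if torch.equal(input_ids[start:start + ngram_size], ngram):
                end = start + ngram_size
                # Copy up to num_pred_tokens continuation tokens as the draft;
                # the target model then verifies them in a single forward pass.
                return input_ids[end:end + num_pred_tokens]
    # No match: return an empty draft and fall back to ordinary decoding.
    return input_ids.new_empty(0)
```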
It doesn't :/ huggingface/transformers#27775 (comment)
Interesting. As that comment also suggests, it seems like PLD could support batching in theory; it's just the current implementation that doesn't.
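For illustration, one way batching could work in principle, reusing the hypothetical `find_candidate_tokens` from the sketch above: run the lookup independently per row of a padded batch.

```python
import torch

def batched_candidates(input_ids: torch.Tensor,
                       attention_mask: torch.Tensor) -> list[torch.Tensor]:
    """Per-row prompt lookup over a padded batch.

    find_candidate_tokens is the single-sequence helper sketched earlier
    in this thread; each sequence proposes its own draft independently.
    """
    drafts = []
    for ids, mask in zip(input_ids, attention_mask):
        # Strip padding before matching so pad tokens never appear in a draft.
        drafts.append(find_candidate_tokens(ids[mask.bool()]))
    return drafts
```

The catch is that the drafts come out ragged (one per row, possibly empty), so the batched verification pass has to handle per-row drafts of different lengths, which is presumably why the initial implementation is batch-size-1 only.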
Lookahead decoding was also mentioned here: https://github.com/SafeAILab/EAGLE