System Info
text-generation-inference/launcher/src/main.rs, lines 427 to 428 at 1028996
I think it would be best for this limit not to exist by default, rather than being 4, or at least for it to be something higher, like 16. Although client applications can detect that TGI is being used and encode the limit, doing so causes poorer behavior in autocomplete scenarios where more than 4 stop words are actually necessary. Yes, users of TGI can technically change this value, but many do not know to do so and will have lower-quality first experiences with whatever tools they are using.
All that said, I appreciate that this limit was originally defined by OpenAI, and following their patterns is something I generally support.
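The client-side workaround described above might look like the following sketch. Everything here is illustrative: the `SERVER_LIMITS` table and the `truncate_stop_sequences` helper are hypothetical, not part of any real client; the value 4 matches TGI's current default.

```python
# Sketch of the client-side workaround: when the backend is known to
# enforce a stop-sequence limit, silently truncate the stop list to fit.
# This is exactly the lower-quality behavior described above: autocomplete
# clients that need more than 4 stop words lose some of them.

# Hypothetical per-backend limits; 4 matches TGI's current default.
SERVER_LIMITS = {"tgi": 4, "openai": 4}


def truncate_stop_sequences(stop: list[str], backend: str) -> list[str]:
    """Drop stop sequences beyond the backend's limit, if it has one."""
    limit = SERVER_LIMITS.get(backend)
    if limit is None:
        # No known limit: pass the full stop list through untouched.
        return stop
    return stop[:limit]
```

With no configured limit (the proposed default), the full stop list would pass through and no truncation would be needed.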
For additional background: continuedev/continue#2380
Information
Tasks
Reproduction
Send a request with >4 stop tokens
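A minimal reproduction payload might look like the following sketch, assuming TGI's `/generate` endpoint and its `parameters.stop` field. The prompt and stop words are made up; the point is that there are five stop sequences, one over the default limit of 4, so TGI rejects the request with a validation error instead of generating.

```python
import json

# Request body for TGI's /generate endpoint with five stop sequences.
# With the default limit of 4 stop sequences, TGI rejects this request.
payload = {
    "inputs": "def fibonacci(n):",
    "parameters": {
        "max_new_tokens": 64,
        # Typical autocomplete stop words -- more than 4 are needed here.
        "stop": ["\ndef ", "\nclass ", "\n#", "\nif ", "\nprint("],
    },
}

body = json.dumps(payload)
```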
Expected behavior
Ideally, a request with more than 4 stop tokens does not throw an error unless the person running TGI has explicitly set a limit.
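The expected behavior can be sketched as a validation step where the limit is optional rather than defaulting to 4. The names and error message below are illustrative, not TGI's actual code:

```python
from typing import Optional


class ValidationError(Exception):
    """Stand-in for the server's request-validation error."""


def validate_stop_sequences(stop: list[str], limit: Optional[int]) -> None:
    """Reject the request only if an explicit limit is set and exceeded.

    With limit=None (the proposed default), any number of stop
    sequences is accepted.
    """
    if limit is not None and len(stop) > limit:
        raise ValidationError(
            f"`stop` supports up to {limit} stop sequences. Given: {len(stop)}"
        )


# With no configured limit, more than 4 stop words pass validation:
validate_stop_sequences(["a", "b", "c", "d", "e"], limit=None)
```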