Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support reponse_format: {"type": "json_object"} without any constrained schema #2899

Open
lhoestq opened this issue Jan 10, 2025 · 1 comment
Assignees

Comments

@lhoestq
Copy link
Member

lhoestq commented Jan 10, 2025

In other inference APIs, response_format={"type": "json_object"} restricts the model output to be a valid JSON object without enforcing a schema.

Right now this is not supported:

Failed to deserialize the JSON body into the target type: response_format: missing field `value` at line 1 column 168

I ended up with this error while using lotus-ai which uses the litellm library with response_format={ "type": "json_object" }

To reproduce:

from huggingface_hub import InferenceClient

c = InferenceClient("meta-llama/Llama-3.3-70B-Instruct")
c.chat_completion([{"role": "user", "content": "Give me a dummy json of a person"}], response_format={"type": "json_object"})

Original issue: huggingface/huggingface_hub#2744

@drbh
Copy link
Collaborator

drbh commented Jan 13, 2025

Hi @lhoestq I just took a look at how this may be implemented in TGI and it seems that outlines does not support arbitrary JSON so we would need to add different grammar backend that supports this feature.

I believe XGrammar supports this functionality, and an issues has been opened recently here: #2900. Its unclear to me at the moment what would need to change to support XGrammar but I will dive a bit deeper and share updates here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants