Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

turbomind 支持结构化输出 #3047

Open
ZanePoe opened this issue Jan 17, 2025 · 1 comment
Open

turbomind 支持结构化输出 #3047

ZanePoe opened this issue Jan 17, 2025 · 1 comment

Comments

@ZanePoe
Copy link

ZanePoe commented Jan 17, 2025

Motivation

目前lmdeploy pytorch engine已经支持结构化输出,但pytorch engine 速度确实比turbomind慢很多,而且输出的延迟也比较大,特别是首次输出。所以还是希望turbomind engine能够更快的支持 结构化输出,12月以来开源社区guided decoding发展迅速,outlines-dev/outlines, mlc-ai/xgrammar,,noamgat/lm-format-enforcer,都有很好的效果,特别是mlc-ai/xgrammar非常好。感谢贵团队的开源工作!

Related resources

No response

Additional context

No response

@lvhan028
Copy link
Collaborator

我们调研下 mlc-ai/xgrammar

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants