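The lmdeploy PyTorch engine already supports structured output, but it is much slower than TurboMind and its output latency is noticeably higher, especially for the first output. We therefore hope the TurboMind engine can support structured output soon. Since December, guided decoding has developed rapidly in the open-source community: outlines-dev/outlines, mlc-ai/xgrammar, and noamgat/lm-format-enforcer all give good results, and mlc-ai/xgrammar is especially good. Thank you for your open-source work!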
We will look into mlc-ai/xgrammar.