Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Integration of TurboMind AWQ and GPTQ #2788

Open
2 tasks
zhyncs opened this issue Jan 8, 2025 · 1 comment
Open
2 tasks

[Feature] Integration of TurboMind AWQ and GPTQ #2788

zhyncs opened this issue Jan 8, 2025 · 1 comment
Labels
good first issue Good for newcomers help wanted Extra attention is needed

Comments

@zhyncs
Copy link
Member

zhyncs commented Jan 8, 2025

Checklist

Motivation

The AWQ and GPTQ of TurboMind should be among the best-performing open-source implementations currently available. We plan to integrate them into SGLang, and once the integration is complete, we can consider removing SGLang's dependency on vLLM's AWQ and GPTQ kernel.

During development, we can initially install the wheel https://github.com/InternLM/turbomind/releases/tag/v0.0.1 manually for verification and later add the TurboMind repo as a dependency in sgl-kernel.

ref
https://github.com/InternLM/turbomind

Related resources

No response

@zhyncs zhyncs added good first issue Good for newcomers help wanted Extra attention is needed labels Jan 8, 2025
@Chen-0210
Copy link

Hi, I’d like to work on this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants