1. I have searched related issues but cannot get the expected help.
2. The bug has not been fixed in the latest version.
3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
This gives the error: AttributeError: 'NoneType' object has no attribute 'dtype'.
By the way I'm using dummy weights like: python3 -m sglang.launch_server --model my_path/deepseek_v3/DeepSeek-V3 --tp 1 --trust-remote-code --port 30000 --load-format dummy
Does this mean the decoding phase cannot run with dummy weights?
Suppose I only have one GPU, which cannot hold the full Hugging Face *.safetensors weights of such a large model. What can I do to run the decoding phase on one GPU?
When MLA is enabled, dummy weights cannot be used, because the weights go through a post-processing step after they are loaded.
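The failure mode described above can be sketched in a few lines. This is a toy illustration, not sglang's code: `ToyTensor`, `ToyAttention`, `post_load`, and `run_decode` are hypothetical names, and the sketch assumes the dummy load path never runs the step that fills in `w_kc`.

```python
from dataclasses import dataclass


@dataclass
class ToyTensor:
    """Hypothetical stand-in for a tensor: just a dtype and some values."""
    dtype: str
    values: list


class ToyAttention:
    """Hypothetical attention layer; only the attribute names echo the issue."""

    def __init__(self):
        self.kv_b_proj = None
        # Derived weight stays None until a post-load step fills it in.
        self.w_kc = None

    def load_weights(self, tensor):
        self.kv_b_proj = tensor

    def post_load(self):
        # Assumed post-processing: derive w_kc from the already loaded weight.
        self.w_kc = ToyTensor(self.kv_b_proj.dtype, list(self.kv_b_proj.values))

    def forward_absorb(self):
        # Reads w_kc.dtype, so it fails if post_load() never ran.
        return self.w_kc.dtype


def run_decode(skip_post_load):
    layer = ToyAttention()
    layer.load_weights(ToyTensor("bfloat16", [0.1, 0.2]))
    if not skip_post_load:
        layer.post_load()
    try:
        return layer.forward_absorb()
    except AttributeError as exc:
        return f"AttributeError: {exc}"
```

Calling `run_decode(skip_post_load=True)` returns the same kind of AttributeError message reported in this issue, while `run_decode(skip_post_load=False)` returns the dtype.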
It's not possible to run decoding on 1 GPU even if you use dummy weights. Dummy weights only mean that the parameter values are randomly initialized; the number of parameters is not reduced.
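A rough back-of-the-envelope check of that point, as a sketch under stated assumptions: the total parameter count (671B) is taken from the public DeepSeek-V3 model card, and the 80 GiB GPU capacity is an assumed figure, not something stated in this thread.

```python
def weight_memory_gib(num_params, bytes_per_param):
    """Memory needed just to hold the weights, in GiB."""
    return num_params * bytes_per_param / 1024**3


NUM_PARAMS = 671e9       # assumption: total parameters per the public model card
GPU_MEMORY_GIB = 80      # assumption: capacity of a single GPU

fp8_gib = weight_memory_gib(NUM_PARAMS, 1)   # 1 byte per parameter
bf16_gib = weight_memory_gib(NUM_PARAMS, 2)  # 2 bytes per parameter

print(f"FP8 weights:  {fp8_gib:.0f} GiB")
print(f"BF16 weights: {bf16_gib:.0f} GiB")
print(f"Fits on one GPU: {fp8_gib <= GPU_MEMORY_GIB}")
```

Random values take exactly as much memory as real ones, so under these assumptions the weights alone exceed a single GPU by a wide margin regardless of the load format.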
I see. Previously I reduced the number of layers to 2 so that I could use dummy weights on 1 GPU. But it seems MLA indeed cannot use dummy weights to run the decoding phase. Thanks.
Checklist
Describe the bug
In DeepSeek-V3 (V2) attention, decode mode uses forward_absorb(). However, self.w_kc = None is set in sglang/python/sglang/srt/models/deepseek_v2.py (line 498 in 9dec582).
As a result, the None value has no dtype when it is accessed in sglang/python/sglang/srt/models/deepseek_v2.py (line 579 in 9dec582).
This gives the error: AttributeError: 'NoneType' object has no attribute 'dtype'.
By the way, I'm using dummy weights like:
python3 -m sglang.launch_server --model my_path/deepseek_v3/DeepSeek-V3 --tp 1 --trust-remote-code --port 30000 --load-format dummy
Reproduction
python3 -m sglang.launch_server --model my_path/deepseek_v3/DeepSeek-V3 --tp 1 --trust-remote-code --port 30000 --load-format dummy
Environment
Docker provided:
docker pull lmsysorg/sglang:latest