We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
transformers==4.44.0
No response
examples
Load a model that has 40000+ nn.Modules (etc. DeepSeekV3),then you will find it cost too much time on set_initialized_submodules.
This function is O(N^2) and should be optimized.
The text was updated successfully, but these errors were encountered:
Yep, it's definitely O(n^2). Let me see if I can whip something up!
Sorry, something went wrong.
Fixed by #35493
Successfully merging a pull request may close this issue.
System Info
transformers==4.44.0
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Load a model that has 40000+ nn.Modules (etc. DeepSeekV3),then you will find it cost too much time on set_initialized_submodules.
Expected behavior
This function is O(N^2) and should be optimized.
The text was updated successfully, but these errors were encountered: