-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add pytorch/training/gpu/2.3.1/transformers/4.48.0/py311/Dockerfile
#134
Conversation
This commit also contains some formatting improvements to better debug the `Dockerfile` such as indentation when a command is divided in multiple lines to know that it refers to the unindented command above; also set bash as the default shell, and fix `gcloud` CLI installation
Bump the `transformers` dependency to 4.48.0 to support the ModernBERT architecture, as well as bumping `diffusers` including new video and image generation pipelines, as well as a bunch of other features, improvements and bug fixes. Additionally, the `Dockerfile` formatting has been fixed.
pytorch/training/gpu/2.3.0/transformers/4.47.0/py311/Dockerfile
pytorch/training/gpu/2.3.0/transformers/4.48.0/py311/Dockerfile
pytorch/training/gpu/2.3.0/transformers/4.48.0/py311/Dockerfile
pytorch/training/gpu/2.3.0/transformers/4.47.1/py311/Dockerfile
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Whats the time benefit of uv
? Not sure if we should add another dependency for a few seconds faster build time of containers. Especially with the alias, e.g. adding custom dependencies in training jobs, what if they are not supported etc.
So it's mainly for both the |
Ok @philschmid after checking, apparently |
Lets remove it and maybe revisit in a few months. |
pytorch/training/gpu/2.3.0/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.q/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.q/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.1/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.1/transformers/4.47.1/py311/Dockerfile
pytorch/training/gpu/2.3.1/transformers/4.48.0/py311/Dockerfile
Description
This PR bumps the dependencies to release a new PyTorch DLC for training with improvements, support for newer model architectures, bug fixes and much more.
Additionally, besides the version bumps, this PR also includes the
gcloud
CLI and installshuggingface_hub
with thehf-transfer
utility for improvements on download/upload speed to the Hugging Face Hub.Note
This PR will enable the example on how to fine-tune PaliGemma 2 with TRL to be shipped within #133