Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Legacy flash remat fix
#943 opened Jan 23, 2025 by hanzhi713 Loading…
Add GKE A3 Ultra support
#940 opened Jan 22, 2025 by samos123 Draft
Flash Attention for Neuron
#939 opened Jan 21, 2025 by apoorvtintin Loading…
Adds mesh rule for a3-megagpu-8g.
#936 opened Jan 20, 2025 by markblee Loading…
Enable GCP Workload Monitoring
#932 opened Jan 17, 2025 by Perseus14 Draft
Enabled running Pallas Flash Attention on CPU.
#922 opened Jan 14, 2025 by ds-hwang Loading…
TRN2 Meshes and Configurations
#916 opened Jan 10, 2025 by apoorvtintin Loading…
Enable cudnn attention dropout
#913 opened Jan 8, 2025 by hanzhi713 Loading…
use "true" and "false" instead of 0 and 1
#890 opened Dec 12, 2024 by samos123 Loading…
Input batch sharding strategy BATCH
#884 opened Dec 11, 2024 by apoorvtintin Loading…
Docker: Upgrade Jax to 0.4.37
#880 opened Dec 10, 2024 by samos123 Draft
Add Goodput & Badput recording and monitoring support.
#783 opened Oct 25, 2024 by dipannita08 Loading…
5 tasks done
Use regex for parsing step_dir
#739 opened Oct 7, 2024 by nlusskin Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.