8x22b seq len #7415

L2_Megatron_GPT_with_ResetLR_Pretraining_and_Resume_Training_TP2  /  main

succeeded Jan 10, 2025 in 4m 42s