Skip to content

Add additional arguments to TransformerLayer subclasses #7377

Add additional arguments to TransformerLayer subclasses

Add additional arguments to TransformerLayer subclasses #7377

L2_Megatron_UL2_Pretraining_and_Resume_Training_TP2  /  main

succeeded Jan 9, 2025 in 3m 11s