Skip to content

Add additional arguments to TransformerLayer subclasses #7377

Add additional arguments to TransformerLayer subclasses

Add additional arguments to TransformerLayer subclasses #7377

L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Pre-LN  /  main

succeeded Jan 9, 2025 in 1m 0s