Skip to content

Commit

Permalink
Update fuji.py
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 authored Nov 6, 2024
1 parent 27e92a3 commit a0142c1
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion axlearn/experiments/text/gpt/fuji.py
Original file line number Diff line number Diff line change
Expand Up @@ -300,7 +300,8 @@ def get_trainer_kwargs(
trainer_kwargs = dict(
model_kwargs=dict(
num_layers=126,
hidden_dim=53248,
hidden_dim=16384,
ffn_dim=53248,
num_heads=128,
# No GQA support in V1 models, so num_kv_heads is the same as num_heads.
num_kv_heads=None if version == Version.V1 else 8,
Expand Down

0 comments on commit a0142c1

Please sign in to comment.