Skip to content

Actions: huggingface/nanotron

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,982 workflow runs
1,982 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

fp8
Run non-FA2-related unit tests #722: Pull request #266 synchronize by xrsrke
January 14, 2025 13:52 8m 2s xrsrke/fp8_for_nanotron
January 14, 2025 13:52 8m 2s
fix test_base_model
Secret Leaks #133: Commit 3dde0af pushed by xrsrke
January 14, 2025 13:52 17s xrsrke/fp8_for_nanotron
January 14, 2025 13:52 17s
fp8
Run FA2-related unit tests #721: Pull request #266 synchronize by xrsrke
January 14, 2025 11:11 7m 10s xrsrke/fp8_for_nanotron
January 14, 2025 11:11 7m 10s
fp8
Code Quality #624: Pull request #266 synchronize by xrsrke
January 14, 2025 11:11 16s xrsrke/fp8_for_nanotron
January 14, 2025 11:11 16s
fp8
Run non-FA2-related unit tests #721: Pull request #266 synchronize by xrsrke
January 14, 2025 11:11 8m 14s xrsrke/fp8_for_nanotron
January 14, 2025 11:11 8m 14s
fix bias[None, :] in tp's functional
Secret Leaks #132: Commit f8c40ad pushed by xrsrke
January 14, 2025 11:11 17s xrsrke/fp8_for_nanotron
January 14, 2025 11:11 17s
fp8
Code Quality #623: Pull request #266 synchronize by xrsrke
January 13, 2025 12:34 20s xrsrke/fp8_for_nanotron
January 13, 2025 12:34 20s
fp8
Run non-FA2-related unit tests #720: Pull request #266 synchronize by xrsrke
January 13, 2025 12:34 7m 53s xrsrke/fp8_for_nanotron
January 13, 2025 12:34 7m 53s
fp8
Run FA2-related unit tests #720: Pull request #266 synchronize by xrsrke
January 13, 2025 12:34 6m 37s xrsrke/fp8_for_nanotron
January 13, 2025 12:34 6m 37s
add tp_recompute_allgather to column linear
Secret Leaks #131: Commit 21b2408 pushed by xrsrke
January 13, 2025 12:34 17s xrsrke/fp8_for_nanotron
January 13, 2025 12:34 17s
fp8
Code Quality #622: Pull request #266 synchronize by xrsrke
January 11, 2025 11:56 20s xrsrke/fp8_for_nanotron
January 11, 2025 11:56 20s
fp8
Run non-FA2-related unit tests #719: Pull request #266 synchronize by xrsrke
January 11, 2025 11:56 7m 48s xrsrke/fp8_for_nanotron
January 11, 2025 11:56 7m 48s
fp8
Run FA2-related unit tests #719: Pull request #266 synchronize by xrsrke
January 11, 2025 11:56 10m 25s xrsrke/fp8_for_nanotron
January 11, 2025 11:56 10m 25s
Merge branch 'main' into xrsrke/fp8_for_nanotron
Secret Leaks #130: Commit 9a99ab6 pushed by xrsrke
January 11, 2025 11:56 15s xrsrke/fp8_for_nanotron
January 11, 2025 11:56 15s
add
Secret Leaks #129: Commit ebea115 pushed by xrsrke
January 11, 2025 11:54 15s xrsrke/fp8_for_nanotron
January 11, 2025 11:54 15s
clean up
Secret Leaks #128: Commit e8b114b pushed by xrsrke
January 10, 2025 11:46 17s xrsrke/fp8_for_nanotron
January 10, 2025 11:46 17s
remove ablated fp8 config, and uncessary files/code
Secret Leaks #127: Commit a3a13ce pushed by xrsrke
January 9, 2025 15:27 21s xrsrke/fp8_for_nanotron
January 9, 2025 15:27 21s
pp
Secret Leaks #125: Commit 67c5ebb pushed by NouamaneTazi
December 27, 2024 16:39 17s nouamane/bench2
December 27, 2024 16:39 17s
fix grad_clipping for fp8
Secret Leaks #124: Commit 4723335 pushed by xrsrke
December 19, 2024 13:46 19s xrsrke/fp8_for_nanotron
December 19, 2024 13:46 19s
fix nan in fwd pass
Secret Leaks #123: Commit b440408 pushed by xrsrke
December 18, 2024 17:33 16s xrsrke/fp8_for_nanotron
December 18, 2024 17:33 16s
fix datastages
Secret Leaks #122: Commit 4e075ab pushed by NouamaneTazi
December 13, 2024 20:04 17s nouamane/bench2
December 13, 2024 20:04 17s
stress test
Secret Leaks #121: Commit 6ac7d73 pushed by NouamaneTazi
December 13, 2024 18:45 16s nouamane/bench2
December 13, 2024 18:45 16s
[Feature] Support resume ZeRO1 in a new data parallelism size
Run non-FA2-related unit tests #718: Pull request #263 opened by xrsrke
December 10, 2024 17:10 8m 59s xrsrke/fix_zero1_resume
December 10, 2024 17:10 8m 59s
[Feature] Support resume ZeRO1 in a new data parallelism size
Run FA2-related unit tests #718: Pull request #263 opened by xrsrke
December 10, 2024 17:10 3m 20s xrsrke/fix_zero1_resume
December 10, 2024 17:10 3m 20s