Skip to content

Actions: JoelNiklaus/lighteval

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
207 workflow runs
207 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Merge branch 'llm-judge-fix' into dev
Scan Secret Leaks #177: Commit 012b94f pushed by JoelNiklaus
February 5, 2025 14:47 16s dev
dev
February 5, 2025 14:47 16s
Improved stability of litellm models for reasoning models.
Scan Secret Leaks #176: Commit 923f036 pushed by JoelNiklaus
February 5, 2025 14:46 14s improve-litellm-model
February 5, 2025 14:46 14s
enable together models and reasoning models as judges.
Scan Secret Leaks #175: Commit bfbd2e3 pushed by JoelNiklaus
February 5, 2025 14:43 17s llm-judge-fix
February 5, 2025 14:43 17s
Sync Math-verify (#535)
Build Documentation #10: Commit cb35bea pushed by JoelNiklaus
February 5, 2025 14:41 1m 51s main
February 5, 2025 14:41 1m 51s
Sync Math-verify (#535)
Scan Secret Leaks #174: Commit cb35bea pushed by JoelNiklaus
February 5, 2025 14:41 16s main
February 5, 2025 14:41 16s
Sync Math-verify (#535)
Quality #10: Commit cb35bea pushed by JoelNiklaus
February 5, 2025 14:41 2m 19s main
February 5, 2025 14:41 2m 19s
Sync Math-verify (#535)
Tests #10: Commit cb35bea pushed by JoelNiklaus
February 5, 2025 14:41 41m 24s main
February 5, 2025 14:41 41m 24s
Merge branch 'add_swiss_legal_evals' into dev
Scan Secret Leaks #173: Commit 594500c pushed by JoelNiklaus
February 5, 2025 13:25 16s dev
dev
February 5, 2025 13:25 16s
Added more judge models.
Scan Secret Leaks #172: Commit c62647e pushed by JoelNiklaus
February 5, 2025 13:24 22s add_swiss_legal_evals
February 5, 2025 13:24 22s
Merge branch 'add_swiss_legal_evals' into dev
Scan Secret Leaks #171: Commit 65c71fb pushed by JoelNiklaus
February 1, 2025 10:55 15s dev
dev
February 1, 2025 10:55 15s
Fixed judge setup.
Scan Secret Leaks #170: Commit 186a6c8 pushed by JoelNiklaus
February 1, 2025 10:54 17s add_swiss_legal_evals
February 1, 2025 10:54 17s
Merge branch 'add_swiss_legal_evals' into dev
Scan Secret Leaks #169: Commit 3fb93c9 pushed by JoelNiklaus
February 1, 2025 10:37 20s dev
dev
February 1, 2025 10:37 20s
Added additional judge prompt configurations.
Scan Secret Leaks #168: Commit e7f9a09 pushed by JoelNiklaus
February 1, 2025 10:37 14s add_swiss_legal_evals
February 1, 2025 10:37 14s
Fixed bug in comet score.
Scan Secret Leaks #167: Commit 866e770 pushed by JoelNiklaus
January 27, 2025 13:20 20s add_swiss_legal_evals
January 27, 2025 13:20 20s
Merge branch 'huggingface:main' into add_swiss_legal_evals
Scan Secret Leaks #166: Commit 306ee76 pushed by rolshoven
January 20, 2025 12:39 17s add_swiss_legal_evals
January 20, 2025 12:39 17s
Merge branch 'load-details' into dev
Scan Secret Leaks #165: Commit 76e867a pushed by JoelNiklaus
January 14, 2025 01:11 18s dev
dev
January 14, 2025 01:11 18s
Made loading details more robust against tensors being saved in the d…
Scan Secret Leaks #164: Commit 299b90c pushed by JoelNiklaus
January 14, 2025 01:10 21s load-details
January 14, 2025 01:10 21s
Merge branch 'load-details' into dev
Scan Secret Leaks #163: Commit 4f52fbb pushed by JoelNiklaus
January 13, 2025 19:31 19s dev
dev
January 13, 2025 19:31 19s
Made bulk loading easier by also allowing first timestamp more genera…
Scan Secret Leaks #162: Commit dae2d2b pushed by JoelNiklaus
January 13, 2025 19:31 16s load-details
January 13, 2025 19:31 16s
Merge branch 'add_swiss_legal_evals' into dev
Scan Secret Leaks #161: Commit 2f32d1e pushed by JoelNiklaus
January 13, 2025 19:07 19s dev
dev
January 13, 2025 19:07 19s
Moved unpack to the pipeline code.
Scan Secret Leaks #160: Commit cb6bfb4 pushed by JoelNiklaus
January 13, 2025 19:06 21s add_swiss_legal_evals
January 13, 2025 19:06 21s
Merge branch 'load-details' into dev
Scan Secret Leaks #159: Commit 426ee30 pushed by JoelNiklaus
January 13, 2025 19:04 17s dev
dev
January 13, 2025 19:04 17s
Unpacking predictions to fix issue with weirdly saved predictions.
Scan Secret Leaks #158: Commit 3a22d93 pushed by JoelNiklaus
January 13, 2025 19:02 21s load-details
January 13, 2025 19:02 21s
Merge branch 'add_swiss_legal_evals' into dev
Scan Secret Leaks #157: Commit eb38a2d pushed by JoelNiklaus
January 13, 2025 05:36 20s dev
dev
January 13, 2025 05:36 20s
Made comet score more robust.
Scan Secret Leaks #156: Commit 7f36065 pushed by JoelNiklaus
January 13, 2025 05:35 21s add_swiss_legal_evals
January 13, 2025 05:35 21s