Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
oulianov authored Nov 20, 2024
1 parent 61c4162 commit 76df3fe
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -59,6 +59,8 @@ Each LLM has an ELO score based on its results.
| 13 | **together:meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo:vision** | 1269.84 |
| 14 | anthropic:claude-3-sonnet-20240229:text | 1029.31 |

*Note: In our experiments, Claude 3 Sonnet got a low score due to many refusal to fight and large API latencies.*

### Win rate matrix

![Win rate matrix](notebooks/result_matrix.png)
Expand Down

0 comments on commit 76df3fe

Please sign in to comment.