Updated Home (markdown)
Updated Models (markdown)
Updated Dataset (markdown)
Updated Evaluation (markdown)
Updated _Sidebar (markdown)
Fix minor typos
Merge changes
Add more discussion of model table
Add discussion of how to run evaluation and description of APPs benchmark
Add description of pass@k