-
Notifications
You must be signed in to change notification settings - Fork 226
Pull requests: bigcode-project/bigcode-evaluation-harness
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
use tokenizer.chat_template by default for instruction type tasks
#301
opened Jan 25, 2025 by
TK-21st
Loading…
[Pytest] Fix bad import to use relative instead @ module_test
#298
opened Jan 11, 2025 by
ggcr
Loading…
Fix the bugs in the ds1000 sample bash script; Fix typos
#295
opened Dec 10, 2024 by
gameofby
Loading…
Support multiple datasets from MBPP; Fix missing commas in python list; Fix doc typos;
#291
opened Dec 4, 2024 by
gameofby
Loading…
Add a new benchmark ENAMEL for evaluating the efficiency of LLM-generated code
#260
opened Jul 22, 2024 by
q-rz
Loading…
fix: Multiple-E dataset fix go_test.go path for test execution
#225
opened Apr 20, 2024 by
hitesh-1997
Loading…
remove pad tokens added by the accelerator.pad_across_processes
#216
opened Apr 13, 2024 by
IQ17
Loading…
fix apps evaluate error: local variable 'level' referenced before assignment
#206
opened Mar 10, 2024 by
koking0
Loading…
Add support for Ollama, Palm, Claude-2, Cohere, Replicate, Llama2 CodeLlama (100+LLMs) [LiteLLM]
#160
opened Nov 9, 2023 by
ishaan-jaff
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.