Critical-Review-of-LLM-Eval

Code and data of the paper A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations published at the Proceedings of EMNLP 2024.

If you find this work useful, please cite our paper:

@inproceedings{laskar-etal-2024-systematic,
    title = "A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations",
    author = "Laskar, Md Tahmid Rahman  and
      Alqahtani, Sawsan  and
      Bari, M Saiful  and
      Rahman, Mizanur  and
      Khan, Mohammad Abdullah Matin  and
      Khan, Haidar  and
      Jahan, Israt  and
      Bhuiyan, Amran  and
      Tan, Chee Wei  and
      Parvez, Md Rizwan  and
      Hoque, Enamul  and
      Joty, Shafiq  and
      Huang, Jimmy",
    editor = "Al-Onaizan, Yaser  and
      Bansal, Mohit  and
      Chen, Yun-Nung",
    booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.emnlp-main.764/",
    doi = "10.18653/v1/2024.emnlp-main.764",
    pages = "13785--13816",
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
samsum_llm_evaluation		samsum_llm_evaluation
README.md		README.md
diversity_coverage.py		diversity_coverage.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Critical-Review-of-LLM-Eval

About

Releases

Packages

Contributors 2

Languages

ntunlp/Critical-Review-of-LLM-Eval

Folders and files

Latest commit

History

Repository files navigation

Critical-Review-of-LLM-Eval

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages