Skip to content

ntunlp/Critical-Review-of-LLM-Eval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Critical-Review-of-LLM-Eval

Code and data of the paper A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations published at the Proceedings of EMNLP 2024.

If you find this work useful, please cite our paper:

@inproceedings{laskar-etal-2024-systematic,
    title = "A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations",
    author = "Laskar, Md Tahmid Rahman  and
      Alqahtani, Sawsan  and
      Bari, M Saiful  and
      Rahman, Mizanur  and
      Khan, Mohammad Abdullah Matin  and
      Khan, Haidar  and
      Jahan, Israt  and
      Bhuiyan, Amran  and
      Tan, Chee Wei  and
      Parvez, Md Rizwan  and
      Hoque, Enamul  and
      Joty, Shafiq  and
      Huang, Jimmy",
    editor = "Al-Onaizan, Yaser  and
      Bansal, Mohit  and
      Chen, Yun-Nung",
    booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.emnlp-main.764/",
    doi = "10.18653/v1/2024.emnlp-main.764",
    pages = "13785--13816",
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages