Code and data of the paper A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations published at the Proceedings of EMNLP 2024.
If you find this work useful, please cite our paper:
@inproceedings{laskar-etal-2024-systematic,
title = "A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations",
author = "Laskar, Md Tahmid Rahman and
Alqahtani, Sawsan and
Bari, M Saiful and
Rahman, Mizanur and
Khan, Mohammad Abdullah Matin and
Khan, Haidar and
Jahan, Israt and
Bhuiyan, Amran and
Tan, Chee Wei and
Parvez, Md Rizwan and
Hoque, Enamul and
Joty, Shafiq and
Huang, Jimmy",
editor = "Al-Onaizan, Yaser and
Bansal, Mohit and
Chen, Yun-Nung",
booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
month = nov,
year = "2024",
address = "Miami, Florida, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.emnlp-main.764/",
doi = "10.18653/v1/2024.emnlp-main.764",
pages = "13785--13816",
}