MMLU-ProX : A Multilingual Benchmark for Advanced Large Language Model Evaluation
Contributors: Xuan, Weihao; Yang, Rui; Qi, Heli; Zeng, Qingcheng; Xiao, Yunze; Feng, Aosong; Liu, Dairui; Xing, Yun; Wang, Junjue; Gao, Fan; Lu, Jinghui; Jiang, Yuang; Li, Huitao; Li, Xin; Yu, Kunyu; Dong, Ruihai; Gu, Shangding; Li, Yuekang; Xie, Xiaofei; Juefei-Xu, Felix; Khomh, Foutse; Yoshie, Osamu; Chen, Qingyu; Teodoro, Douglas; Liu, Nan; Goebel, Randy; Ma, Lei; Marrese-Taylor, Edison; Lu, Shijian; Iwasawa, Yusuke; Matsuo, Yutaka; Li, Irene
Presented at: Suzhou (China), November 4-9, 2025
Published in: Christodoulopoulos, C., Chakraborty, T., Rose, C. & Peng, V. (Ed.), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, p. 1513-1532
Publisher: Kerrville, TX : Association for Computational Linguistics
Publication date: 2025-11
First online date: 2025-11
Citation (ISO format)
XUAN, Weihao et al. MMLU-ProX : A Multilingual Benchmark for Advanced Large Language Model Evaluation. In: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. Christodoulopoulos, C., Chakraborty, T., Rose, C. & Peng, V. (Ed.). Suzhou (China). Kerrville, TX : Association for Computational Linguistics, 2025. p. 1513–1532. doi: 10.18653/v1/2025.emnlp-main.79
Main files (1)
Proceedings chapter (Published version)
Identifiers
- PID : unige:191905
- DOI : 10.18653/v1/2025.emnlp-main.79
Additional URL for this publication: https://aclanthology.org/2025.emnlp-main.79/
ISBN: 979-8-89176-332-6
