NoLiMa: Long-Context Evaluation Beyond Literal Matching • Paper • 2502.05167 • Published 9 days ago • 12
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment • Paper • 2410.05873 • Published Oct 8, 2024 • 3