Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? Paper • 2406.12822 • Published Jun 18, 2024