CRAFT: Extracting and Tuning Cultural Instructions from the Wild Paper • 2405.03138 • Published May 6, 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 30
AudioBench: A Universal Benchmark for Audio Large Language Models Paper • 2406.16020 • Published Jun 23, 2024
Evaluating Word Embedding Models: Methods and Experimental Results Paper • 1901.09785 • Published Jan 28, 2019
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs Paper • 2412.11699 • Published 20 days ago
Chimera: Improving Generalist Model with Domain-Specific Experts Paper • 2412.05983 • Published 28 days ago • 9
Knowledge Graph Embedding with 3D Compound Geometric Transformations Paper • 2304.00378 • Published Apr 1, 2023
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning Paper • 2309.04766 • Published Sep 9, 2023
Instructive Dialogue Summarization with Query Aggregations Paper • 2310.10981 • Published Oct 17, 2023
Just Rank: Rethinking Evaluation with Word and Sentence Similarities Paper • 2203.02679 • Published Mar 5, 2022