CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models Paper • 2405.13684 • Published May 22, 2024
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models Paper • 2409.10999 • Published Sep 17, 2024
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models Paper • 2412.13702 • Published 22 days ago
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation Paper • 2412.03304 • Published Dec 4, 2024 • 17
An Efficient Self-Supervised Cross-View Training For Sentence Embedding Paper • 2311.03228 • Published Nov 6, 2023 • 1
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 31
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published Oct 16, 2024 • 30
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models Paper • 2307.07889 • Published Jul 15, 2023 • 1
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models Paper • 2405.13684 • Published May 22, 2024
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models Paper • 2303.08896 • Published Mar 15, 2023 • 4
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization Paper • 2301.12307 • Published Jan 28, 2023 • 3