Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published 3 days ago • 34
Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published 3 days ago • 34
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published about 1 month ago • 82
Language Models are Symbolic Learners in Arithmetic Paper • 2410.15580 • Published Oct 21, 2024 • 7 • 2
KnowledgeMath: Knowledge-Intensive Math Word Problem Solving in Finance Domains Paper • 2311.09797 • Published Nov 16, 2023 • 1
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data Paper • 2311.09805 • Published Nov 16, 2023 • 3
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks Paper • 2311.09835 • Published Nov 16, 2023 • 11
Investigating Data Contamination in Modern Benchmarks for Large Language Models Paper • 2311.09783 • Published Nov 16, 2023 • 2