vansin's picture

vansin

vansin

·

vansin

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

Are Your LLMs Capable of Stable Reasoning?

upvoted a paper 15 days ago

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

commented a paper 24 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

View all activity

Organizations

vansin's activity

upvoted 2 papers 15 days ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 16 days ago • 91

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Paper • 2412.13018 • Published 16 days ago • 41

upvoted 3 collections 24 days ago

📑Trending Papers - September 9⃣️

10 items • Updated 9 days ago • 9

🏆 Leaderboards & Arenas

19 items • Updated 9 days ago • 7

🖼️ MLLMs

39 items • Updated 9 days ago • 12

upvoted a paper 24 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 27 days ago • 121

upvoted a paper 30 days ago

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Paper • 2407.20183 • Published Jul 29, 2024 • 41

upvoted a paper 2 months ago

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 59

upvoted 3 papers 3 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 53

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 41

Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts

Paper • 2409.13449 • Published Sep 20, 2024 • 10

upvoted a collection 4 months ago

InternLM2.5-MLC

InternLM Weights of MLC-LLM Collection ——https://huggingface.co/mlc-ai • 9 items • Updated Sep 4, 2024 • 1

upvoted a collection 5 months ago

InternLM2.5

14 items • Updated Sep 14, 2024 • 70