LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 13 days ago • 105
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published 24 days ago • 64
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation Paper • 2410.23090 • Published 29 days ago • 53
InternVL 2.0 Collection Expanding Performance Boundaries of Open-Source MLLM • 18 items • Updated 2 days ago • 81
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation Paper • 2410.09584 • Published Oct 12 • 45