Submitted by ksshumab 48 Predictive Data Selection: The Data That Predicts Is the Data That Teaches · 8 authors 2
Submitted by lzq2021 30 DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking · 9 authors 4
Submitted by nicolas-dufour 25 How far can we go with ImageNet for Text-to-Image generation? · 5 authors 2
Submitted by autumncc 18 ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents · 7 authors 2
Submitted by akhaliq 11 Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids · 5 authors 2
Submitted by hturbe 10 Tell me why: Visual foundation models as self-explainable classifiers · 4 authors 2
Submitted by kamahori 10 LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation · 4 authors 2
Submitted by kamahori 7 TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval · 14 authors 2
Submitted by adaamko 7 LettuceDetect: A Hallucination Detection Framework for RAG Applications · 2 authors 2
Submitted by Yifan-Zhong 6 DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping · 7 authors 2
Submitted by BestWishYsh 4 MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing · 6 authors 2
Submitted by akhaliq 1 HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models · 8 authors 2