Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 8 days ago • 79
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning Paper • 2503.05379 • Published 7 days ago • 30
Forgetting Transformer: Softmax Attention with a Forget Gate Paper • 2503.02130 • Published 10 days ago • 27
Agent models: Internalizing Chain-of-Action Generation into Reasoning models Paper • 2503.06580 • Published 5 days ago • 14
Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning Paper • 2503.04973 • Published 8 days ago • 19
Automated Movie Generation via Multi-Agent CoT Planning Paper • 2503.07314 • Published 4 days ago • 36
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 9 days ago • 207
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence Paper • 2503.05037 • Published 7 days ago • 4
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System Paper • 2503.09600 • Published 1 day ago • 3
Cost-Optimal Grouped-Query Attention for Long-Context LLMs Paper • 2503.09579 • Published 1 day ago • 4
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published 2 days ago • 7
Quantization for OpenAI's Whisper Models: A Comparative Analysis Paper • 2503.09905 • Published 1 day ago • 5
WebGames: Challenging General-Purpose Web-Browsing AI Agents Paper • 2502.18356 • Published 17 days ago • 11
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 17 days ago • 68