DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 7 days ago • 17
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published 8 days ago • 21
Outcome-Refining Process Supervision for Code Generation Paper • 2412.15118 • Published 11 days ago • 16
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published 7 days ago • 25
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 7 days ago • 37
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 11 days ago • 78
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 7 days ago • 38
IDOL: Instant Photorealistic 3D Human Creation from a Single Image Paper • 2412.14963 • Published 11 days ago • 5
Sequence Matters: Harnessing Video Models in 3D Super-Resolution Paper • 2412.11525 • Published 14 days ago • 10
MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design Paper • 2412.14590 • Published 11 days ago • 12
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper • 2412.15322 • Published 11 days ago • 16
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up Paper • 2412.16112 • Published 10 days ago • 18
SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation Paper • 2412.13649 • Published 12 days ago • 18
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 10 days ago • 33
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Paper • 2412.17153 • Published 8 days ago • 32