The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Paper • 2501.09653 • Published 1 day ago • 8
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Paper • 2501.09756 • Published 1 day ago • 13
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published 1 day ago • 12
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Paper • 2501.09755 • Published 1 day ago • 20
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Paper • 2501.08617 • Published 3 days ago • 7
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published 1 day ago • 15
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published 2 days ago • 6
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Paper • 2501.09433 • Published 2 days ago • 10
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published 4 days ago • 10
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 1 day ago • 29
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 1 day ago • 39
Counting Ability of Large Language Models and Impact of Tokenization Paper • 2410.19730 • Published Oct 25, 2024 • 11
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data Paper • 2410.18558 • Published Oct 24, 2024 • 19
HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots Paper • 2410.21229 • Published Oct 28, 2024 • 4
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 4 days ago • 39
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper • 2501.08994 • Published 3 days ago • 13
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents Paper • 2501.08828 • Published 3 days ago • 24
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities Paper • 2501.08983 • Published 3 days ago • 16
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion Paper • 2501.09019 • Published 3 days ago • 10
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Paper • 2501.08809 • Published 3 days ago • 9