High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion Paper • 2502.12752 • Published 4 days ago • 2
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published 4 days ago • 33
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models Paper • 2502.14834 • Published 1 day ago • 20
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 1 day ago • 85
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 1 day ago • 85
TESS 2: A Large-Scale Generalist Diffusion Language Model Paper • 2502.13917 • Published 3 days ago • 4
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models Paper • 2502.12464 • Published 4 days ago • 26
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models Paper • 2502.10458 • Published 10 days ago • 27
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation Paper • 2502.13143 • Published 4 days ago • 28
Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published 6 days ago • 48
Detailed Human-Centric Text Description-Driven Large Scene Synthesis Paper • 2311.18654 • Published Nov 30, 2023 • 2
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Paper • 2502.08690 • Published 10 days ago • 39
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 9 days ago • 139
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published 12 days ago • 32
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper • 2502.03544 • Published 17 days ago • 42