Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders Paper • 2412.09586 • Published 3 days ago • 5
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published 4 days ago • 35
Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation Paper • 2412.06016 • Published 7 days ago • 19
Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction Paper • 2412.06234 • Published 7 days ago • 17
StyleMaster: Stylize Your Video with Artistic Generation and Translation Paper • 2412.07744 • Published 5 days ago • 16
LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations Paper • 2412.08580 • Published 4 days ago • 38
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 5 days ago • 42
PanoDreamer: 3D Panorama Synthesis from a Single Image Paper • 2412.04827 • Published 10 days ago • 9
Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction Paper • 2412.04887 • Published 10 days ago • 14
Mind the Time: Temporally-Controlled Multi-Event Video Generation Paper • 2412.05263 • Published 9 days ago • 9
2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction Paper • 2412.03428 • Published 11 days ago • 8
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published 10 days ago • 44
Monet: Mixture of Monosemantic Experts for Transformers Paper • 2412.04139 • Published 10 days ago • 10
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 11 days ago • 40
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection Paper • 2412.04455 • Published 10 days ago • 33
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published 13 days ago • 39
Imagine360: Immersive 360 Video Generation from Perspective Anchor Paper • 2412.03552 • Published 11 days ago • 26