Submitted by akhaliq 43 PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models · 8 authors 4
Submitted by akhaliq 26 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes · 7 authors
Submitted by akhaliq 14 Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk · 6 authors
Submitted by akhaliq 7 ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video · 3 authors