Submitted by akhaliq 63 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning · 7 authors
Submitted by akhaliq 19 Semantic-SAM: Segment and Recognize Anything at Any Granularity · 9 authors
Submitted by akhaliq 17 Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration · 6 authors
Submitted by akhaliq 6 Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features · 2 authors
Submitted by akhaliq 6 On decoder-only architecture for speech-to-text and large language model integration · 11 authors
Submitted by akhaliq 3 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement · 8 authors
Submitted by akhaliq 1 AnyTeleop: A General Vision-Based Dexterous Robot Arm-Hand Teleoperation System · 8 authors