Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 14 days ago • 18
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 14 days ago • 18
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 14 days ago • 18 • 4
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 14 days ago • 18 • 4
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Paper • 2412.01292 • Published Dec 2, 2024 • 12
Improving Long-Text Alignment for Text-to-Image Diffusion Models Paper • 2410.11817 • Published Oct 15, 2024 • 15
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces Paper • 2407.11895 • Published Jul 16, 2024 • 7
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces Paper • 2407.11895 • Published Jul 16, 2024 • 7 • 3
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding Paper • 2307.09267 • Published Jul 18, 2023
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition Paper • 2303.05309 • Published Mar 9, 2023
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers Paper • 2312.08168 • Published Dec 13, 2023
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion Paper • 2405.04883 • Published May 8, 2024