LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer Paper • 2412.13871 • Published Dec 18, 2024 • 18
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation Paper • 2410.01680 • Published Oct 2, 2024 • 34
AM-RADIO: Agglomerative Model -- Reduce All Domains Into One Paper • 2312.06709 • Published Dec 10, 2023 • 1
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects Paper • 2312.08344 • Published Dec 13, 2023 • 13
HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions Paper • 2308.01477 • Published Aug 2, 2023 • 12
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models Paper • 2307.06925 • Published Jul 13, 2023 • 11
nvidia/stt_en_fastconformer_transducer_xlarge Automatic Speech Recognition • Updated 15 days ago • 40 • 24
nvidia/stt_ua_fastconformer_hybrid_large_pc Automatic Speech Recognition • Updated 24 days ago • 232 • 3
nvidia/stt_en_fastconformer_transducer_large Automatic Speech Recognition • Updated 15 days ago • 1.47k • 7
Bridging the Domain Gap for Stance Detection for the Zulu language Paper • 2205.03153 • Published May 6, 2022
RumourEval 2019: Determining Rumour Veracity and Support for Rumours Paper • 1809.06683 • Published Sep 18, 2018