Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Paper • 2409.11355 • Published Sep 17, 2024 • 29
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects Paper • 2403.16428 • Published Mar 25, 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Paper • 2409.11355 • Published Sep 17, 2024 • 29
Point2Vec for Self-Supervised Representation Learning on Point Clouds Paper • 2303.16570 • Published Mar 29, 2023
InstrumentGen: Generating Sample-Based Musical Instruments From Text Paper • 2311.04339 • Published Nov 7, 2023
Distortion Audio Effects: Learning How to Recover the Clean Signal Paper • 2202.01664 • Published Feb 3, 2022
Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models Paper • 2407.15641 • Published Jul 22, 2024
DSP-informed bandwidth extension using locally-conditioned excitation and linear time-varying filter subnetworks Paper • 2407.15624 • Published Jul 22, 2024
Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech Paper • 2210.13397 • Published Oct 24, 2022
Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project Paper • 2309.15869 • Published Sep 26, 2023
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Paper • 2406.19999 • Published Jun 28, 2024 • 3
Real-time Speech Summarization for Medical Conversations Paper • 2406.15888 • Published Jun 22, 2024 • 1
VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain Paper • 2404.05659 • Published Apr 8, 2024 • 2
Cascaded Span Extraction and Response Generation for Document-Grounded Dialog Paper • 2106.07275 • Published Jun 14, 2021
ClimateGPT: Towards AI Synthesizing Interdisciplinary Research on Climate Change Paper • 2401.09646 • Published Jan 17, 2024 • 17
On Sampling-Based Training Criteria for Neural Language Modeling Paper • 2104.10507 • Published Apr 21, 2021
Investigation on Data Adaptation Techniques for Neural Named Entity Recognition Paper • 2110.05892 • Published Oct 12, 2021
Adapting Document-Grounded Dialog Systems to Spoken Conversations using Data Augmentation and a Noisy Channel Model Paper • 2112.08844 • Published Dec 16, 2021
Does Joint Training Really Help Cascaded Speech Translation? Paper • 2210.13700 • Published Oct 24, 2022