Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published 15 days ago • 7
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 12
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9, 2024 • 54