A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions Paper • 2401.00536 • Published Dec 31, 2023 • 2
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 9 days ago • 130
ATHAR: A High-Quality and Diverse Dataset for Classical Arabic to English Translation Paper • 2407.19835 • Published Jul 29 • 21