M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought Paper • 2405.16473 • Published May 26, 2024
Self-Constructed Context Decompilation with Fined-grained Alignment Enhancement Paper • 2406.17233 • Published Jun 25, 2024
A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain Text Classification Paper • 2304.09820 • Published Apr 18, 2023
Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding Paper • 2112.11953 • Published Dec 22, 2021
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models Paper • 2412.05939 • Published 26 days ago • 13
ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution Paper • 2408.15993 • Published Aug 28, 2024 • 7
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning Paper • 2306.00103 • Published May 31, 2023 • 1
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning Paper • 2206.08657 • Published Jun 17, 2022 • 2