VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation Paper • 2411.13281 • Published Nov 20, 2024 • 17
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper • 2406.07476 • Published Jun 11, 2024 • 32
CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification Paper • 2405.00253 • Published Apr 30, 2024
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models Paper • 2405.00390 • Published May 1, 2024
Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models Paper • 2401.13298 • Published Jan 24, 2024
MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems Paper • 2404.09486 • Published Apr 15, 2024 • 1
Positional Artefacts Propagate Through Masked Language Model Embeddings Paper • 2011.04393 • Published Nov 9, 2020 • 1
Augmented Large Language Models with Parametric Knowledge Guiding Paper • 2305.04757 • Published May 8, 2023 • 2
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 41
GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse Paper • 2401.01523 • Published Jan 3, 2024 • 1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval Paper • 2302.02908 • Published Feb 6, 2023 • 1
WizardCoder: Empowering Code Large Language Models with Evol-Instruct Paper • 2306.08568 • Published Jun 14, 2023 • 28