MMMU

non-profit

https://mmmu-benchmark.github.io/

Activity Feed Request to join this org

AI & ML interests

Multimodal Model Evaluation

Recent Activity

wenhu authored a paper 1 day ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

zhangysk authored a paper 1 day ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

wren93 authored a paper 1 day ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

View all activity

MMMU's activity

wenhu

authored a paper 1 day ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published 4 days ago • 14

zhangysk

authored a paper 1 day ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published 4 days ago • 14

wren93

authored a paper 1 day ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published 4 days ago • 14

wenhu

authored a paper 6 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 7 days ago • 56

a43992899

authored 16 papers 7 days ago

Chinese Open Instruction Generalist: A Preliminary Release

Paper • 2304.07987 • Published Apr 17, 2023 • 2

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

Paper • 2306.17103 • Published Jun 29, 2023 • 1

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models

Paper • 2402.13109 • Published Feb 20, 2024

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning

Paper • 2403.18058 • Published Mar 26, 2024 • 4

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis

Paper • 2404.01204 • Published Apr 1, 2024

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Paper • 2404.04167 • Published Apr 5, 2024 • 14

MuPT: A Generative Symbolic Music Pretrained Transformer

Paper • 2404.06393 • Published Apr 9, 2024 • 16

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

Paper • 2404.18081 • Published Apr 28, 2024 • 2

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Paper • 2407.20962 • Published Jul 30, 2024

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

Paper • 2404.00610 • Published Mar 31, 2024

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 44

MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning

Paper • 2212.02508 • Published Dec 5, 2022

CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Paper • 2410.13267 • Published Oct 17, 2024 • 1

You Know What I'm Saying: Jailbreak Attack via Implicit Reference

Paper • 2410.03857 • Published Oct 4, 2024

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages

Paper • 2502.10362 • Published Feb 14 • 4