jizhongpeng's picture

jizhongpeng

jizhongpeng

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

deepseek-ai/deepseek-vl2

liked a model 4 days ago

Aria-UI/Aria-UI-base

liked a Space 4 days ago

Aria-UI/Aria-UI

View all activity

Organizations

jizhongpeng's activity

upvoted a paper about 1 month ago

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20 • 17

upvoted a paper 3 months ago

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

upvoted a collection 3 months ago

🏆 Leaderboards & Arenas

19 items • Updated 6 days ago • 7

upvoted a collection 4 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated 25 days ago • 182

upvoted 3 papers 4 months ago

K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences

Paper • 2408.14468 • Published Aug 26 • 35

Towards flexible perception with visual memory

Paper • 2408.08172 • Published Aug 15 • 20

FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting

Paper • 2408.11706 • Published Aug 21 • 6

upvoted 2 papers 5 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

Q-Ground: Image Quality Grounding with Large Multi-modality Models

Paper • 2407.17035 • Published Jul 24 • 1

upvoted a collection 5 months ago

Magpie-Qwen2 Datasets

Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Sep 14 • 10

upvoted a paper 5 months ago

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

Paper • 2407.15754 • Published Jul 22 • 19

upvoted a collection 6 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 8 days ago • 205

upvoted a paper 6 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5 • 52

upvoted a collection 6 months ago

InternVL2.0

Expanding Performance Boundaries of Open-Source MLLM • 15 items • Updated 2 days ago • 87

upvoted 3 papers 7 months ago

CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Paper • 2406.09356 • Published Jun 13 • 4

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12 • 24

A-Bench: Are LMMs Masters at Evaluating AI-generated Images?

Paper • 2406.03070 • Published Jun 5 • 2

upvoted 3 collections 7 months ago

MaPO

This collection includes the models and datasets as a part of the MaPO release. • 9 items • Updated Jun 12 • 5

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28 • 352

GLM-4

GLM-4 Open Models • 13 items • Updated Nov 27 • 115