CodeDPO

community

https://jdf-prog.github.io/

DongfuJiang

jdf-prog

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

DongfuJiang updated a dataset 1 day ago

CodeDPO/rl_dataset_llama3_instruct_8b_20241230_human_eval_format

WyettZ updated a dataset 5 days ago

CodeDPO/rl_dataset_llama3_instruct_8b_20241230

WyettZ updated a dataset 5 days ago

CodeDPO/rl_dataset_qwen_coder_2.5_7b_20241230

View all activity

CodeDPO's activity

DongfuJiang

updated a dataset 1 day ago

CodeDPO/rl_dataset_llama3_instruct_8b_20241230_human_eval_format

Viewer • Updated 1 day ago • 284k • 1

WyettZ

updated 2 datasets 5 days ago

CodeDPO/rl_dataset_llama3_instruct_8b_20241230

Viewer • Updated 5 days ago • 1.42M • 42

CodeDPO/rl_dataset_qwen_coder_2.5_7b_20241230

Viewer • Updated 5 days ago • 1.51M • 4

WyettZ

updated a dataset 11 days ago

CodeDPO/rl_dataset_20241225

Viewer • Updated 11 days ago • 1.98M • 13

wenhu

authored a paper 26 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 29 days ago • 46

WyettZ

updated a dataset 27 days ago

CodeDPO/codedpo_20241208

Viewer • Updated 27 days ago • 89.4k • 43

WyettZ

updated a dataset 30 days ago

CodeDPO/codedpo_ground_truth_20241206_qwen_coder_2.5_32b

Viewer • Updated 30 days ago • 91.9k • 27

wenhu

authored a paper about 1 month ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 26

wenhu

updated a dataset about 1 month ago

CodeDPO/rl_dataset_20241201

Viewer • Updated Dec 1, 2024 • 486k • 30

WyettZ

updated 3 datasets about 2 months ago

wenhu

authored a paper 3 months ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30

DongfuJiang

authored a paper 3 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 38

wenhu

authored a paper 4 months ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 28

wenhu

authored 2 papers 6 months ago

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25, 2024 • 22

MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Paper • 2406.15252 • Published Jun 21, 2024 • 14

DongfuJiang

authored a paper 6 months ago

MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Paper • 2406.15252 • Published Jun 21, 2024 • 14

wenhu

authored a paper 7 months ago

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Paper • 2406.11069 • Published Jun 16, 2024 • 13

DongfuJiang

authored a paper 7 months ago

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Paper • 2406.11069 • Published Jun 16, 2024 • 13

AI & ML interests

Recent Activity

Team members 4

CodeDPO's activity