Dongfu Jiang's picture

Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

upvoted a paper about 2 hours ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

updated a Space about 5 hours ago

TIGER-Lab/GenAI-Arena

liked a model 1 day ago

NovaSky-AI/Sky-T1-32B-Preview

View all activity

Organizations

Papers 10

arxiv:2410.10563

arxiv:2406.15252

arxiv:2406.11069

arxiv:2406.04485

models 38

DongfuJiang/Qwen2-VL-VAE-7B-Instruct

Image-Text-to-Text • Updated Dec 17, 2024 • 26

DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae

Text2Text Generation • Updated Dec 17, 2024 • 7

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt

Text Generation • Updated Dec 9, 2024 • 46

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft

Text Generation • Updated Dec 9, 2024 • 48

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt

Text Generation • Updated Dec 9, 2024 • 47

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft

Text Generation • Updated Dec 9, 2024 • 46

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt

Text Generation • Updated Dec 7, 2024 • 47

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft

Updated Dec 7, 2024

DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf

Text Generation • Updated Dec 1, 2024 • 5 • 1

DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf

Text Generation • Updated Dec 1, 2024 • 48

datasets 12

DongfuJiang/PRM_SFT

Viewer • Updated Dec 1, 2024 • 4.01M • 40

DongfuJiang/zeroeval

Viewer • Updated Nov 27, 2024 • 13.5k • 57

DongfuJiang/PRM_eval

Viewer • Updated Nov 27, 2024 • 9.54k • 40

DongfuJiang/eval

Viewer • Updated Nov 27, 2024 • 6k • 49

DongfuJiang/PRM_prepared

Viewer • Updated Nov 26, 2024 • 39.9k • 43

DongfuJiang/PRM_train

Viewer • Updated Nov 25, 2024 • 32.7k • 40

DongfuJiang/MATH-500

Viewer • Updated Nov 6, 2024 • 500 • 77

DongfuJiang/simpo_v2_ultrafeedback

Viewer • Updated Aug 2, 2024 • 59.9k • 33

DongfuJiang/VAPO

Viewer • Updated Jul 31, 2024 • 72.5k • 36

DongfuJiang/PairRM-data

Viewer • Updated Jul 30, 2024 • 586k • 36