arxiv:2410.10563
Dongfu Jiang
DongfuJiang
AI & ML interests
NLP, common sense reasoning
Recent Activity
updated
a model
about 9 hours ago
DongfuJiang/prm_version3_full_hf
updated
a Space
about 12 hours ago
TIGER-Lab/GenAI-Arena
upvoted
a
paper
about 20 hours ago
Organizations
Papers
10
models
20
DongfuJiang/prm_version3_full_hf
Text Generation
•
Updated
DongfuJiang/prm_version3_subsample_no_ref_hf
Text Generation
•
Updated
DongfuJiang/prm_version3_subsample_hf
Text Generation
•
Updated
•
4
DongfuJiang/prm_version2_subsample_no_ref_hf
Text Generation
•
Updated
•
49
DongfuJiang/Qwen2.5-0.5B-Instruct
Text Generation
•
Updated
•
151
DongfuJiang/prm_version2_subsample_hf
Text Generation
•
Updated
•
1.22k
DongfuJiang/prm_version2_hf
Updated
DongfuJiang/PairRM-V2-phi-3-4k-mini-all
Updated
•
5
DongfuJiang/vapo_lora_all_data_iter_2
Updated
•
6
DongfuJiang/vapo_lora_all_data_iter_1
Updated
•
7
datasets
11
DongfuJiang/PRM_SFT
Viewer
•
Updated
•
3.26M
•
62
DongfuJiang/PRM_prepared
Viewer
•
Updated
•
25k
•
36
DongfuJiang/PRM_train
Viewer
•
Updated
•
25.2k
•
104
DongfuJiang/PRM_eval
Viewer
•
Updated
•
3.54k
•
18
DongfuJiang/zeroeval
Viewer
•
Updated
•
13.5k
•
99
DongfuJiang/MATH-500
Viewer
•
Updated
•
500
•
23
DongfuJiang/simpo_v2_ultrafeedback
Viewer
•
Updated
•
59.9k
•
36
DongfuJiang/VAPO
Viewer
•
Updated
•
72.5k
•
34
DongfuJiang/PairRM-data
Viewer
•
Updated
•
586k
•
33
DongfuJiang/WildFeedback
Viewer
•
Updated
•
26.5k
•
34