Yurun Yuan
RyanYr
AI & ML interests: None yet
Organizations: None yet
Collections: 2
Models: 28
RyanYr/gemma-2-2b-it_CoT-it_SFT • Text Generation • 122
RyanYr/reward-judge_iter-sft-genRM_pilot-exp_iter3 • Text Generation • 8
RyanYr/reward-judge_iter-sft-genRM_pilot-exp_iter2 • Text Generation • 46
RyanYr/reward-judge_iter-sft-genRM_pilot-exp_iter1 • Text Generation • 18
RyanYr/reward-judge_iter-dpo-genRM_pilot-exp_iter3 • 22
RyanYr/reward-judge_iter-dpo-genRM_pilot-exp_iter2 • 29
RyanYr/reward-judge_iter-dpo-genRM_pilot-exp_iter1 • 28
RyanYr/reward-judge_SFT-genRM_pilot-exp • Text Generation • 317
RyanYr/reward-judge_pilot-exp • Text Classification • 228
RyanYr/last-letter-cat_genRM_iter1_pilot_experiment • 36
Datasets: 48
RyanYr/math_problems_diverse_completion_expanded_binlabeled • 782k • 3
RyanYr/math_problems_diverse_completion_expanded • 782k
RyanYr/math_problems_diverse_completion • 21.5k • 11
RyanYr/math_problems_data_collection_binlabeled • 21.5k • 1
RyanYr/math_problems_data_collection • 21.5k • 62
RyanYr/math_problems • 23k • 3
RyanYr/math_problems_diverse_completion_labeled • 14
RyanYr/PRM800k_trajectory_pair_output • 139
RyanYr/PRM800k_trajectory_pair • 139
RyanYr/PRM800k_completion-wise_labels • 15k • 21