arxiv:2501.04682
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Recent Activity
updated a dataset about 4 hours ago: Asap7772/elix_multexpert_preferences_gpt4o_pref
published a dataset about 4 hours ago: Asap7772/elix_multexpert_preferences_gpt4o_pref
updated a dataset about 4 hours ago: Asap7772/elix_multexpert_preferences_gpt-4o_pref_test
Models (18)
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_epoch0 • Text Generation • 10
Asap7772/prm_datamath-mc-full_objbce_lr1e-07_epoch0 • Text Generation • 1
Asap7772/prm_datamath-mc-full_objbce_lr1e-06_epoch0 • Text Generation • 8
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_epoch0 • Text Generation • 8
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_epoch0 • Text Generation • 9
Asap7772/prm_datamath-mc-full_objbce_lr5e-07_epoch0 • Text Generation • 1
Asap7772/prm_datamath-mc-full_objbce_lr0.0005_epoch0 • Text Generation • 5
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_checkpoint2400
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_checkpoint2400
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_checkpoint2400
Datasets (853)
Asap7772/elix_multexpert_preferences_gpt4o_pref • 533k
Asap7772/elix_multexpert_preferences_gpt-4o_pref_test • 267k
Asap7772/elix_multexpert_preferences_gpt-4o_pref_train • 267k
Asap7772/elix_multexpert_preferences • 533k
Asap7772/elix_multexpert_generations_flat • 10.7M • 8
Asap7772/elix_multexpert_generations_llama32_3b_fixed • 60.7k • 2
Asap7772/elix_multexpert_generations_altuser_llama32_3b_fixed • 60.7k • 3
Asap7772/elix_multexpert_generations_llama31_8b_fixed • 60.7k • 4
Asap7772/elix_multexpert_generations_altuser_llama31_8b_fixed • 60.7k • 2
Asap7772/elix_multexpert_generations_gemma2_9b_fixed • 60.7k • 1