-
self-generate/ds_chat_original_cn_mining_sandbox_debug_iter0-full_response_traceback
Viewer • Updated • 2 • 33 -
self-generate/ds_chat_original_cn_rl_oj_debug_iter0-full_response_traceback
Viewer • Updated • 5 • 32 -
self-generate/ds_chat_original_cn_rl_oj_debug_iter0-binarized
Viewer • Updated • 5 • 30 -
self-generate/ds_chat_original_cn_mining_sandbox_debug_iter0-binarized
Viewer • Updated • 2 • 31
self-generate-experiments
community
AI & ML interests
None defined yet.
Collections
4
models
None public yet
datasets
389
self-generate/ds_chat_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-binarized_all_pairs
Viewer
•
Updated
•
12.8k
self-generate/ds_chat_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-full_response_traceback
Viewer
•
Updated
•
3.31k
self-generate/ds_chat_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-binarized
Viewer
•
Updated
•
3.31k
self-generate/ds_coder_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-pos-binarized-reflection-scored
Viewer
•
Updated
•
3.66k
self-generate/ds_chat_original_cn_mining_oj_iter0-pos-binarized-reflection-scored
Viewer
•
Updated
•
4.02k
self-generate/ds_coder_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-binarized_all_pairs
Viewer
•
Updated
•
13.3k
self-generate/ds_coder_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-full_response_traceback
Viewer
•
Updated
•
4.03k
self-generate/ds_coder_pos_reflct_adamw_iter1_sppo_hard_new_cn_mining_oj_iter1-binarized
Viewer
•
Updated
•
4.03k
self-generate/ds_chat_original_cn_mining_oj_iter0-binarized_all_pairs
Viewer
•
Updated
•
17.4k
self-generate/ds_chat_original_cn_mining_oj_iter0-full_response_traceback
Viewer
•
Updated
•
4.66k
•
33