SongTonyLi/OpenELM-270M-SFT-D1_chosen-then-PPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 23 • 7
SongTonyLi/OpenELM-3B-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 24 • 7
SongTonyLi/OpenELM-3B-SFT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 24 • 99
SongTonyLi/OpenELM-3B-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 24 • 8
SongTonyLi/OpenELM-3B-CPT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 7
SongTonyLi/OpenELM-450M-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 372
SongTonyLi/OpenELM-450M-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 7
SongTonyLi/OpenELM-450M-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 6
SongTonyLi/OpenELM-450M-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 7
SongTonyLi/OpenELM-270M-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 7
SongTonyLi/OpenELM-270M-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 8
SongTonyLi/OpenELM-1_1B-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 7
SongTonyLi/OpenELM-1_1B-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 8
SongTonyLi/OpenELM-1_1B-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 8
SongTonyLi/OpenELM-1_1B-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 6
SongTonyLi/OpenELM-3B-CPT-D1_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 7
SongTonyLi/OpenELM-3B-CPT-D1_chosen-then-SFT-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 25 • 11
SongTonyLi/OpenELM-3B-SFT-D1_chosen-then-DPO_D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 26 • 11
SongTonyLi/OpenELM-3B-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 26 • 9
SongTonyLi/OpenELM-3B-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge Text Generation • Updated Sep 26 • 6