arxiv:2311.08290
Corrado
NicholasCorrado
AI & ML interests
Reinforcement learning
Organizations
None yet
Papers
3
Models
60
NicholasCorrado/mistral-7b-ift • Text Generation • 35
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.1 • Text Generation • 16
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01 • Text Generation • 18
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e • Text Generation • 18
NicholasCorrado/zephyr-7b-uf-rc-small-dpo • Text Generation • 21
NicholasCorrado/test
NicholasCorrado/zephyr-7b-uf-dpo-2e • Text Generation • 17
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e • Text Generation • 21
NicholasCorrado/zephyr-7b-uf-rlced-conifer-1e2e-group-dpo-2e • Text Generation • 21
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e • Text Generation • 17
Datasets
None public yet