arxiv:2311.08290
Corrado
NicholasCorrado
AI & ML interests
Reinforcement learning
Organizations
None yet
Papers
3
models
60
NicholasCorrado/mistral-7b-ift
Text Generation
•
Updated
•
26
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.1
Text Generation
•
Updated
•
7
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01
Text Generation
•
Updated
•
11
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e
Text Generation
•
Updated
•
3
NicholasCorrado/zephyr-7b-uf-rc-small-dpo
Text Generation
•
Updated
•
5
NicholasCorrado/test
Updated
NicholasCorrado/zephyr-7b-uf-dpo-2e
Text Generation
•
Updated
•
6
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e
Text Generation
•
Updated
•
5
NicholasCorrado/zephyr-7b-uf-rlced-conifer-1e2e-group-dpo-2e
Text Generation
•
Updated
•
11
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e
Text Generation
•
Updated
•
6
datasets
None public yet