OpenRLHF/Llama-3-8b-rm-mixture
Updated
•
4.66k
•
1
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
Updated
•
24
•
1
OpenRLHF/Llama-3-8b-rm-700k
Updated
•
2.28k
•
3
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
Updated
•
13
•
1
OpenRLHF/Llama-3-8b-iter-dpo-179k
Text Generation
•
Updated
•
23
OpenRLHF/Llama-3-8b-rlhf-100k
Text Generation
•
Updated
•
330
•
4
OpenRLHF/Llama-3-8b-sft-mixture
Text Generation
•
Updated
•
20.7k
•
1
OpenRLHF/Llama-2-7b-sft-model-ocra-500k
Text Generation
•
Updated
•
11
OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt
Updated
•
19
OpenRLHF/Llama-2-13b-sft-model-ocra-500k
Text Generation
•
Updated
•
113
•
1