Edit Models filters

Model Tree

OpenRLHF/Llama-3-8b-sft-mixture

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

3

Full-text search

Active filters: OpenRLHF/Llama-3-8b-sft-mixture

zkshan2002/RewardModel-uf-llama3-8B-OpenRLHF

Updated Oct 11, 2024 • 670

zkshan2002/PPO-uf-llama3-8B-OpenRLHF

Updated Oct 11, 2024 • 3

zkshan2002/DPO-uf-llama3-8B-OpenRLHF

Updated Oct 14, 2024 • 249