yyqoni
/

rlhflow-llama-3-sft-8b-v2-bandit-ppo-60k

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

rlhflow-llama-3-sft-8b-v2-bandit-ppo-60k

1 contributor

History: 5 commits

yyqoni's picture

Update README.md

4724aa2 verified about 1 month ago