Lichang-Chen/ODIN-ppo-L230-best
Text Generation
Transformers
PyTorch
English
llama
ODIN
RLHF
PPO
text-generation-inference
Inference Endpoints
arxiv: 2402.07319
License: mit
1 contributor
History: 1 commit
Lichang-Chen, initial commit, 64fe8ba (verified), 8 months ago
.gitattributes, 1.52 kB, initial commit, 8 months ago