allenai/tulu-v2.5-ppo-13b-hh-rlhf-60k
Text Generation
Transformers
Safetensors
allenai/tulu-2.5-preference-data
allenai/tulu-v2-sft-mixture
English
llama
conversational
text-generation-inference
Inference Endpoints
arxiv: 2406.09279
License: apache-2.0
main / tulu-v2.5-ppo-13b-hh-rlhf-60k / README.md
Commit History
Update README.md (9f489ec, verified), hamishivi committed on Jun 14
Update README.md (26c6387, verified), hamishivi committed on Jun 12
Update README.md (778a12e, verified), hamishivi committed on Jun 12
Update README.md (49ec259, verified), hamishivi committed on Jun 12
Create README.md (b72b74e, verified), hamishivi committed on Jun 12