Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ContextualAI
/
archangel_ppo_llama13b
like
0
Follow
ContextualAI
53
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
ae03dc3
archangel_ppo_llama13b
Commit History
Upload README.md with huggingface_hub
ae03dc3
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
af401a6
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
574c634
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
b1a3e0f
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
819363c
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
78c0db7
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
2502bb9
xwinxu
commited on
Dec 6, 2023
Upload README.md with huggingface_hub
3b2ee14
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
3e06ca8
stas
commited on
Nov 26, 2023
Upload tokenizer
eea4d3b
stas
commited on
Nov 26, 2023
initial commit
d10555f
stas
commited on
Nov 26, 2023