Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ContextualAI
/
archangel_ppo_llama30b
like
0
Follow
ContextualAI
54
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
3f15d76
archangel_ppo_llama30b
Commit History
Upload tokenizer
3f15d76
verified
stas
commited on
Jan 11
Upload README.md with huggingface_hub
24d75d8
xwinxu
commited on
Jan 9
Upload tokenizer
c08e7c4
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
9df898e
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
5495f8b
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
d4f6710
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
6502c59
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
205d01e
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
d2d2428
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
595c5a5
xwinxu
commited on
Dec 6, 2023
Upload README.md with huggingface_hub
c297db2
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
a3f5fca
stas
commited on
Nov 26, 2023
Upload tokenizer
703835b
stas
commited on
Nov 26, 2023
initial commit
319be27
stas
commited on
Nov 26, 2023