ContextualAI/archangel_sft-dpo_pythia1-4b
Tags: Text Generation, Transformers, Safetensors, English, gpt_neox, human feedback, rlhf, preferences, alignment, HALO, halos, dpo, rl, text-generation-inference, Inference Endpoints
Datasets: stanfordnlp/SHP, Anthropic/hh-rlhf, OpenAssistant/oasst1
License: apache-2.0
Commit History
2689685  Upload README.md with huggingface_hub  (xwinxu committed on Dec 6, 2023)
60b6b81  Upload README.md with huggingface_hub  (xwinxu committed on Dec 6, 2023)
43b4571  Upload GPTNeoXForCausalLM  (stas committed on Dec 2, 2023)
9229066  Upload tokenizer  (stas committed on Dec 2, 2023)
8788509  initial commit  (stas committed on Dec 2, 2023)