Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ContextualAI
/
archangel_dpo_llama7b
like
0
Follow
ContextualAI
52
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
16bb3f6
archangel_dpo_llama7b
Commit History
Upload README.md with huggingface_hub
16bb3f6
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
29439c5
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
68e5858
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
e13b70f
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
e9039ec
xwinxu
commited on
Dec 6, 2023
Upload README.md with huggingface_hub
cac935f
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
7b2caf9
stas
commited on
Nov 25, 2023
Upload tokenizer
da3f291
stas
commited on
Nov 25, 2023
initial commit
d29e27a
stas
commited on
Nov 25, 2023