Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
haes95
/
POLAR-10.7B-HES-DPO-v0.1
like
0
Text Generation
Transformers
Safetensors
Korean
llama
trl
dpo
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
POLAR-10.7B-HES-DPO-v0.1
Commit History
Update README.md
5189a0c
verified
haes95
commited on
May 29
Upload tokenizer
65b1af9
verified
haes95
commited on
May 29
Upload LlamaForCausalLM
d83f243
verified
haes95
commited on
May 29
initial commit
44dfcb1
verified
haes95
commited on
May 29