Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aakritil
/
content
like
0
Transformers
Safetensors
Generated from Trainer
trl
dpo
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
content
/
tokenizer
1 contributor
History:
1 commit
aakritil
aakritil/llama2b_reg_dpo_trainer
2bb6f58
verified
3 months ago
special_tokens_map.json
Safe
437 Bytes
aakritil/llama2b_reg_dpo_trainer
3 months ago
tokenizer.json
Safe
3.62 MB
aakritil/llama2b_reg_dpo_trainer
3 months ago
tokenizer.model
Safe
500 kB
LFS
aakritil/llama2b_reg_dpo_trainer
3 months ago
tokenizer_config.json
Safe
948 Bytes
aakritil/llama2b_reg_dpo_trainer
3 months ago