Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
chchen
/
Mistral-7B-Instruct-v0.2-ORPO
like
0
PEFT
Safetensors
llama-factory
lora
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
Mistral-7B-Instruct-v0.2-ORPO
/
adapter_model.safetensors
Commit History
Training in progress, step 1500
d85b286
verified
chchen
commited on
May 22
Training in progress, step 1000
1d2f5ca
verified
chchen
commited on
May 22
Training in progress, step 500
a144b89
verified
chchen
commited on
May 22
Model save
18b1374
verified
chchen
commited on
May 20