Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wxzhang
/
dpo-selective-longerrun
like
0
Text Generation
Transformers
Safetensors
mistral
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
dpo-selective-longerrun
Commit History
Model save
59c7c4d
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 1500
048bd6f
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 1400
f9b6a81
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 1300
ca7283f
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 1200
d58cdeb
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 1100
26f18de
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 1000
2de1d9a
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 900
8d90f1d
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 800
40bada0
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 700
7653b41
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 600
7d90a1d
verified
wxzhang
commited on
Apr 3, 2024
Training in progress, step 500
aab0daa
verified
wxzhang
commited on
Apr 2, 2024
Training in progress, step 400
2d23bb5
verified
wxzhang
commited on
Apr 2, 2024
Training in progress, step 300
37abb5c
verified
wxzhang
commited on
Apr 2, 2024
Training in progress, step 200
6f17790
verified
wxzhang
commited on
Apr 2, 2024
Training in progress, step 100
9da5a65
verified
wxzhang
commited on
Apr 2, 2024
initial commit
f9d8dbb
verified
wxzhang
commited on
Apr 2, 2024