zephyr-7b-dpo-full-prometheus-reward-scale-1-rpo / model-00003-of-00003.safetensors

Commit History

Training in progress, step 437
df0bdec
verified

sfulay committed on

Training in progress, step 400
a5168ad
verified

sfulay committed on

Training in progress, step 300
fc63106
verified

sfulay committed on

Training in progress, step 200
a560cfa
verified

sfulay committed on

Training in progress, step 100
fca3b62
verified

sfulay committed on

Training in progress, step 100
416f770
verified

sfulay committed on