hxssgaa
/

llama-3-8b-dpo-full

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

llama-3-8b-dpo-full / runs /Oct07_16-47-37_a2ap-dgx001

Commit History

End of training

1169074
verified

hxssgaa commited on Oct 7

Training in progress, step 477

c58d37d
verified

hxssgaa commited on Oct 7

Training in progress, step 400

ef510b8
verified

hxssgaa commited on Oct 7

Training in progress, step 300

f01c97c
verified

hxssgaa commited on Oct 7

Training in progress, step 200

62b3db7
verified

hxssgaa commited on Oct 7

Training in progress, step 100

d96c7bc
verified

hxssgaa commited on Oct 7