Fine Tune Training
#2 · opened by DazzlingXeno
How did you fine-tune this? Did you convert the Gutenberg dataset to Mistral Instruct format, or did you just use a JSON/parquet file? Thanks in advance.
DazzlingXeno changed the discussion title from "Training" to "Fine Tune Training".
I used a modified version of Maxime Labonne's ORPO notebook. The data was formatted using ChatML. The changes are shown in this thread: https://huggingface.co/nbeerbower/mistral-nemo-gutenberg-12B-v2/discussions/1
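In case it's useful to others reading this, here is a minimal sketch of what the ChatML formatting step for an ORPO-style preference dataset can look like. The dataset id, column names, and the `format_row` helper are my own assumptions for illustration, not the exact code from the notebook or thread linked above.

```python
# Sketch only: wrap a prompt/chosen/rejected preference dataset in ChatML tags
# for an ORPO-style trainer. Dataset id and column names are assumptions.
from datasets import load_dataset

def format_row(row):
    # ORPO/DPO-style trainers expect a prompt plus "chosen" and "rejected" completions.
    prompt = (
        f"<|im_start|>user\n{row['prompt']}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )
    return {
        "prompt": prompt,
        "chosen": f"{row['chosen']}<|im_end|>\n",
        "rejected": f"{row['rejected']}<|im_end|>\n",
    }

dataset = load_dataset("jondurbin/gutenberg-dpo-v0.1", split="train")  # assumed dataset id
dataset = dataset.map(format_row)
print(dataset[0]["prompt"])
```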
Thank you!
I'm thinking of using the new Command-R 32 or Magnum 34B. Leaning towards Magnum, as that already uses ChatML. So I'm going to have to test your model with both formats to see if it matters.
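For what it's worth, a quick way to see which prompt format a checkpoint's tokenizer actually applies is to render its chat template. This is just a sketch; the model id below is the one linked above, so swap in whichever model you're testing.

```python
# Sketch: print the rendered chat template to see whether a checkpoint expects
# ChatML (<|im_start|>...) or Mistral-style [INST] formatting.
from transformers import AutoTokenizer

messages = [{"role": "user", "content": "Write a short scene set in a Regency drawing room."}]

model_id = "nbeerbower/mistral-nemo-gutenberg-12B-v2"  # example id from the link above
tok = AutoTokenizer.from_pretrained(model_id)
print(tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True))
```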