Document/publish the training data and training procedure

#7
by kno10 - opened

It would be important to document:

  • how much data was used for finetuning
  • how many samples were used for DPO
  • mixture of training data (in particular, languages)
