How to finetune the model?

#1
by Labmem009 - opened

What kind of model is this, GPT2? How can I finetune the model? Thanks a lot!

Owner

Hi, I used this model as base model.
After training 100k wikipedia-based QA via SFT manner, DPO was performed using dataset
Thanks for comment!

Hi, I used this model as base model.
After training 100k wikipedia-based QA via SFT manner, DPO was performed using dataset
Thanks for comment!

I'm interested in this jp SLM. How can I fine-tune this model, could you pls offer a script or tell me how to fine-tune it?
Thanks a lot!

Owner
β€’
edited May 12

Hi,
I forgot to answer your previous question. Model architecture is GPT2.
I release the code in this repository.
Sorry for that English version is not ready now, but you can use codes.
You should modify the part of prompts from Japanese to your target language.
Thanks!

Hi,
I forgot to answer your previous question. Model architecture is GPT2.
I release the code in this repository.
Sorry for that English version is not ready now, but you can use codes.
You should modify the part of prompts from Japanese to your target language.
Thanks!

Thanks a lot!

Labmem009 changed discussion status to closed

Sign up or log in to comment