Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook
Note: The SFT model that was used for alignment with DPO
Note: Part of the SFT mix
Note: Part of the SFT mix
Note: Part of the SFT mix
Note: Part of the SFT mix
Note: Part of the SFT mix
Note: Part of the DPO mix
Note: Part of the DPO mix