StarChat2 15B - a HuggingFaceH4 Collection

StarChat2 15B

Updated Apr 12

Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook

StarChat2 Demo

Paused • 135
HuggingFaceH4/starchat2-15b-v0.1

Text Generation • Updated Mar 13 • 27.4k • 107
HuggingFaceH4/starchat2-15b-sft-v0.1

Text Generation • Updated Mar 12 • 50 • 5

Note: The SFT model that was then aligned with DPO
jondurbin/airoboros-3.2

Viewer • Updated Jan 2 • 58.7k • 119 • 44

Note: Part of the SFT mix
abacusai/SystemChat

Viewer • Updated Mar 4 • 7.02k • 107 • 126

Note: Part of the SFT mix
microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4 • 200k • 1.14k • 423

Note: Part of the SFT mix
m-a-p/Code-Feedback

Viewer • Updated Feb 26 • 66.4k • 195 • 198

Note: Part of the SFT mix
LDJnr/Capybara

Viewer • Updated Jun 7 • 16k • 215 • 229

Note: Part of the SFT mix
HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16 • 187k • 6.95k • 254

Note: Part of the DPO mix
Intel/orca_dpo_pairs

Viewer • Updated Nov 29, 2023 • 12.9k • 1.68k • 291

Note: Part of the DPO mix