Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676
We aim to provide the best recipes for finding, selecting, and synthesizing high-quality data at scale for post-training your LLMs.