Model Description
This is a Medusa model for Mistral 7B Instruct v0.2. This is trained using the latest Medusa 2 commit.
Training:
- Dataset used is the self distillation dataset from Mistral 7B Instruct v0.2, temperature 0.3 with output token of 2048.
- It has been trained using axolotl fork as describe in Medusa 2 README.md
Inference:
- To load the model please follow the instruction found in Github
- Downloads last month
- 32
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.