chargoddard
/

mistral-11b-slimorca

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

chargoddard commited on Apr 22, 2024

Commit

4f456d9

•

1 Parent(s): ced8eaf

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -114,7 +114,8 @@ Full weight fine tuned on two epochs of [SlimOrca](https://huggingface.co/datase
 The base model for this came from a variation on Undi's [Mistral 11B recipe](https://huggingface.co/Undi95/Mistral-11B-v0.1). The `o_proj` and `down_proj` tensors were set to zero in the added layers, making the output exactly identical to Mistral 7B before training.
-Benchmarks look good locally but still evaluating actual usefulness.
 ### Reproducing

 The base model for this came from a variation on Undi's [Mistral 11B recipe](https://huggingface.co/Undi95/Mistral-11B-v0.1). The `o_proj` and `down_proj` tensors were set to zero in the added layers, making the output exactly identical to Mistral 7B before training.
+~Benchmarks look good locally but still evaluating actual usefulness.~
+Update: this turned out great! 10/10 would recommend as a training approach.
 ### Reproducing