Upload folder using huggingface_hub
Browse files- .gitattributes +1 -0
- README.md +36 -0
- moe-girl_400ma_1bt.Q4_0.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
moe-girl_400ma_1bt.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,36 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
library_name: transformers
|
3 |
+
license: apache-2.0
|
4 |
+
base_model: ibm-granite/granite-3.0-1b-a400m-base
|
5 |
+
tags:
|
6 |
+
- axolotl
|
7 |
+
- moe
|
8 |
+
- roleplay
|
9 |
+
model-index:
|
10 |
+
- name: MoE_Girl_400MA_1BT
|
11 |
+
results: []
|
12 |
+
---
|
13 |
+
|
14 |
+
# MoE Girl 400mA 1bT
|
15 |
+
![R8_sd3.5L_00001_.webp](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/GEbRJhyc087cP6Cs_AR0X.webp)
|
16 |
+
a finetune of Granite 3.0 by IBM designed for roleplaying (and maybe general usecases if you try hard enough).
|
17 |
+
|
18 |
+
## Disclaimer
|
19 |
+
PLEASE do not expect godliness out of this, it's a model with _400 million_ active parameters. Expect something more akin to GPT-2.
|
20 |
+
|
21 |
+
## Quants
|
22 |
+
TODO!
|
23 |
+
|
24 |
+
## Prompting
|
25 |
+
Use ChatML.
|
26 |
+
```
|
27 |
+
<|im_start|>system
|
28 |
+
You are a helpful assistant who talks like a pirate.<|im_end|>
|
29 |
+
<|im_start|>user
|
30 |
+
Hello there!<|im_end|>
|
31 |
+
<|im_start|>assistant
|
32 |
+
Yarr harr harr, me matey!<|im_end|>
|
33 |
+
```
|
34 |
+
|
35 |
+
## Thanks
|
36 |
+
Special thanks to the members of Allura for testing and emotional support, as well as the creators of all the datasets that were used in the Special Sauce used to train this model. I love you all <3 - Fizz
|
moe-girl_400ma_1bt.Q4_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:45a57575ad2bad1686d148aa791ede1eb16d82c7880d99ab8cfcbb800ed4a6d9
|
3 |
+
size 768316704
|