license: other
license_name: microsoft-research-license
license_link: https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE
pipeline_tag: text-generation
---

# OrcaMaid-13b-v2-FIX-10k

This is the fixed version of **OrcaMaid-v2-13b**, further extended to a context length of 10240 tokens via YaRN. The (now-deleted) v2 model had a problem with its merged tokenizer that prevented it from stopping when it should and caused it to output broken ChatML tokens such as `<|im_end`.

This is a gradient SLERP merge of Microsoft's [Orca-2-13b](https://huggingface.co/microsoft/Orca-2-13b) and Undi and IkariDev's [Noromaid-v0.1.1-13b](https://huggingface.co/NeverSleep/Noromaid-13b-v0.1.1), biased towards Orca.

Just as with OrcaMaid v1, the overall goal of this merge is to create a model that sounds uniquely human and natural, without sacrificing intelligence.
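
For readers unfamiliar with the term, here is a minimal sketch of what a gradient SLERP merge does conceptually: each pair of weight tensors is interpolated along the arc between them, and the interpolation factor `t` changes from layer to layer. This is not mergekit's code and not the configuration used for this model. The anchor values in `HYPOTHETICAL_ANCHORS`, the helper names, and the convention that `t = 0` means "all Orca" are assumptions made for illustration only.

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    t = 0 returns v0, t = 1 returns v1.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 < eps or norm1 < eps:
        # A zero vector has no direction, so fall back to plain lerp.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        # Nearly parallel or anti-parallel vectors: lerp is numerically safer.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]

def layer_weight(layer, num_layers, anchors):
    """Piecewise-linear 'gradient' of t across the layer stack."""
    if num_layers == 1:
        return anchors[0]
    pos = layer / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]

NUM_LAYERS = 40                           # Llama 2 13B has 40 transformer layers
HYPOTHETICAL_ANCHORS = [0.0, 0.4, 0.2]    # assumption: low t keeps the merge near Orca

# Toy two-element "tensors" standing in for one layer of each parent model.
orca_weights = [1.0, 0.0]
noromaid_weights = [0.0, 1.0]
t = layer_weight(20, NUM_LAYERS, HYPOTHETICAL_ANCHORS)
merged = slerp(t, orca_weights, noromaid_weights)
```

Keeping `t` below 0.5 on most layers is one way a merge could end up "biased towards Orca"; the actual per-layer values for this model are not given here.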

The prompt format is Alpaca. You can use the standard format shown below, but for best results you should customize the system prompt to your specific needs.

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{YOUR MESSAGE HERE}

### Response:
{BOT MESSAGE HERE}
```
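
If you assemble prompts in code, a small helper along these lines reproduces the template above. The function name `build_alpaca_prompt` and the constant `DEFAULT_SYSTEM_PROMPT` are hypothetical names chosen for this sketch. The prompt ends right after `### Response:` so that the model writes the bot message itself.

```python
DEFAULT_SYSTEM_PROMPT = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)

def build_alpaca_prompt(message, system_prompt=DEFAULT_SYSTEM_PROMPT):
    """Return an Alpaca-style prompt that ends where the model should start writing."""
    return (
        f"{system_prompt}\n\n"
        f"### Instruction:\n{message}\n\n"
        f"### Response:\n"
    )

# Standard system prompt.
prompt = build_alpaca_prompt("Summarize the plot of Hamlet in two sentences.")

# Customized system prompt, as recommended above.
custom = build_alpaca_prompt(
    "Summarize the plot of Hamlet in two sentences.",
    system_prompt="You are a concise literary assistant.",
)
```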

### Misc. information
- BOS token is `<s>`
- EOS token is `</s>`
- Native context length is `10240` via YaRN (original context length was `4096`)
- Base model is Llama 2
- Due to the inclusion of Orca-2-13b, the model is subject to the terms of the [Microsoft Research License](https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE)
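
As a quick worked example of the numbers in the list above, the sketch below derives the context scale factor implied by the two context lengths and shows two small client-side checks. The factor is only the ratio of the stated lengths; the exact YaRN settings shipped with the model are not listed here, and the helper names `fits_in_context` and `trim_at_eos` are hypothetical.

```python
ORIGINAL_CONTEXT = 4096    # Llama 2 context length, as stated above
EXTENDED_CONTEXT = 10240   # context length after the YaRN extension
EOS_TOKEN = "</s>"

# Ratio of extended to original context length.
scale_factor = EXTENDED_CONTEXT / ORIGINAL_CONTEXT

def fits_in_context(prompt_tokens, max_new_tokens, context_length=EXTENDED_CONTEXT):
    """Check that the prompt plus the requested generation fits in the window."""
    return prompt_tokens + max_new_tokens <= context_length

def trim_at_eos(text, eos_token=EOS_TOKEN):
    """Cut decoded text at the first EOS token, if the backend left one in."""
    index = text.find(eos_token)
    return text if index == -1 else text[:index]

print(scale_factor)                  # 2.5
print(fits_in_context(10000, 240))   # True
print(trim_at_eos("Hello there.</s>"))
```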

### Thanks
- Thanks to [Charles Goddard](https://github.com/cg123) for his kind help with mergekit (as always)
- Thanks to [Undi](https://ko-fi.com/undiai) and [IkariDev](https://ikaridevgit.github.io/) for Noromaid