Create README.md
README.md
ADDED
@@ -0,0 +1,47 @@
1 |
+
---
|
2 |
+
license: llama2
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
pipeline_tag: conversational
|
6 |
+
tags:
|
7 |
+
- Xwin
|
8 |
+
- Euryale 1.3
|
9 |
+
- frankenmerge
|
10 |
+
- 90b
|
11 |
+
---
|
12 |
+
# BigWeave v6 90B
|
13 |
+
|
14 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65a6db055c58475cf9e6def1/4CbbAN-X7ZWj702JrcCGH.png" width=600>
|
15 |
+
|
16 |
+
A Goliath-120b-style frankenmerge of Xwin-LM-70b-v0.1 and Euryale-1.3-70b. The goal is to find other merge combinations that work well.
|
17 |
+
|
18 |
+
The version number is for me to keep track of the merges; only results that seem to work reasonably well are kept and published.
|
19 |
+
|
20 |
+
# Prompting Format
|
21 |
+
Both the Vicuna and Alpaca prompt formats can be used.
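
For illustration, here is a minimal sketch of how prompts in the two formats are usually assembled. The template strings are the widely used Vicuna v1.1 and Alpaca defaults; this card does not state which exact variants were tested, so treat the wording and the helper names `vicuna_prompt` and `alpaca_prompt` as assumptions.

```python
# Widely used default templates; assumed here, not confirmed by this model card.
VICUNA_SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)

ALPACA_HEADER = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)


def vicuna_prompt(user_message: str, system: str = VICUNA_SYSTEM) -> str:
    """Build a single-turn Vicuna-style prompt (hypothetical helper)."""
    return f"{system} USER: {user_message} ASSISTANT:"


def alpaca_prompt(instruction: str) -> str:
    """Build an Alpaca-style prompt without an input field (hypothetical helper)."""
    return f"{ALPACA_HEADER}\n\n### Instruction:\n{instruction}\n\n### Response:\n"


print(vicuna_prompt("Write a haiku about merging models."))
print(alpaca_prompt("Write a haiku about merging models."))
```

The model's reply is generated as the continuation after `ASSISTANT:` or `### Response:`.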
|
22 |
+
|
23 |
+
# Merge process
|
24 |
+
The models used in the merge are [Xwin-LM-70b-v0.1](https://huggingface.co/Xwin-LM/Xwin-LM-70B-V0.1) and [Euryale-1.3-70b](https://huggingface.co/Sao10K/Euryale-1.3-L2-70B).
|
25 |
+
|
26 |
+
The layer mix:
|
27 |
+
```yaml
|
28 |
+
- range 0, 12
|
29 |
+
Xwin
|
30 |
+
- range 9, 14
|
31 |
+
Euryale
|
32 |
+
- range 12, 62
|
33 |
+
Xwin
|
34 |
+
- range 54, 71
|
35 |
+
Euryale
|
36 |
+
- range 62, 80
|
37 |
+
Xwin
|
38 |
+
```
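
The layer mix above can be turned into a merge configuration and sanity-checked with a few lines of Python. This is a sketch under stated assumptions, not the exact recipe used for this model: it assumes each range is half-open (`start` inclusive, `end` exclusive, as in mergekit's `layer_range`), that the slices are stacked with mergekit's `passthrough` method as in Goliath-style merges, and that `float16` is the output dtype. The model IDs are taken from the links above.

```python
import json

XWIN = "Xwin-LM/Xwin-LM-70B-V0.1"
EURYALE = "Sao10K/Euryale-1.3-L2-70B"

# (model, start, end) for each slice, in stacking order, copied from the layer mix.
LAYER_MIX = [
    (XWIN, 0, 12),
    (EURYALE, 9, 14),
    (XWIN, 12, 62),
    (EURYALE, 54, 71),
    (XWIN, 62, 80),
]


def total_layers(mix):
    """Count layers in the merged model, assuming half-open ranges."""
    return sum(end - start for _, start, end in mix)


def to_mergekit_config(mix, dtype="float16"):
    """Build a mergekit-style passthrough config dict (dtype is an assumption)."""
    return {
        "slices": [
            {"sources": [{"model": model, "layer_range": [start, end]}]}
            for model, start, end in mix
        ],
        "merge_method": "passthrough",
        "dtype": dtype,
    }


print(total_layers(LAYER_MIX))  # 12 + 5 + 50 + 17 + 18 = 102
print(json.dumps(to_mergekit_config(LAYER_MIX), indent=2))
```

Under these assumptions the merged model has 102 transformer layers, compared with 80 in each 70B source model.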
|
39 |
+
|
40 |
+
# Acknowledgements
|
41 |
+
[@Xwin-LM](https://huggingface.co/Xwin-LM) for creating Xwin.
|
42 |
+
|
43 |
+
[@Sao10K](https://huggingface.co/Sao10K) for creating Euryale.
|
44 |
+
|
45 |
+
[@alpindale](https://huggingface.co/alpindale) for creating the original Goliath.
|
46 |
+
|
47 |
+
[@chargoddard](https://huggingface.co/chargoddard) for developing [mergekit](https://github.com/cg123/mergekit).
|