Commit 647eb4c
1 Parent(s): 61f52c7
Update README.md
README.md CHANGED
@@ -9,6 +9,9 @@ tags:
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
+Using the updated config file and a rope_alpha=2.5, this should be able to handle a context up to 16384 (formerly would start devolving after 4k).
+May be unstable past that - have so far been unable to get coherency out fully to 32k.
+
 ## Merge Details
 ### Merge Method
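
The added README lines mention `rope_alpha=2.5` without saying how the value is applied. As a rough sketch, assuming `rope_alpha` means the NTK-aware RoPE scaling factor that some loaders expose, the alpha is typically folded into the rotary base rather than into the positions. The `head_dim=128` and `base=10000.0` defaults below are illustrative assumptions, not values taken from this repository's config, and the function names are hypothetical.

```python
import math


def ntk_scaled_rope_base(alpha: float, head_dim: int = 128, base: float = 10000.0) -> float:
    """Return the rotary base after NTK-aware alpha scaling.

    Assumed formula: base * alpha ** (head_dim / (head_dim - 2)).
    head_dim and base are illustrative defaults, not read from the model.
    """
    return base * alpha ** (head_dim / (head_dim - 2))


def rope_inv_freq(base: float, head_dim: int = 128) -> list[float]:
    """Inverse frequencies for each rotary pair: base ** (-2i / head_dim)."""
    return [base ** (-2 * i / head_dim) for i in range(head_dim // 2)]


alpha = 2.5
original = rope_inv_freq(10000.0)
scaled = rope_inv_freq(ntk_scaled_rope_base(alpha))

# The fastest-rotating pair is unchanged, while the slowest-rotating pair
# has its wavelength stretched by exactly alpha.
highest_freq_ratio = original[0] / scaled[0]
lowest_freq_ratio = original[-1] / scaled[-1]
```

Under these assumptions the short-range (high-frequency) components keep their original resolution and only the long-range components are stretched, which is the usual motivation for scaling the base instead of interpolating positions. Whether a given loader applies `rope_alpha` exactly this way should be checked against that loader's own documentation.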