kaiokendev
committed on
Commit
3ed5582
1
Parent(s):
86709bd
Update README.md
README.md
CHANGED
@@ -4,7 +4,7 @@ license: mit
 
 ### SuperHOT Prototype 2 w/ 4-8K Context
 
-This is a second prototype of SuperHOT, this time with 4K context and no RLHF. In my testing, it can go all the way to 6K without breaking down and I made the change with intention to reach 8K, so I'll assume it will go to 8K although I only trained on 4K sequences.
+This is a second prototype of SuperHOT, a NSFW focused LoRA, this time with 4K context and no RLHF. In my testing, it can go all the way to 6K without breaking down and I made the change with intention to reach 8K, so I'll assume it will go to 8K although I only trained on 4K sequences.
 
 #### Looking for Merged & Quantized Models?
 - 13B 8K GGML: [tmpupload/superhot-13b-8k-no-rlhf-test-GGML](https://huggingface.co/tmpupload/superhot-13b-8k-no-rlhf-test-GGML)