Update README.md
Browse files
README.md
CHANGED
@@ -17,6 +17,8 @@ Built using the DARE TIES merge method, it combines pre-trained language models
|
|
17 |
|
18 |
The model configuration emphasizes long sequence lengths, conversation datasets, and dense reasoning abilities.
|
19 |
|
|
|
|
|
20 |
|
21 |
|
22 |
### Configuration
|
|
|
17 |
|
18 |
The model configuration emphasizes long sequence lengths, conversation datasets, and dense reasoning abilities.
|
19 |
|
20 |
+
## Note:
|
21 |
+
If you want good reasoning power from this model, please use FP16 and XTC sampling.
|
22 |
|
23 |
|
24 |
### Configuration
|