Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ Built using the DARE TIES merge method, it combines pre-trained language models
|
|
18 |
The model configuration emphasizes long sequence lengths, conversation datasets, and dense reasoning abilities.
|
19 |
|
20 |
## Note:
|
21 |
-
If you want good reasoning power from this model, please use
|
22 |
|
23 |
|
24 |
### Configuration
|
|
|
18 |
The model configuration emphasizes long sequence lengths, conversation datasets, and dense reasoning abilities.
|
19 |
|
20 |
## Note:
|
21 |
+
If you want good reasoning power from this model, please use BF16 and XTC sampling.
|
22 |
|
23 |
|
24 |
### Configuration
|