Update README.md
Browse files
README.md
CHANGED
@@ -16,6 +16,18 @@ The model was trained on ~8 billion tokens.
|
|
16 |
- Extended Training: Further refinement of the model, resulting in improved benchmark performance and overall text generation quality.
|
17 |
- Tokenizer changes.
|
18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
## How coherent is the 150M model?
|
20 |
Let's look at real-world examples:
|
21 |
|
|
|
16 |
- Extended Training: Further refinement of the model, resulting in improved benchmark performance and overall text generation quality.
|
17 |
- Tokenizer changes.
|
18 |
|
19 |
+
## Chat format
|
20 |
+
|
21 |
+
This model is **very sensitive** to the chat template used. Ensure you use the correct template:
|
22 |
+
```
|
23 |
+
<s>system
|
24 |
+
[System message]</s>
|
25 |
+
<s>user
|
26 |
+
[Your question or message]</s>
|
27 |
+
<s>assistant
|
28 |
+
[The model's response]</s>
|
29 |
+
```
|
30 |
+
|
31 |
## How coherent is the 150M model?
|
32 |
Let's look at real-world examples:
|
33 |
|