edwko commited on
Commit
d03a684
·
verified ·
1 Parent(s): 91d4796

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -16,6 +16,18 @@ The model was trained on ~8 billion tokens.
16
  - Extended Training: Further refinement of the model, resulting in improved benchmark performance and overall text generation quality.
17
  - Tokenizer changes.
18
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  ## How coherent is the 150M model?
20
  Let's look at real-world examples:
21
 
 
16
  - Extended Training: Further refinement of the model, resulting in improved benchmark performance and overall text generation quality.
17
  - Tokenizer changes.
18
 
19
+ ## Chat format
20
+
21
+ This model is **very sensitive** to the chat template used. Ensure you use the correct template:
22
+ ```
23
+ <s>system
24
+ [System message]</s>
25
+ <s>user
26
+ [Your question or message]</s>
27
+ <s>assistant
28
+ [The model's response]</s>
29
+ ```
30
+
31
  ## How coherent is the 150M model?
32
  Let's look at real-world examples:
33