update readme: clarify R1 made the DSL, not the company itself
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ This is the syntax for the DSL I trained it on, which is called ROL (Reasoning O
|
|
25 |
`↺` Self-Reflect - Reconsider assumptions, mitigate overconfidence. Usually begins with the word "Wait" or "Alternatively" or "Actually". Use this at least 3 times.
|
26 |
`➤` Output - Structure tone, format, intent.
|
27 |
```
|
28 |
-
ROL was invented by Deepseek and tweaked by me. The spec for it was not included in the training data - the model figured out how it works based on a synthetic hand-edited dataset with 65 samples.
|
29 |
|
30 |
The total training time for this was under 15 minutes.
|
31 |
|
|
|
25 |
`↺` Self-Reflect - Reconsider assumptions, mitigate overconfidence. Usually begins with the word "Wait" or "Alternatively" or "Actually". Use this at least 3 times.
|
26 |
`➤` Output - Structure tone, format, intent.
|
27 |
```
|
28 |
+
ROL was invented by Deepseek R1 and tweaked by me. The spec for it was not included in the training data - the model figured out how it works based on a synthetic hand-edited dataset with 65 samples.
|
29 |
|
30 |
The total training time for this was under 15 minutes.
|
31 |
|