nkpz
/

falcon-thought-7b-v0

nkpz commited on 10 days ago

Commit

2ec39aa

verified ·

1 Parent(s): 193ac32

update readme: clarify R1 made the DSL, not the company itself

Files changed (1) hide show

README.md CHANGED Viewed

@@ -25,7 +25,7 @@ This is the syntax for the DSL I trained it on, which is called ROL (Reasoning O
 `↺` Self-Reflect - Reconsider assumptions, mitigate overconfidence. Usually begins with the word "Wait" or "Alternatively" or "Actually". Use this at least 3 times.
 `➤` Output - Structure tone, format, intent.
 ```
-ROL was invented by Deepseek and tweaked by me. The spec for it was not included in the training data - the model figured out how it works based on a synthetic hand-edited dataset with 65 samples.
 The total training time for this was under 15 minutes.

 `↺` Self-Reflect - Reconsider assumptions, mitigate overconfidence. Usually begins with the word "Wait" or "Alternatively" or "Actually". Use this at least 3 times.
 `➤` Output - Structure tone, format, intent.
 ```
+ROL was invented by Deepseek R1 and tweaked by me. The spec for it was not included in the training data - the model figured out how it works based on a synthetic hand-edited dataset with 65 samples.
 The total training time for this was under 15 minutes.