haritzpuerto
commited on
Commit
•
556f49a
1
Parent(s):
90f97b1
Update README.md
Browse files
README.md
CHANGED
@@ -25,9 +25,6 @@ This is the official model from the publication "Fine-Tuning with Divergent Chai
|
|
25 |
> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
|
26 |
|
27 |
|
28 |
-
Stay tuned for the release of the paper!
|
29 |
-
|
30 |
-
|
31 |
# Load the Model
|
32 |
```
|
33 |
from peft import LoraConfig, PeftModel
|
|
|
25 |
> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
|
26 |
|
27 |
|
|
|
|
|
|
|
28 |
# Load the Model
|
29 |
```
|
30 |
from peft import LoraConfig, PeftModel
|