Update README.md
Browse files
README.md
CHANGED
@@ -66,7 +66,10 @@ Actively monitoring and intervening based on a decrease in perplexity on the gsm
|
|
66 |
|
67 |
This process not only helps in understanding the effectiveness of Spherical Linear Interpolation but also introduces a new method for refining models with enhanced skills through a cycle of targeted data selection (Laser data(x)) + SLERP, followed by a subsequent focus on different data (Laser again on data(y)).
|
68 |
|
69 |
-
Additionally, we integrated a novel training strategy on the SFT and DPO training process, where we partially freeze the model according to a laser-like analysis aiming to navigate and optimize the trade-offs highlighted by the no free lunch theorem. This innovative training method effectively prevents the significant problem of forgetting previously acquired knowledge.
|
|
|
|
|
|
|
70 |
|
71 |
|
72 |
We improved the German language skills on this model. Nevertheless, certain formulations may occur that are not entirely correct.
|
|
|
66 |
|
67 |
This process not only helps in understanding the effectiveness of Spherical Linear Interpolation but also introduces a new method for refining models with enhanced skills through a cycle of targeted data selection (Laser data(x)) + SLERP, followed by a subsequent focus on different data (Laser again on data(y)).
|
68 |
|
69 |
+
Additionally, we integrated a novel training strategy on the SFT and DPO training process, where we partially freeze the model according to a laser-like analysis aiming to navigate and optimize the trade-offs highlighted by the no free lunch theorem. This innovative training method effectively prevents the significant problem of forgetting previously acquired knowledge.
|
70 |
+
This aspect is particularly crucial when attempting to teach the model specific skills, such as a new language, where traditionally, the model might lose a considerable amount of its prior knowledge and exhibit a decline in overall intelligence.
|
71 |
+
|
72 |
+
Concrete information on how the new training strategy works and the advantages it offers over conventional training methods will soon be published in a detailed paper by the LaserRMT research group.
|
73 |
|
74 |
|
75 |
We improved the German language skills on this model. Nevertheless, certain formulations may occur that are not entirely correct.
|