alimosavian
commited on
Commit
•
3c58bde
1
Parent(s):
1097ce5
Update README.md
Browse files
README.md
CHANGED
@@ -77,7 +77,7 @@ r = pipe(
|
|
77 |
|
78 |
### Training Data
|
79 |
|
80 |
-
The model has been on a proprietary dataset of ~1.35M examples consisting of
|
81 |
* High quality swedish instruct data
|
82 |
* Single turn
|
83 |
* Multi-turn
|
@@ -88,14 +88,14 @@ The model has been on a proprietary dataset of ~1.35M examples consisting of
|
|
88 |
For training we used hugginface Accelerate and TRL.
|
89 |
|
90 |
|
91 |
-
#### Preprocessing
|
92 |
|
93 |
For efficiency, we packed all the examples into 8K context windows, reducing the number examples to ~12% of their original count.
|
94 |
|
95 |
|
96 |
#### Training Hyperparameters
|
97 |
|
98 |
-
- **Training regime:**
|
99 |
|
100 |
|
101 |
[More Information Needed]
|
@@ -104,8 +104,10 @@ For efficiency, we packed all the examples into 8K context windows, reducing the
|
|
104 |
|
105 |
The model has been evaluated on [Scandeval](https://scandeval.com/swedish-nlg/) swedish subset.
|
106 |
|
|
|
107 |
![metrics](assets/metrics.png)
|
108 |
|
|
|
109 |
![mean-score](assets/mean_score.png)
|
110 |
|
111 |
|
|
|
77 |
|
78 |
### Training Data
|
79 |
|
80 |
+
The model has been trained on a proprietary dataset of ~1.35M examples consisting of
|
81 |
* High quality swedish instruct data
|
82 |
* Single turn
|
83 |
* Multi-turn
|
|
|
88 |
For training we used hugginface Accelerate and TRL.
|
89 |
|
90 |
|
91 |
+
#### Preprocessing
|
92 |
|
93 |
For efficiency, we packed all the examples into 8K context windows, reducing the number examples to ~12% of their original count.
|
94 |
|
95 |
|
96 |
#### Training Hyperparameters
|
97 |
|
98 |
+
- **Training regime:**
|
99 |
|
100 |
|
101 |
[More Information Needed]
|
|
|
104 |
|
105 |
The model has been evaluated on [Scandeval](https://scandeval.com/swedish-nlg/) swedish subset.
|
106 |
|
107 |
+
The result of the individual metrics compared to other top scoring models
|
108 |
![metrics](assets/metrics.png)
|
109 |
|
110 |
+
The mean score of all metrics compared to other models in the Swedish NLG category.
|
111 |
![mean-score](assets/mean_score.png)
|
112 |
|
113 |
|