Update README.md
Browse files
README.md
CHANGED
@@ -9,8 +9,42 @@ metrics:
|
|
9 |
pipeline_tag: fill-mask
|
10 |
widget:
|
11 |
- text: Det vore [MASK] häller nödvändigt att bita af tungan än berättat hvad jag varit med om.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
---
|
13 |
|
14 |
# Historical Swedish Bert Model
|
15 |
|
16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
pipeline_tag: fill-mask
|
10 |
widget:
|
11 |
- text: Det vore [MASK] häller nödvändigt att bita af tungan än berättat hvad jag varit med om.
|
12 |
+
|
13 |
+
train-eval-index:
|
14 |
+
- config: Riksarkivet/mini_cleaned_diachronic_swe
|
15 |
+
task: fill-mask
|
16 |
+
task_id: fill-mask
|
17 |
+
splits:
|
18 |
+
eval_split: test
|
19 |
+
col_mapping:
|
20 |
+
text: text
|
21 |
+
|
22 |
+
model-index:
|
23 |
+
- name: bert-base-cased-swe-1800
|
24 |
+
results:
|
25 |
+
- task:
|
26 |
+
type: fill-mask
|
27 |
+
name: fill-mask
|
28 |
+
dataset:
|
29 |
+
name: Riksarkivet/mini_cleaned_diachronic_swe
|
30 |
+
type: Riksarkivet/mini_cleaned_diachronic_swe
|
31 |
+
split: test
|
32 |
+
metrics:
|
33 |
+
- type: perplexity
|
34 |
+
value: 3.42
|
35 |
+
name: Perplexity
|
36 |
---
|
37 |
|
38 |
# Historical Swedish Bert Model
|
39 |
|
40 |
+
A historical Swedish Bert model is released from the National Swedish Archives to better generalise to Swedish historical text. Researches are well-aware that the Swedish language has been subject to change over time which means that present-day point-of-view models less ideal candidates for the job.
|
41 |
+
However, this model can be used to interpret and analyse historical textual material and be fine-tuned for different downstream tasks.
|
42 |
+
|
43 |
+
## Model Dscription
|
44 |
+
|
45 |
+
|
46 |
+
|
47 |
+
## Acknowledgements
|
48 |
+
|
49 |
+
We gratefully acknowledge the HPC RIVR consortium (https://www.hpc-rivr.si) and EuroHPC JU (https://eurohpc-ju.europa.eu) for funding this research by providing computing resources of the HPC system Vega at the Institute of Information Science (https://www.izum.si).
|
50 |
+
|