Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ widget:
|
|
7 |
---
|
8 |
|
9 |
## Model description
|
10 |
-
An xlm-roberta-large model fine-tuned on ~1,7 million annotated statements contained in the [Manifesto Corpus](https://manifesto-project.wzb.eu/information/documents/corpus) (version
|
11 |
The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme ([Handbook 4](https://manifesto-project.wzb.eu/coding_schemes/mp_v4)).
|
12 |
It works for all languages the xlm-roberta model is pretrained on ([overview](https://github.com/facebookresearch/fairseq/tree/main/examples/xlmr#introduction)), just note that it will perform best for the 38 languages contained in the Manifesto Corpus:
|
13 |
|
@@ -69,7 +69,7 @@ The model was evaluated on a test set of 200,920 annotated manifesto statements.
|
|
69 |
|
70 |
Please cite the model as follows:
|
71 |
|
72 |
-
Burst, Tobias / Lehmann, Pola / Franzmann, Simon / Al-Gaddooa, Denise / Ivanusch, Christoph / Regel, Sven / Riethmüller, Felicia / Weßels, Bernhard / Zehnter, Lisa (2024): manifestoberta. Version 56topics.sentence.
|
73 |
|
74 |
```bib
|
75 |
@misc{Burst:2024,
|
@@ -78,6 +78,6 @@ Burst, Tobias / Lehmann, Pola / Franzmann, Simon / Al-Gaddooa, Denise / Ivanusch
|
|
78 |
Publisher = {Wissenschaftszentrum Berlin für Sozialforschung / Göttinger Institut für Demokratieforschung},
|
79 |
Title = {manifestoberta. Version 56topics.sentence.2024.1.1},
|
80 |
doi = {10.25522/manifesto.manifestoberta.56topics.sentence.2024.1.1},
|
81 |
-
url = {https://doi.org/10.25522/manifesto.manifestoberta.56topics.sentence.2024.1.1},
|
82 |
Year = {2024},
|
83 |
```
|
|
|
7 |
---
|
8 |
|
9 |
## Model description
|
10 |
+
An xlm-roberta-large model fine-tuned on ~1,7 million annotated statements contained in the [Manifesto Corpus](https://manifesto-project.wzb.eu/information/documents/corpus) (version 2024a).
|
11 |
The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme ([Handbook 4](https://manifesto-project.wzb.eu/coding_schemes/mp_v4)).
|
12 |
It works for all languages the xlm-roberta model is pretrained on ([overview](https://github.com/facebookresearch/fairseq/tree/main/examples/xlmr#introduction)), just note that it will perform best for the 38 languages contained in the Manifesto Corpus:
|
13 |
|
|
|
69 |
|
70 |
Please cite the model as follows:
|
71 |
|
72 |
+
Burst, Tobias / Lehmann, Pola / Franzmann, Simon / Al-Gaddooa, Denise / Ivanusch, Christoph / Regel, Sven / Riethmüller, Felicia / Weßels, Bernhard / Zehnter, Lisa (2024): manifestoberta. Version 56topics.sentence.2024.1.1. Berlin: Wissenschaftszentrum Berlin für Sozialforschung (WZB) / Göttingen: Institut für Demokratieforschung (IfDem). https://doi.org/10.25522/manifesto.manifestoberta.56topics.sentence.2024.1.1
|
73 |
|
74 |
```bib
|
75 |
@misc{Burst:2024,
|
|
|
78 |
Publisher = {Wissenschaftszentrum Berlin für Sozialforschung / Göttinger Institut für Demokratieforschung},
|
79 |
Title = {manifestoberta. Version 56topics.sentence.2024.1.1},
|
80 |
doi = {10.25522/manifesto.manifestoberta.56topics.sentence.2024.1.1},
|
81 |
+
url = {https://doi.org/10.25522/manifesto.manifestoberta.56topics.sentence.2024.1.1},
|
82 |
Year = {2024},
|
83 |
```
|