ptaszynski commited on
Commit
5663327
1 Parent(s): 0bc3258

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -13
README.md CHANGED
@@ -1,16 +1,12 @@
1
  ---
2
- language: pl
3
-
4
- license: cc-by-sa-4.0
5
-
6
  datasets:
7
-
8
- - Polish subset of Open Subtitles
9
- - Polish subset of ParaCrawl
10
- - Polish Parliamentary Corpus
11
- - Polish Wikipedia - Feb 2020
12
- - Expert-annotated Dataset for Automatic Cyberbullying Detection in Polish Laguage
13
-
14
  ---
15
 
16
  # Polbert-CB - Polish BERT trained for Automatic Cyberbullying Detection
@@ -65,11 +61,23 @@ Original dataset:
65
 
66
  Improved dataset:
67
 
 
 
 
68
  ```
69
- TBA
 
 
 
 
 
 
 
 
 
70
  ```
71
 
72
  ## References
73
  * https://github.com/google-research/bert
74
  * https://github.com/ptaszynski/cyberbullying-Polish
75
- * https://huggingface.co/datasets/poleval2019_cyberbullying
 
1
  ---
2
+ license: cc-by-4.0
 
 
 
3
  datasets:
4
+ - ptaszynski/PolishCyberbullyingDataset
5
+ language:
6
+ - pl
7
+ tags:
8
+ - cyberbullying
9
+ - hate-speech
 
10
  ---
11
 
12
  # Polbert-CB - Polish BERT trained for Automatic Cyberbullying Detection
 
61
 
62
  Improved dataset:
63
 
64
+ The improved dataset used for training this model was released as follows.
65
+ [Expert-annotated dataset to study cyberbullying in Polish language](https://huggingface.co/datasets/ptaszynski/PolishCyberbullyingDataset)
66
+
67
  ```
68
+ @article{ptaszynski2023expert,
69
+ title={Expert-Annotated Dataset to Study Cyberbullying in Polish Language},
70
+ author={Ptaszynski, Michal and Pieciukiewicz, Agata and Dybala, Pawel and Skrzek, Pawel and Soliwoda, Kamil and Fortuna, Marcin and Leliwa, Gniewosz and Wroczynski, Michal},
71
+ journal={Data},
72
+ volume={9},
73
+ number={1},
74
+ pages={1},
75
+ year={2023},
76
+ publisher={MDPI}
77
+ }
78
  ```
79
 
80
  ## References
81
  * https://github.com/google-research/bert
82
  * https://github.com/ptaszynski/cyberbullying-Polish
83
+ * https://huggingface.co/datasets/poleval2019_cyberbullying