Commit fbf513c by lucadiliello
1 parent: 76a5f93
Update README.md
README.md
CHANGED
@@ -10,4 +10,6 @@ pipeline_tag: fill-mask
 
 A small version of `DeBERTa` trained on the clean version of google C4 dataset. For more info about the size of the model, see `config.json`.
 
-The model has been trained for **100K** steps with a batch size of **2048** and a sequence length of **512**, for a total of **104B** tokens.
+The model has been trained for **100K** steps with a batch size of **2048** and a sequence length of **512**, for a total of **104B** tokens.
+
+The vocabulary and the tokenizer are the same as `microsoft/deberta-base`.
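
The token total quoted in the README can be sanity-checked from the other three figures. The sketch below is only an illustration: it assumes "100K" means exactly 100,000 steps and that every sequence is filled to the full 512 tokens (no padding), neither of which the README states. The function and variable names are made up for this example.

```python
# Figures quoted in the README; the exact values are assumptions
# ("100K" is taken to mean exactly 100,000 steps).
steps = 100_000      # training steps
batch_size = 2048    # sequences per step
seq_length = 512     # tokens per sequence, assuming no padding


def total_tokens(steps: int, batch_size: int, seq_length: int) -> int:
    """Upper bound on tokens seen: every sequence counted at full length."""
    return steps * batch_size * seq_length


tokens = total_tokens(steps, batch_size, seq_length)
print(f"{tokens:,} tokens (~{tokens / 1e9:.1f}B)")
# prints: 104,857,600,000 tokens (~104.9B)
```

Under these assumptions the product is about 104.9B tokens, which is consistent with the README's **104B** if that figure was truncated rather than rounded.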