Migrate model card from transformers-repo
Browse filesRead announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md
README.md
CHANGED
@@ -1,3 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
## GigaBERT-v3
|
2 |
GigaBERT-v3 is a customized bilingual BERT for English and Arabic. It was pre-trained in a large-scale corpus (Gigaword+Oscar+Wikipedia) with ~10B tokens, showing state-of-the-art zero-shot transfer performance from English to Arabic on information extraction (IE) tasks. More details can be found in the following paper:
|
3 |
|
@@ -8,11 +18,10 @@ GigaBERT-v3 is a customized bilingual BERT for English and Arabic. It was pre-tr
|
|
8 |
year = {2020}
|
9 |
}
|
10 |
|
11 |
-
##
|
12 |
```
|
13 |
from transformers import *
|
14 |
tokenizer = BertTokenizer.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English", do_lower_case=True)
|
15 |
model = BertForTokenClassification.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English")
|
16 |
```
|
17 |
-
|
18 |
-
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
- ar
|
5 |
+
datasets:
|
6 |
+
- gigaword
|
7 |
+
- oscar
|
8 |
+
- wikipedia
|
9 |
+
---
|
10 |
+
|
11 |
## GigaBERT-v3
|
12 |
GigaBERT-v3 is a customized bilingual BERT for English and Arabic. It was pre-trained in a large-scale corpus (Gigaword+Oscar+Wikipedia) with ~10B tokens, showing state-of-the-art zero-shot transfer performance from English to Arabic on information extraction (IE) tasks. More details can be found in the following paper:
|
13 |
|
|
|
18 |
year = {2020}
|
19 |
}
|
20 |
|
21 |
+
## Usage
|
22 |
```
|
23 |
from transformers import *
|
24 |
tokenizer = BertTokenizer.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English", do_lower_case=True)
|
25 |
model = BertForTokenClassification.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English")
|
26 |
```
|
27 |
+
More code examples can be found [here](https://github.com/lanwuwei/GigaBERT).
|
|