Ahmed Abdelali commited on
Commit
ef485b3
1 Parent(s): 6c37233

pushd updates for README and config

Browse files
Files changed (2) hide show
  1. README.md +4 -2
  2. config.json +1 -0
README.md CHANGED
@@ -3,6 +3,8 @@ language: ar
3
  tags:
4
  - pytorch
5
  - tf
 
 
6
  datasets:
7
  - arabic_billion_words
8
  - open_subtitles
@@ -29,11 +31,11 @@ QARiB: Is the Arabic name for "Boat".
29
  - Number of Layers: 12
30
 
31
  ## Training QARiB
32
- See details in [Training QARiB](./Training_QARiB.md)
33
 
34
  ## Using QARiB
35
 
36
- You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to be fine-tuned on a downstream task. See the model hub to look for fine-tuned versions on a task that interests you. For more details, see [Using QARiB](./Using_QARiB.md)
37
 
38
  ### How to use
39
  You can use this model directly with a pipeline for masked language modeling:
 
3
  tags:
4
  - pytorch
5
  - tf
6
+ - QARiB
7
+ - qarib
8
  datasets:
9
  - arabic_billion_words
10
  - open_subtitles
 
31
  - Number of Layers: 12
32
 
33
  ## Training QARiB
34
+ See details in [Training QARiB](https://github.com/qcri/QARIB/Training_QARiB.md)
35
 
36
  ## Using QARiB
37
 
38
+ You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to be fine-tuned on a downstream task. See the model hub to look for fine-tuned versions on a task that interests you. For more details, see [Using QARiB](https://github.com/qcri/QARIB/Using_QARiB.md)
39
 
40
  ### How to use
41
  You can use this model directly with a pipeline for masked language modeling:
config.json CHANGED
@@ -5,6 +5,7 @@
5
  "attention_probs_dropout_prob": 0.1,
6
  "directionality": "bidi",
7
  "hidden_act": "gelu",
 
8
  "hidden_dropout_prob": 0.1,
9
  "hidden_size": 768,
10
  "initializer_range": 0.02,
 
5
  "attention_probs_dropout_prob": 0.1,
6
  "directionality": "bidi",
7
  "hidden_act": "gelu",
8
+ "model_type": "bert",
9
  "hidden_dropout_prob": 0.1,
10
  "hidden_size": 768,
11
  "initializer_range": 0.02,