Update README.md
README.md CHANGED

@@ -17,9 +17,6 @@ This model is a pre-trained BERT-Base trained in two phases on the [Graphcore/wi
 
 Pre-trained BERT Base model trained on Wikipedia data.
 
-## Intended uses & limitations
-
-More information needed
 
 ## Training and evaluation data
 
@@ -31,7 +28,7 @@ Trained on wikipedia datasets:
 ## Training procedure
 
 Trained MLM and NSP pre-training scheme from [Large Batch Optimization for Deep Learning: Training BERT in 76 minutes](https://arxiv.org/abs/1904.00962).
-Trained on 16 Graphcore Mk2 IPUs.
+Trained on 16 Graphcore Mk2 IPUs using [`optimum-graphcore`](https://github.com/huggingface/optimum-graphcore)
 
 Command lines:
 
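The card cites the standard BERT MLM pre-training objective. As a rough illustration of the usual masking policy (select ~15% of tokens; of those, 80% become `[MASK]`, 10% a random token, 10% left unchanged), here is a minimal generic sketch in plain Python — not Graphcore's actual data pipeline, and the function name and vocabulary are made up for the example:

```python
import random

MASK_TOKEN = "[MASK]"

def mask_for_mlm(tokens, vocab, mask_prob=0.15, rng=None):
    """BERT-style MLM masking sketch.

    Selects roughly `mask_prob` of positions as prediction targets;
    of those, 80% are replaced with [MASK], 10% with a random vocab
    token, and 10% are kept unchanged (but still predicted).
    Returns (masked_tokens, labels): labels[i] holds the original
    token at selected positions and None elsewhere.
    """
    rng = rng or random.Random()
    masked = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            labels[i] = tok  # this position becomes a prediction target
            r = rng.random()
            if r < 0.8:
                masked[i] = MASK_TOKEN        # 80%: replace with [MASK]
            elif r < 0.9:
                masked[i] = rng.choice(vocab)  # 10%: replace with random token
            # else (10%): keep the original token
    return masked, labels

# Example: mask a toy token sequence with a fixed seed for reproducibility.
tokens = ["the", "cat", "sat", "on", "the", "mat"] * 10
vocab = ["the", "cat", "sat", "on", "mat", "dog", "ran"]
masked, labels = mask_for_mlm(tokens, vocab, rng=random.Random(0))
```

The loss is then computed only at positions where `labels` is not `None`, which is why the 10% "kept unchanged" case still contributes training signal.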