Spaces:
Running
Running
File size: 1,293 Bytes
bd9ac5f |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 |
# Saves models to disk to package in dockerfile
## zari-bert-cda
Converts [zari-bert-cda](https://github.com/google-research-datasets/Zari) to a Hugging Face model.
Download original model
```
mkdir raw
cd raw
curl https://storage.googleapis.com/bert_models/filbert/2020_10_13/zari-bert-cda.tar.gz -o zari-bert-cda.tar.gz
tar xvzf zari-bert-cda.tar.gz
```
Convert
```
source ../../env/bin/activate
transformers-cli convert --model_type bert \
--tf_checkpoint zari-bert-cda/model.ckpt \
--config zari-bert-cda/bert_config.json \
--pytorch_dump_output zari-bert-cda/pytorch_model.bin
cp zari-bert-cda/bert_config.json zari-bert-cda/config.json
```
Copy to docker directory
```
mkdir ../../py/zari-bert-cda
cp zari-bert-cda/config.json ../../py/zari-bert-cda/config.json
cp zari-bert-cda/vocab.txt ../../py/zari-bert-cda/vocab.txt
cp zari-bert-cda/pytorch_model.bin ../../py/zari-bert-cda/pytorch_model.bin
```
## bert-large-uncased-whole-word-masking
```
cd ../py
source env/bin/activate
python model_bert_large_export.py
```
## Upload files
```
cd ../py
gsutil -o "GSUtil:parallel_process_count=1" -m rsync -r zari-bert-cda gs://uncertainty-over-space/zari-bert-cda
```
https://storage.googleapis.com/uncertainty-over-space/zari/zari-bert-cda/vocab.txt
|