uer
/

roberta-base-finetuned-cluener2020-chinese

Token Classification

Inference Endpoints

Model card Files Files and versions Community

uer commited on Apr 27, 2021

Commit

6f886ed

•

1 Parent(s): 46cf9b3

Update README.md

Files changed (1) hide show

README.md +7 -7

README.md CHANGED Viewed

@@ -40,13 +40,13 @@ The model is fine-tuned by [UER-py](https://github.com/dbiir/UER-py/) on [Tencen
 ```
 python3 run_ner.py --pretrained_model_path models/cluecorpussmall_roberta_base_seq512_model.bin-250000 \
-                          --vocab_path models/google_zh_vocab.txt \
-                          --train_path datasets/cluener2020/train.tsv \
-                          --dev_path datasets/cluener2020/dev.tsv \
-                          --label2id_path datasets/cluener2020/label2id.json \
-                          --output_model_path models/cluener2020_classifier_model.bin \
-                          --learning_rate 3e-5 --batch_size 32 --epochs_num 5 --seq_length 512 \
-                          --embedding word_pos_seg --encoder transformer --mask fully_visible
 ```
 Finally, we convert the pre-trained model into Huggingface's format:

 ```
 python3 run_ner.py --pretrained_model_path models/cluecorpussmall_roberta_base_seq512_model.bin-250000 \
+                   --vocab_path models/google_zh_vocab.txt \
+                   --train_path datasets/cluener2020/train.tsv \
+                   --dev_path datasets/cluener2020/dev.tsv \
+                   --label2id_path datasets/cluener2020/label2id.json \
+                   --output_model_path models/cluener2020_classifier_model.bin \
+                   --learning_rate 3e-5 --batch_size 32 --epochs_num 5 --seq_length 512 \
+                   --embedding word_pos_seg --encoder transformer --mask fully_visible
 ```
 Finally, we convert the pre-trained model into Huggingface's format: