Introduction

These models were trained on TPU; details are listed below.

Model

| Model_name | Description | Params | Size | Training_corpus | Vocab |
| :--- | :--- | :--- | :--- | :--- | :--- |
| RoBERTa-tiny-clue | Super small model | 7.5M | 28.3M | CLUECorpus2020 | CLUEVocab |
| RoBERTa-tiny-pair | Super small sentence-pair model | 7.5M | 28.3M | CLUECorpus2020 | CLUEVocab |
| RoBERTa-tiny3L768-clue | Small model | 38M | 110M | CLUECorpus2020 | CLUEVocab |
| RoBERTa-tiny3L312-clue | Small model | <7.5M | 24M | CLUECorpus2020 | CLUEVocab |
| RoBERTa-large-clue | Large model | 290M | 1.20G | CLUECorpus2020 | CLUEVocab |
| RoBERTa-large-pair | Large sentence-pair model | 290M | 1.20G | CLUECorpus2020 | CLUEVocab |
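
The parameter counts above can be sanity-checked directly. Below is a minimal sketch (our own illustration, not part of the original card), assuming the checkpoints load with the BERT classes as shown in the Usage section:

from transformers import BertModel

# Load the super small model and count its weights (expected: roughly 7.5M).
model = BertModel.from_pretrained("clue/roberta_chinese_clue_tiny")
n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.1f}M parameters")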

Usage

With Huggingface-Transformers (tested with version 2.5.1), you can use these models as follows:

from transformers import BertTokenizer, BertModel

# Note: load these checkpoints with the BERT classes, not the RoBERTa classes.
tokenizer = BertTokenizer.from_pretrained("MODEL_NAME")
model = BertModel.from_pretrained("MODEL_NAME")

MODEL_NAME values:

| Model | MODEL_NAME |
| :--- | :--- |
| RoBERTa-tiny-clue | clue/roberta_chinese_clue_tiny |
| RoBERTa-tiny-pair | clue/roberta_chinese_pair_tiny |
| RoBERTa-tiny3L768-clue | clue/roberta_chinese_3L768_clue_tiny |
| RoBERTa-tiny3L312-clue | clue/roberta_chinese_3L312_clue_tiny |
| RoBERTa-large-clue | clue/roberta_chinese_clue_large |
| RoBERTa-large-pair | clue/roberta_chinese_pair_large |
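
For example, here is a minimal end-to-end sketch using the super small model. The sample sentences are our own illustration, and the tuple-style outputs match the Transformers 2.x API pinned above (Transformers 3+/4+ returns model-output objects instead):

import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("clue/roberta_chinese_clue_tiny")
model = BertModel.from_pretrained("clue/roberta_chinese_clue_tiny")

# Encode a sample sentence and run a forward pass.
inputs = tokenizer.encode_plus("这个模型非常小。", return_tensors="pt")  # "This model is very small."
with torch.no_grad():
    outputs = model(**inputs)

last_hidden_state = outputs[0]  # shape: (batch, seq_len, hidden_size)
print(last_hidden_state.shape)

# For the *-pair models, encode two sentences jointly:
pair_inputs = tokenizer.encode_plus("句子一", "句子二", return_tensors="pt")  # "sentence one", "sentence two"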

Details

For full training details and benchmark results, please read the paper: https://arxiv.org/pdf/2003.01355.

Please visit our repository: https://github.com/CLUEbenchmark/CLUEPretrainedModels.git
