
Vocabulary Trimmed xlm-roberta-base: vocabtrimmer/xlm-roberta-base-trimmed-pt-15000

This model is a trimmed version of xlm-roberta-base produced by vocabtrimmer, a tool that trims the vocabulary of a language model to reduce its size. The following table summarizes the trimming process.

| | xlm-roberta-base | vocabtrimmer/xlm-roberta-base-trimmed-pt-15000 |
|---|---|---|
| parameter_size_full | 278,295,186 | 97,580,186 |
| parameter_size_embedding | 192,001,536 | 11,521,536 |
| vocab_size | 250,002 | 15,002 |
| compression_rate_full (%) | 100.0 | 35.06 |
| compression_rate_embedding (%) | 100.0 | 6.0 |
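
The compression rates in the table above follow directly from the parameter counts. The sketch below reproduces that arithmetic, assuming the rates are percentages of the original size rounded to two decimals and that the embedding width is 768. The function and variable names are illustrative only and are not part of vocabtrimmer.

```python
# Figures copied from the summary table.
full_before, full_after = 278_295_186, 97_580_186
emb_before, emb_after = 192_001_536, 11_521_536
vocab_before, vocab_after = 250_002, 15_002

HIDDEN_SIZE = 768  # assumption: embedding width of xlm-roberta-base


def compression_rate(after: int, before: int) -> float:
    """Size after trimming as a percentage of the original size."""
    return round(100 * after / before, 2)


rate_full = compression_rate(full_after, full_before)
rate_embedding = compression_rate(emb_after, emb_before)

# The embedding matrix holds vocab_size * hidden_size parameters.
emb_check_before = vocab_before * HIDDEN_SIZE
emb_check_after = vocab_after * HIDDEN_SIZE

print(rate_full, rate_embedding)
print(emb_check_before == emb_before, emb_check_after == emb_after)
```

Because the embedding matrix scales linearly with the vocabulary size, cutting the vocabulary to about 6% of its original size shrinks the embedding to the same fraction, while the rest of the network is essentially unchanged.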

The following table shows the parameters used to trim the vocabulary.

| language | dataset | dataset_column | dataset_name | dataset_split | target_vocab_size | min_frequency |
|---|---|---|---|---|---|---|
| pt | vocabtrimmer/mc4_validation | text | pt | validation | 15000 | 2 |
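
To illustrate how target_vocab_size and min_frequency might interact, here is a minimal sketch of frequency-based vocabulary selection. It is not vocabtrimmer's actual implementation: the function `select_vocab` is hypothetical, and whitespace splitting stands in for the model's subword tokenizer.

```python
from collections import Counter
from typing import Iterable, List


def select_vocab(texts: Iterable[str], target_vocab_size: int, min_frequency: int) -> List[str]:
    """Keep the most frequent tokens that occur at least min_frequency times."""
    counts = Counter()
    for text in texts:
        # Assumption: whitespace tokens stand in for subword tokens.
        counts.update(text.split())

    # Drop tokens that are too rare to keep.
    frequent = [(tok, n) for tok, n in counts.items() if n >= min_frequency]

    # Sort by descending frequency, breaking ties alphabetically.
    frequent.sort(key=lambda item: (-item[1], item[0]))

    # Cap the vocabulary at the target size.
    return [tok for tok, _ in frequent[:target_vocab_size]]


corpus = ["o gato dorme", "o gato come", "o cão dorme"]
kept = select_vocab(corpus, target_vocab_size=2, min_frequency=2)
print(kept)
```

In this toy corpus, three tokens meet the frequency threshold, and the size cap then keeps only the top two. With the settings in the table, the same idea would apply to the Portuguese validation split with a cap of 15000 tokens and a minimum frequency of 2.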