Model: MiniLM
    Lang: IT
  

Model description

This is a MiniLMv2 [1] model for the Italian language, obtained using mMiniLMv2 (L6xH384 mMiniLMv2) as a starting point and focusing it on the Italian language by modifying the embedding layer (as in [2], computing document-level frequencies over the Wikipedia dataset)

The resulting model has 23M parameters, a vocabulary of 30.498 tokens, and a size of ~90 MB.

References

[1] https://arxiv.org/abs/2012.15828

[2] https://arxiv.org/abs/2010.05609

License

The model is released under MIT license

Downloads last month
117
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Collection including osiria/minilm-l6-h384-italian-cased