LightEmbed/sbert-paraphrase-multilingual-mpnet-base-v2-onnx

This is the ONNX version of the Sentence Transformers model sentence-transformers/paraphrase-multilingual-mpnet-base-v2 for sentence embedding, optimized for speed and lightweight performance. By utilizing onnxruntime and tokenizers instead of heavier libraries like sentence-transformers and transformers, this version ensures a smaller library size and faster execution. Below are the details of the model:

  • Base model: sentence-transformers/paraphrase-multilingual-mpnet-base-v2
  • Embedding dimension: 768
  • Max sequence length: 128
  • File size on disk: 1.03 GB
  • Pooling incorporated: Yes

This ONNX model consists all components in the original sentence transformer model: Transformer, Pooling

Usage (LightEmbed)

Using this model becomes easy when you have LightEmbed installed:

pip install -U light-embed

Then you can use the model using the original model name like this:

from light_embed import TextEmbedding
sentences = [
    "This is an example sentence",
    "Each sentence is converted"
]

model = TextEmbedding('sentence-transformers/paraphrase-multilingual-mpnet-base-v2')
embeddings = model.encode(sentences)
print(embeddings)

Then you can use the model using onnx model name like this:

from light_embed import TextEmbedding
sentences = [
    "This is an example sentence",
    "Each sentence is converted"
]

model = TextEmbedding('LightEmbed/sbert-paraphrase-multilingual-mpnet-base-v2-onnx')
embeddings = model.encode(sentences)
print(embeddings)

Citing & Authors

Binh Nguyen / binhcode25@gmail.com

Downloads last month
95
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support sentence-similarity models for light-embed library.