hrusheekeshsawarkar
/

indic-sentence-bert-nli-matryoshka

Sentence Similarity

sentence-transformers

feature-extraction

text-embeddings-inference

Inference Endpoints

Model card Files Files and versions Community

hrusheekeshsawarkar commited on Aug 26, 2024

Commit

f61a66e

·

verified ·

1 Parent(s): b6c4f5e

update

Files changed (1) hide show

README.md +6 -5

README.md CHANGED Viewed

@@ -24,8 +24,9 @@ language:
 # {MODEL_NAME}
-This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
 <!--- Describe your model here -->
 ## Usage (Sentence-Transformers)
@@ -42,13 +43,13 @@ Then you can use the model like this:
 from sentence_transformers import SentenceTransformer
 from sentence_transformers.util import cos_sim
-matryoshka_dim = 64
 sentences =
       [
-        "The weather is so nice!",
-        "It's so sunny outside!",
-        "He drove to the stadium.",
       ]
 model = SentenceTransformer("hrusheekeshsawarkar/indic-sentence-bert-nli-matryoshka",truncate_dim=matryoshka_dim)

 # {MODEL_NAME}
+This is a [sentence-transformers](https://www.SBERT.net) model: Sentence Tranformers is a commonly used framework to train embedding models, and it recently implemented support for Matryoshka models. Training a Matryoshka embedding model using Sentence Transformers is quite elementary: rather than applying some loss function on only the full-size embeddings, we also apply that same loss function on truncated portions of the embeddings.
+For example, if a model has an original embedding dimension of 768, it can now be trained on 768, 512, 256, 128 and 64. Each of these losses will be added together, optionally with some weight. this model is specifically finetuned on 11 major Indian languages.
 <!--- Describe your model here -->
 ## Usage (Sentence-Transformers)
 from sentence_transformers import SentenceTransformer
 from sentence_transformers.util import cos_sim
+matryoshka_dim = 64 # Specify the embedding shape here
 sentences =
       [
+        "मौसम बहुत अच्छा है!",
+        "बाहर बहुत धूप है!",
+        "वह गाड़ी चलाकर स्टेडियम गया।",
       ]
 model = SentenceTransformer("hrusheekeshsawarkar/indic-sentence-bert-nli-matryoshka",truncate_dim=matryoshka_dim)