jinaai
/

jina-embeddings-v2-base-zh

@@ -4,6 +4,7 @@ tags:
   - feature-extraction
   - sentence-similarity
   - mteb
 license: apache-2.0
 language:
 - en
@@ -1073,6 +1074,9 @@ model-index:
 <b>The text embedding set trained by <a href="https://jina.ai/"><b>Jina AI</b></a>.</b>
 </p>
 ## Intended Usage & Model Info
@@ -1088,13 +1092,17 @@ Additionally, we provide the following embedding models:
 - [`jina-embeddings-v2-small-en`](https://huggingface.co/jinaai/jina-embeddings-v2-small-en): 33 million parameters.
 - [`jina-embeddings-v2-base-en`](https://huggingface.co/jinaai/jina-embeddings-v2-base-en): 137 million parameters.
-- [`jina-embeddings-v2-base-zh`](): Chinese-English Bilingual embeddings (soon) **(you are here)**.
-- [`jina-embeddings-v2-base-de`](): German-English Bilingual embeddings (soon).
-- [`jina-embeddings-v2-base-es`](): Spanish-English Bilingual embeddings (soon).
 ## Data & Parameters
-Jina Embeddings V2 [technical report](https://arxiv.org/abs/2310.19923)
 ## Usage
@@ -1157,9 +1165,23 @@ embeddings = model.encode(
 )
 ```
-## Fully-managed Embeddings Service
-Alternatively, you can use Jina AI's [Embedding platform](https://jina.ai/embeddings/) for fully-managed access to Jina Embeddings models.
 ## Use Jina Embeddings for RAG
@@ -1170,12 +1192,6 @@ According to the latest blog post from [LLamaIndex](https://blog.llamaindex.ai/b
 <img src="https://miro.medium.com/v2/resize:fit:4800/format:webp/1*ZP2RVejCZovF3FDCg-Bx3A.png" width="780px">
-## Plans
-1. Bilingual embedding models supporting more European & Asian languages, including Spanish, French, Italian and Japanese.
-2. Multimodal embedding models enable Multimodal RAG applications.
-3. High-performt rerankers.
 ## Contact
 Join our [Discord community](https://discord.jina.ai) and chat with other community members about ideas.

   - feature-extraction
   - sentence-similarity
   - mteb
+inference: false
 license: apache-2.0
 language:
 - en
 <b>The text embedding set trained by <a href="https://jina.ai/"><b>Jina AI</b></a>.</b>
 </p>
+## Quick Start
+The easiest way to starting using `jina-embeddings-v2-base-de` is to use Jina AI's [Embedding API](https://jina.ai/embeddings/).
 ## Intended Usage & Model Info
 - [`jina-embeddings-v2-small-en`](https://huggingface.co/jinaai/jina-embeddings-v2-small-en): 33 million parameters.
 - [`jina-embeddings-v2-base-en`](https://huggingface.co/jinaai/jina-embeddings-v2-base-en): 137 million parameters.
+- [`jina-embeddings-v2-base-zh`](https://huggingface.co/jinaai/jina-embeddings-v2-base-zh): 161 million parameters Chinese-English Bilingual embeddings. **(you are here)**
+- [`jina-embeddings-v2-base-de`](https://huggingface.co/jinaai/jina-embeddings-v2-base-de): 161 million parameters German-English Bilingual embeddings.
+- _[`jina-embeddings-v2-base-es`](): Spanish-English Bilingual embeddings (soon)._
+- _Bilingual embedding models in other world languages (soon)._
+- _Multimodal-input embedding model (soon)._
+- _High-performing reranking model (soon)._
 ## Data & Parameters
+We will publish a report with technical details about the training of the bilingual models soon.
+The training of the English model is described in this [technical report](https://arxiv.org/abs/2310.19923).
 ## Usage
 )
 ```
+If you want to use the model together with the [sentence-transformers package](https://github.com/UKPLab/sentence-transformers/), make sure that you have installed the latest release and set `trust_remote_code=True` as well:
+```
+!pip install -U sentence-transformers
+from sentence_transformers import SentenceTransformer
+from numpy.linalg import norm
+cos_sim = lambda a,b: (a @ b.T) / (norm(a)*norm(b))
+model = SentenceTransformer('jinaai/jina-embeddings-v2-base-de', trust_remote_code=True)
+embeddings = model.encode(['How is the weather today?', 'Wie ist das Wetter heute?'])
+print(cos_sim(embeddings[0], embeddings[1]))
+```
+## Alternatives to Using Transformers Package
+1. _Managed SaaS_: Get started with a free key on Jina AI's [Embedding API](https://jina.ai/embeddings/).
+2. _Private and high-performance deployment_: Get started by picking from our suite of models and deploy them on [AWS Sagemaker](https://aws.amazon.com/marketplace/seller-profile?id=seller-stch2ludm6vgy).
 ## Use Jina Embeddings for RAG
 <img src="https://miro.medium.com/v2/resize:fit:4800/format:webp/1*ZP2RVejCZovF3FDCg-Bx3A.png" width="780px">
 ## Contact
 Join our [Discord community](https://discord.jina.ai) and chat with other community members about ideas.