Edit model card

This similarity model was trained for 2 epochs based on the sentence-transformers/paraphrase-multilingual-mpnet-base-v2. We used the SQuAD 1.1 dataset (brazilian portuguese version) to compare similarity between questions and sentences containing the answer to the question. We employed the MultipleNegativesRankingLoss as the objective function. To generate negative examples, our strategy involved using BM25 to retrieve similar examples from all sentences in the dataset that did not contain the answer.

We tested this model using Faquad QA portuguese dataset and improved dense retrieval by 10% in MRR@10 compared to the base model.

Downloads last month
23
Safetensors
Model size
278M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.