beeformer/Llama-goodlens-mpnet

Sentence Similarity

sentence-transformers

feature-extraction

Inference Endpoints


beeformer committed on Aug 2, 2024

Commit b75a927 · verified · 1 Parent(s): 8f9a4ed

Update README.md

Files changed (1)

README.md +1 -1


@@ -43,7 +43,7 @@ We use the pretrained [`sentence-transformers/all-mpnet-base-v2`](https://huggin
  We use the initial model without modifying its architecture or pre-trained model parameters.
  However, we reduce the processed sequence length to 384 to reduce the training time of the model.
  Regarding other hyperparameters, we use the same interaction data batch size of 1024; we use the negative sampling parameter m = 10000.
- We use constant learning rate of 1e-5, and we train the model for five epochs.
+ We use constant learning rate of 1e-5, and we train the model for ten epochs.
 ### Dataset