Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ We use the pretrained [`sentence-transformers/all-mpnet-base-v2`](https://huggin
|
|
43 |
We use the initial model without modifying its architecture or pre-trained model parameters.
|
44 |
However, we reduce the processed sequence length to 384 to reduce the training time of the model.
|
45 |
Regarding other hyperparameters, we use the same interaction data batch size of 1024; we use the negative sampling parameter m = 10000.
|
46 |
-
We use constant learning rate of 1e-5, and we train the model for
|
47 |
|
48 |
### Dataset
|
49 |
|
|
|
43 |
We use the initial model without modifying its architecture or pre-trained model parameters.
|
44 |
However, we reduce the processed sequence length to 384 to reduce the training time of the model.
|
45 |
Regarding other hyperparameters, we use the same interaction data batch size of 1024; we use the negative sampling parameter m = 10000.
|
46 |
+
We use constant learning rate of 1e-5, and we train the model for ten epochs.
|
47 |
|
48 |
### Dataset
|
49 |
|