Update README.md
Browse files
README.md
CHANGED
@@ -42,7 +42,7 @@ We use the pretrained [`sentence-transformers/all-mpnet-base-v2`](https://huggin
|
|
42 |
We use the initial model without modifying its architecture or pre-trained model parameters.
|
43 |
However, we reduce the processed sequence length to 256 to reduce the training time of the model.
|
44 |
Regarding other hyperparameters, we use the same interaction data batch size of 1024; we use the negative sampling parameter m = 10000.
|
45 |
-
We use constant learning rate of 1e-5, and we train the model for
|
46 |
We finetuned our model on the MovieLens-20M dataset. For details please see our paper (link TBA).
|
47 |
|
48 |
For item ids used during training please see (links TBA).
|
|
|
42 |
We use the initial model without modifying its architecture or pre-trained model parameters.
|
43 |
However, we reduce the processed sequence length to 256 to reduce the training time of the model.
|
44 |
Regarding other hyperparameters, we use the same interaction data batch size of 1024; we use the negative sampling parameter m = 10000.
|
45 |
+
We use constant learning rate of 1e-5, and we train the model for five epochs.
|
46 |
We finetuned our model on the MovieLens-20M dataset. For details please see our paper (link TBA).
|
47 |
|
48 |
For item ids used during training please see (links TBA).
|