danielpark committed

Commit d81c518
1 Parent(s): d97ecc6

Update README.md

Files changed (1)
  1. README.md +20 -0
README.md CHANGED
@@ -53,6 +53,26 @@ We are forking sentence-transformers/all-MiniLM-L6-v2 as it is similar to the ta
- Used the AdamW optimizer with a 2e-5 learning rate.
- The full training script is accessible in this current repository: `train_script.py`.

+
+ #### Performance
+
+ | Model Name | Performance Sentence Embeddings (14 Datasets) | Performance Semantic Search (6 Datasets) | Avg. Performance | Speed | Model Size |
+ |----------------------------------------------|-----------------------------------------------|----------------------------------------|-----------------|-------|------------|
+ | all-mpnet-base-v2 | 69.57 | 57.02 | 63.30 | 2800 | 420 MB |
+ | multi-qa-mpnet-base-dot-v1 | 66.76 | 57.60 | 62.18 | 2800 | 420 MB |
+ | all-distilroberta-v1 | 68.73 | 50.94 | 59.84 | 4000 | 290 MB |
+ | all-MiniLM-L12-v2 | 68.70 | 50.82 | 59.76 | 7500 | 120 MB |
+ | multi-qa-distilbert-cos-v1 | 65.98 | 52.83 | 59.41 | 4000 | 250 MB |
+ | **all-MiniLM-L6-v2** | **68.06** | **49.54** | **58.80** | **14200** | **80 MB** |
+ | multi-qa-MiniLM-L6-cos-v1 | 64.33 | 51.83 | 58.08 | 14200 | 80 MB |
+ | paraphrase-multilingual-mpnet-base-v2 | 65.83 | 41.68 | 53.75 | 2500 | 970 MB |
+ | paraphrase-albert-small-v2 | 64.46 | 40.04 | 52.25 | 5000 | 43 MB |
+ | paraphrase-multilingual-MiniLM-L12-v2 | 64.25 | 39.19 | 51.72 | 7500 | 420 MB |
+ | paraphrase-MiniLM-L3-v2 | 62.29 | 39.19 | 50.74 | 19000 | 61 MB |
+ | distiluse-base-multilingual-cased-v1 | 61.30 | 29.87 | 45.59 | 4000 | 480 MB |
+ | distiluse-base-multilingual-cased-v2 | 60.18 | 27.35 | 43.77 | 4000 | 480 MB |
+
+

#### Datasets
| Dataset | Paper | Number of training tuples |
|--------------------------------------------------------|:----------------------------------------:|:--------------------------:|
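The training notes in the diff context above mention AdamW at a 2e-5 learning rate, with the full script in `train_script.py` (not part of this commit). A minimal sketch of what that optimizer setup typically looks like in PyTorch; the checkpoint name and the idea of fine-tuning the forked model are assumptions, not contents of `train_script.py`:

```python
import torch
from sentence_transformers import SentenceTransformer

# Assumption: fine-tuning starts from the forked all-MiniLM-L6-v2 checkpoint.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

# AdamW with the 2e-5 learning rate stated in the README diff above.
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
```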
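Two notes on the added Performance table: Avg. Performance is the mean of the two benchmark columns (for all-mpnet-base-v2, (69.57 + 57.02) / 2 ≈ 63.30), and Speed is presumably sentences encoded per second. The bolded all-MiniLM-L6-v2 row is the checkpoint this repository forks; a minimal usage sketch with the public sentence-transformers API:

```python
from sentence_transformers import SentenceTransformer

# Load the checkpoint highlighted in the table above.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

sentences = [
    "This framework generates embeddings for each input sentence.",
    "Sentences are passed as a list of strings.",
]

# encode() returns one 384-dimensional vector per input sentence.
embeddings = model.encode(sentences)
print(embeddings.shape)  # (2, 384)
```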