Add new SentenceTransformer model.

Files changed (3)

README.md +114 -114
model.safetensors +1 -1
runs/Sep12_19-08-06_default/events.out.tfevents.1726168095.default.1847.0 +3 -0

README.md CHANGED

@@ -46,31 +46,31 @@ tags:
 - dataset_size:680
 - loss:ContrastiveLoss
 widget:
-- source_sentence: 他の選択肢は？
   sentences:
-  - どこを探す？
-  - 物の姿を変える魔法が使える村人を知っている？
-  - 村長選で忙しいから
-- source_sentence: ジャックについて教えて
   sentences:
-  - 井戸へ訪れた？
-  - 青いオーブがどこにあるか知ってる？
-  - それは物の見た目を変える魔法
-- source_sentence: 物の姿を変える魔法が使える村人を知っている？
   sentences:
-  - タイマツが欲しい
-  - それは何？
-  - どっちがいいと思う？
-- source_sentence: リリアンはどんな魔法が使えるの？
   sentences:
-  - どうしてキャンドルなの？
-  - 物の姿を変える魔法が使える村人を知っている？
-  - 物体を変える
-- source_sentence: なにするんだっけ？
   sentences:
-  - 魔法使い
-  - なにすればいい？
-  - どっちをさがせばいい？
 model-index:
 - name: SentenceTransformer based on colorfulscoop/sbert-base-ja
   results:
@@ -82,109 +82,109 @@ model-index:
       type: custom-arc-semantics-data-jp
     metrics:
     - type: cosine_accuracy
-      value: 0.875
       name: Cosine Accuracy
     - type: cosine_accuracy_threshold
-      value: 0.7639791965484619
       name: Cosine Accuracy Threshold
     - type: cosine_f1
-      value: 0.896969696969697
       name: Cosine F1
     - type: cosine_f1_threshold
-      value: 0.7639791965484619
       name: Cosine F1 Threshold
     - type: cosine_precision
-      value: 0.8705882352941177
       name: Cosine Precision
     - type: cosine_recall
-      value: 0.925
       name: Cosine Recall
     - type: cosine_ap
-      value: 0.852066796474829
       name: Cosine Ap
     - type: dot_accuracy
-      value: 0.875
       name: Dot Accuracy
     - type: dot_accuracy_threshold
-      value: 398.1038513183594
       name: Dot Accuracy Threshold
     - type: dot_f1
-      value: 0.9017341040462428
       name: Dot F1
     - type: dot_f1_threshold
-      value: 398.1038513183594
       name: Dot F1 Threshold
     - type: dot_precision
-      value: 0.8387096774193549
       name: Dot Precision
     - type: dot_recall
-      value: 0.975
       name: Dot Recall
     - type: dot_ap
-      value: 0.8574534537645885
       name: Dot Ap
     - type: manhattan_accuracy
-      value: 0.875
       name: Manhattan Accuracy
     - type: manhattan_accuracy_threshold
-      value: 349.35498046875
       name: Manhattan Accuracy Threshold
     - type: manhattan_f1
-      value: 0.896969696969697
       name: Manhattan F1
     - type: manhattan_f1_threshold
-      value: 363.05401611328125
       name: Manhattan F1 Threshold
     - type: manhattan_precision
-      value: 0.8705882352941177
       name: Manhattan Precision
     - type: manhattan_recall
-      value: 0.925
       name: Manhattan Recall
     - type: manhattan_ap
-      value: 0.8514114774274522
       name: Manhattan Ap
     - type: euclidean_accuracy
-      value: 0.875
       name: Euclidean Accuracy
     - type: euclidean_accuracy_threshold
-      value: 15.954280853271484
       name: Euclidean Accuracy Threshold
     - type: euclidean_f1
-      value: 0.896969696969697
       name: Euclidean F1
     - type: euclidean_f1_threshold
-      value: 16.386924743652344
       name: Euclidean F1 Threshold
     - type: euclidean_precision
-      value: 0.8705882352941177
       name: Euclidean Precision
     - type: euclidean_recall
-      value: 0.925
       name: Euclidean Recall
     - type: euclidean_ap
-      value: 0.851318148268234
       name: Euclidean Ap
     - type: max_accuracy
-      value: 0.875
       name: Max Accuracy
     - type: max_accuracy_threshold
-      value: 398.1038513183594
       name: Max Accuracy Threshold
     - type: max_f1
-      value: 0.9017341040462428
       name: Max F1
     - type: max_f1_threshold
-      value: 398.1038513183594
       name: Max F1 Threshold
     - type: max_precision
-      value: 0.8705882352941177
       name: Max Precision
     - type: max_recall
-      value: 0.975
       name: Max Recall
     - type: max_ap
-      value: 0.8574534537645885
       name: Max Ap
 ---
@@ -238,9 +238,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
-    'なにするんだっけ？',
-    'なにすればいい？',
-    '魔法使い',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -286,41 +286,41 @@ You can finetune this model on your own dataset.
 | Metric                       | Value      |
 |:-----------------------------|:-----------|
-| cosine_accuracy              | 0.875      |
-| cosine_accuracy_threshold    | 0.764      |
-| cosine_f1                    | 0.897      |
-| cosine_f1_threshold          | 0.764      |
-| cosine_precision             | 0.8706     |
-| cosine_recall                | 0.925      |
-| cosine_ap                    | 0.8521     |
-| dot_accuracy                 | 0.875      |
-| dot_accuracy_threshold       | 398.1039   |
-| dot_f1                       | 0.9017     |
-| dot_f1_threshold             | 398.1039   |
-| dot_precision                | 0.8387     |
-| dot_recall                   | 0.975      |
-| dot_ap                       | 0.8575     |
-| manhattan_accuracy           | 0.875      |
-| manhattan_accuracy_threshold | 349.355    |
-| manhattan_f1                 | 0.897      |
-| manhattan_f1_threshold       | 363.054    |
-| manhattan_precision          | 0.8706     |
-| manhattan_recall             | 0.925      |
-| manhattan_ap                 | 0.8514     |
-| euclidean_accuracy           | 0.875      |
-| euclidean_accuracy_threshold | 15.9543    |
-| euclidean_f1                 | 0.897      |
-| euclidean_f1_threshold       | 16.3869    |
-| euclidean_precision          | 0.8706     |
-| euclidean_recall             | 0.925      |
-| euclidean_ap                 | 0.8513     |
-| max_accuracy                 | 0.875      |
-| max_accuracy_threshold       | 398.1039   |
-| max_f1                       | 0.9017     |
-| max_f1_threshold             | 398.1039   |
-| max_precision                | 0.8706     |
-| max_recall                   | 0.975      |
-| **max_ap**                   | **0.8575** |
 <!--
 ## Bias, Risks and Limitations
@@ -347,13 +347,13 @@ You can finetune this model on your own dataset.
   |         | text1                                                                            | text2                                                                            | label                                           |
   |:--------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:------------------------------------------------|
   | type    | string                                                                           | string                                                                           | int                                             |
-  | details | <ul><li>min: 4 tokens</li><li>mean: 8.34 tokens</li><li>max: 15 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 8.06 tokens</li><li>max: 14 tokens</li></ul> | <ul><li>0: ~41.36%</li><li>1: ~58.64%</li></ul> |
 * Samples:
-  | text1                    | text2                       | label          |
-  |:-------------------------|:----------------------------|:---------------|
-  | <code>夕ご飯は何を食べたの？</code> | <code>昨晩何を食べたの？</code>      | <code>1</code> |
-  | <code>キャンドルがいいな</code>   | <code>タイマツ</code>           | <code>0</code> |
-  | <code>当番表を見た</code>      | <code>木にスカーフがひっかかってる</code> | <code>0</code> |
 * Loss: [<code>ContrastiveLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#contrastiveloss) with these parameters:
   ```json
   {
@@ -371,16 +371,16 @@ You can finetune this model on your own dataset.
 * Size: 680 evaluation samples
 * Columns: <code>text1</code>, <code>text2</code>, and <code>label</code>
 * Approximate statistics based on the first 680 samples:
-  |         | text1                                                                           | text2                                                                            | label                                           |
-  |:--------|:--------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:------------------------------------------------|
-  | type    | string                                                                          | string                                                                           | int                                             |
-  | details | <ul><li>min: 4 tokens</li><li>mean: 8.1 tokens</li><li>max: 14 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 7.76 tokens</li><li>max: 14 tokens</li></ul> | <ul><li>0: ~41.18%</li><li>1: ~58.82%</li></ul> |
 * Samples:
-  | text1                     | text2                    | label          |
-  |:--------------------------|:-------------------------|:---------------|
-  | <code>何を思い出せるかな？</code>   | <code>井戸</code>          | <code>0</code> |
-  | <code>自分で探せ</code>        | <code>いらない</code>        | <code>1</code> |
-  | <code>カーテンが揺れていたから</code> | <code>辛いスープがあったから</code> | <code>0</code> |
 * Loss: [<code>ContrastiveLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#contrastiveloss) with these parameters:
   ```json
   {
@@ -520,12 +520,12 @@ You can finetune this model on your own dataset.
 ### Training Logs
 | Epoch  | Step | Training Loss | loss   | custom-arc-semantics-data-jp_max_ap |
 |:------:|:----:|:-------------:|:------:|:-----------------------------------:|
-| None   | 0    | -             | -      | 0.8251                              |
-| 1.0147 | 69   | 0.0212        | 0.0175 | 0.8337                              |
-| 2.0147 | 138  | 0.015         | 0.0156 | 0.8460                              |
-| 3.0147 | 207  | 0.0123        | 0.0149 | 0.8538                              |
-| 4.0147 | 276  | 0.0106        | 0.0146 | 0.8574                              |
-| 4.9412 | 340  | 0.0096        | 0.0145 | 0.8575                              |
 ### Framework Versions

 - dataset_size:680
 - loss:ContrastiveLoss
 widget:
+- source_sentence: 木材の山の中にスカーフはある？
   sentences:
+  - 巻き割をした？
+  - どっちが欲しい？
+  - おすすめは？
+- source_sentence: ' 君は猫なの？'
   sentences:
+  - どこ探すんだっけ？
+  - 足元よりも更に深くってなに？
+  - キミって猫？
+- source_sentence: 欲しくない
   sentences:
+  - 物体を変化できる人
+  - どっちも欲しくない
+  - スカーフがキャンプファイヤーで燃えてる
+- source_sentence: 外を見てみよう
   sentences:
+  - 誰かが魔法の呪文で花をぬいぐるみに変えた
+  - キミって猫？
+  - 長老
+- source_sentence: 他には選べないの？
   sentences:
+  - お鍋から匂いがしたから
+  - どっちがおすすめ？
+  - なにするんだっけ？
 model-index:
 - name: SentenceTransformer based on colorfulscoop/sbert-base-ja
   results:
       type: custom-arc-semantics-data-jp
     metrics:
     - type: cosine_accuracy
+      value: 0.8235294117647058
       name: Cosine Accuracy
     - type: cosine_accuracy_threshold
+      value: 0.6800776720046997
       name: Cosine Accuracy Threshold
     - type: cosine_f1
+      value: 0.8571428571428572
       name: Cosine F1
     - type: cosine_f1_threshold
+      value: 0.6610503196716309
       name: Cosine F1 Threshold
     - type: cosine_precision
+      value: 0.7912087912087912
       name: Cosine Precision
     - type: cosine_recall
+      value: 0.935064935064935
       name: Cosine Recall
     - type: cosine_ap
+      value: 0.8465974769503343
       name: Cosine Ap
     - type: dot_accuracy
+      value: 0.8161764705882353
       name: Dot Accuracy
     - type: dot_accuracy_threshold
+      value: 441.6131591796875
       name: Dot Accuracy Threshold
     - type: dot_f1
+      value: 0.8520710059171598
       name: Dot F1
     - type: dot_f1_threshold
+      value: 379.92266845703125
       name: Dot F1 Threshold
     - type: dot_precision
+      value: 0.782608695652174
       name: Dot Precision
     - type: dot_recall
+      value: 0.935064935064935
       name: Dot Recall
     - type: dot_ap
+      value: 0.8509292792079832
       name: Dot Ap
     - type: manhattan_accuracy
+      value: 0.8308823529411765
       name: Manhattan Accuracy
     - type: manhattan_accuracy_threshold
+      value: 420.1961975097656
       name: Manhattan Accuracy Threshold
     - type: manhattan_f1
+      value: 0.8622754491017963
       name: Manhattan F1
     - type: manhattan_f1_threshold
+      value: 430.6374206542969
       name: Manhattan F1 Threshold
     - type: manhattan_precision
+      value: 0.8
       name: Manhattan Precision
     - type: manhattan_recall
+      value: 0.935064935064935
       name: Manhattan Recall
     - type: manhattan_ap
+      value: 0.848438229073751
       name: Manhattan Ap
     - type: euclidean_accuracy
+      value: 0.8308823529411765
       name: Euclidean Accuracy
     - type: euclidean_accuracy_threshold
+      value: 18.93894386291504
       name: Euclidean Accuracy Threshold
     - type: euclidean_f1
+      value: 0.8588957055214723
       name: Euclidean F1
     - type: euclidean_f1_threshold
+      value: 18.93894386291504
       name: Euclidean F1 Threshold
     - type: euclidean_precision
+      value: 0.813953488372093
       name: Euclidean Precision
     - type: euclidean_recall
+      value: 0.9090909090909091
       name: Euclidean Recall
     - type: euclidean_ap
+      value: 0.8470258990606743
       name: Euclidean Ap
     - type: max_accuracy
+      value: 0.8308823529411765
       name: Max Accuracy
     - type: max_accuracy_threshold
+      value: 441.6131591796875
       name: Max Accuracy Threshold
     - type: max_f1
+      value: 0.8622754491017963
       name: Max F1
     - type: max_f1_threshold
+      value: 430.6374206542969
       name: Max F1 Threshold
     - type: max_precision
+      value: 0.813953488372093
       name: Max Precision
     - type: max_recall
+      value: 0.935064935064935
       name: Max Recall
     - type: max_ap
+      value: 0.8509292792079832
       name: Max Ap
 ---
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
+    '他には選べないの？',
+    'どっちがおすすめ？',
+    'お鍋から匂いがしたから',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 | Metric                       | Value      |
 |:-----------------------------|:-----------|
+| cosine_accuracy              | 0.8235     |
+| cosine_accuracy_threshold    | 0.6801     |
+| cosine_f1                    | 0.8571     |
+| cosine_f1_threshold          | 0.6611     |
+| cosine_precision             | 0.7912     |
+| cosine_recall                | 0.9351     |
+| cosine_ap                    | 0.8466     |
+| dot_accuracy                 | 0.8162     |
+| dot_accuracy_threshold       | 441.6132   |
+| dot_f1                       | 0.8521     |
+| dot_f1_threshold             | 379.9227   |
+| dot_precision                | 0.7826     |
+| dot_recall                   | 0.9351     |
+| dot_ap                       | 0.8509     |
+| manhattan_accuracy           | 0.8309     |
+| manhattan_accuracy_threshold | 420.1962   |
+| manhattan_f1                 | 0.8623     |
+| manhattan_f1_threshold       | 430.6374   |
+| manhattan_precision          | 0.8        |
+| manhattan_recall             | 0.9351     |
+| manhattan_ap                 | 0.8484     |
+| euclidean_accuracy           | 0.8309     |
+| euclidean_accuracy_threshold | 18.9389    |
+| euclidean_f1                 | 0.8589     |
+| euclidean_f1_threshold       | 18.9389    |
+| euclidean_precision          | 0.814      |
+| euclidean_recall             | 0.9091     |
+| euclidean_ap                 | 0.847      |
+| max_accuracy                 | 0.8309     |
+| max_accuracy_threshold       | 441.6132   |
+| max_f1                       | 0.8623     |
+| max_f1_threshold             | 430.6374   |
+| max_precision                | 0.814      |
+| max_recall                   | 0.9351     |
+| **max_ap**                   | **0.8509** |
 <!--
 ## Bias, Risks and Limitations
   |         | text1                                                                            | text2                                                                            | label                                           |
   |:--------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:------------------------------------------------|
   | type    | string                                                                           | string                                                                           | int                                             |
+  | details | <ul><li>min: 4 tokens</li><li>mean: 8.31 tokens</li><li>max: 15 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 8.03 tokens</li><li>max: 14 tokens</li></ul> | <ul><li>0: ~40.81%</li><li>1: ~59.19%</li></ul> |
 * Samples:
+  | text1                    | text2                          | label          |
+  |:-------------------------|:-------------------------------|:---------------|
+  | <code>姿かたちを変える魔法</code>  | <code>物の姿を変えられる魔法</code>       | <code>1</code> |
+  | <code>青いオーブを見かけた？</code> | <code>青いオーブがどこにあるか知ってる？</code> | <code>1</code> |
+  | <code>猫のぬいぐるみを見たよ</code> | <code>猫のぬいぐるみを失くさなかった？</code>  | <code>1</code> |
 * Loss: [<code>ContrastiveLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#contrastiveloss) with these parameters:
   ```json
   {
 * Size: 680 evaluation samples
 * Columns: <code>text1</code>, <code>text2</code>, and <code>label</code>
 * Approximate statistics based on the first 680 samples:
+  |         | text1                                                                            | text2                                                                            | label                                           |
+  |:--------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:------------------------------------------------|
+  | type    | string                                                                           | string                                                                           | int                                             |
+  | details | <ul><li>min: 4 tokens</li><li>mean: 8.24 tokens</li><li>max: 15 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 7.88 tokens</li><li>max: 14 tokens</li></ul> | <ul><li>0: ~43.38%</li><li>1: ~56.62%</li></ul> |
 * Samples:
+  | text1                   | text2                    | label          |
+  |:------------------------|:-------------------------|:---------------|
+  | <code>調子はどう？</code>     | <code>最近どう？</code>       | <code>1</code> |
+  | <code>なにも要らない</code>    | <code>家の中</code>         | <code>0</code> |
+  | <code>昨日は何を作ったの？</code> | <code>ビーフシチュー食べた？</code> | <code>0</code> |
 * Loss: [<code>ContrastiveLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#contrastiveloss) with these parameters:
   ```json
   {
 ### Training Logs
 | Epoch  | Step | Training Loss | loss   | custom-arc-semantics-data-jp_max_ap |
 |:------:|:----:|:-------------:|:------:|:-----------------------------------:|
+| None   | 0    | -             | -      | 0.7957                              |
+| 1.0147 | 69   | 0.0205        | 0.0199 | 0.8294                              |
+| 2.0147 | 138  | 0.0148        | 0.0180 | 0.8410                              |
+| 3.0147 | 207  | 0.0118        | 0.0173 | 0.8455                              |
+| 4.0147 | 276  | 0.0104        | 0.0170 | 0.8489                              |
+| 4.9412 | 340  | 0.0098        | 0.0168 | 0.8509                              |
 ### Framework Versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8fe7d723f6f2967a407c28454804bf0b0d4f54133d6c4cc0cf1969aa91b6e299
 size 442491744

 version https://git-lfs.github.com/spec/v1
+oid sha256:df532cbf8c515730b079cb46df0f8397bd0412de5a0619608c49d03caf8e7902
 size 442491744

runs/Sep12_19-08-06_default/events.out.tfevents.1726168095.default.1847.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:32eddd6cc79cf3f51591dc64c1ddf3bba17c026db5c91ff0bfdf16cb3f7d029c
+size 22423
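
Review note: the F1 values on the "+" side of the README.md diff can be cross-checked against the precision and recall reported next to them. The sketch below assumes the standard definition F1 = 2PR / (P + R) and assumes that each precision/recall pair was measured at the same threshold as the corresponding F1; the diff does not state this explicitly. The helper names `f1_from_pr` and `max_abs_f1_gap` are hypothetical and exist only for this check.

```python
# Values copied from the "+" side of the README.md diff:
# (precision, recall, reported F1) per similarity function.
reported = {
    "cosine": (0.7912087912087912, 0.935064935064935, 0.8571428571428572),
    "dot": (0.782608695652174, 0.935064935064935, 0.8520710059171598),
    "manhattan": (0.8, 0.935064935064935, 0.8622754491017963),
    "euclidean": (0.813953488372093, 0.9090909090909091, 0.8588957055214723),
}


def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    return 2 * precision * recall / (precision + recall)


def max_abs_f1_gap(rows: dict) -> float:
    """Largest absolute difference between recomputed and reported F1."""
    return max(abs(f1_from_pr(p, r) - f1) for p, r, f1 in rows.values())


gap = max_abs_f1_gap(reported)
print(f"largest |recomputed F1 - reported F1| = {gap:.2e}")
```

If the printed gap is at floating-point noise level, the four reported F1 scores agree with their precision and recall under the assumptions above. The check says nothing about whether the metrics themselves improved or regressed relative to the "-" side.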