Model save

Browse files

Files changed (8) hide show

1_Pooling/config.json +10 -0
README.md +63 -0
clf_head/config.json +1 -0
clf_head/model.safetensors +3 -0
config_sentence_transformers.json +10 -0
config_topicfit.json +4 -0
modules.json +20 -0
sentence_bert_config.json +4 -0

1_Pooling/config.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+  "word_embedding_dimension": 768,
+  "pooling_mode_cls_token": false,
+  "pooling_mode_mean_tokens": true,
+  "pooling_mode_max_tokens": false,
+  "pooling_mode_mean_sqrt_len_tokens": false,
+  "pooling_mode_weightedmean_tokens": false,
+  "pooling_mode_lasttoken": false,
+  "include_prompt": true
+}

README.md ADDED Viewed

	@@ -0,0 +1,63 @@

+---
+tags:
+- generated_from_trainer
+model-index:
+- name: topicfit-all-mpnet-base-v2-sentence-label-diff_prod-epoch-8
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# topicfit-all-mpnet-base-v2-sentence-label-diff_prod-epoch-8
+This model was trained from scratch on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0000
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 32
+- eval_batch_size: 32
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 8
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 0.0073        | 1.0   | 4880  | 0.0028          |
+| 0.0027        | 2.0   | 9760  | 0.0007          |
+| 0.0016        | 3.0   | 14640 | 0.0002          |
+| 0.0           | 4.0   | 19520 | 0.0000          |
+| 0.0           | 5.0   | 24400 | 0.0000          |
+| 0.0           | 6.0   | 29280 | 0.0000          |
+| 0.0           | 7.0   | 34160 | 0.0000          |
+| 0.0           | 8.0   | 39040 | 0.0000          |
+### Framework versions
+- Transformers 4.42.3
+- Pytorch 2.3.1+cu121
+- Datasets 2.20.0
+- Tokenizers 0.19.1

clf_head/config.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"in_features": 3072, "hidden_dim": 768, "dropout_prob": 0.3, "activation_function": "torch.nn.modules.activation.ReLU"}

clf_head/model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:44709ac049d3b22caea8775f1b42ef79b22702608e63b0d0ef24e6d208f21344
+size 9443652

config_sentence_transformers.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+  "__version__": {
+    "sentence_transformers": "3.0.1",
+    "transformers": "4.42.3",
+    "pytorch": "2.3.1+cu121"
+  },
+  "prompts": {},
+  "default_prompt_name": null,
+  "similarity_fn_name": null
+}

config_topicfit.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "use_classification_head": true,
+  "feature_concat_mode": "diff_prod"
+}

modules.json ADDED Viewed

	@@ -0,0 +1,20 @@

+[
+  {
+    "idx": 0,
+    "name": "0",
+    "path": "",
+    "type": "sentence_transformers.models.Transformer"
+  },
+  {
+    "idx": 1,
+    "name": "1",
+    "path": "1_Pooling",
+    "type": "sentence_transformers.models.Pooling"
+  },
+  {
+    "idx": 2,
+    "name": "2",
+    "path": "2_Normalize",
+    "type": "sentence_transformers.models.Normalize"
+  }
+]

sentence_bert_config.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "max_seq_length": 384,
+  "do_lower_case": false
+}