Add model, config, tokenizer, and custom code

Files changed (6)

README.md CHANGED

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the iva_mt_wslot dataset.
 It achieves the following results on the evaluation set:
-- Loss: 6.0838
-- Bleu: 0.1564
-- Gen Len: 7.7876
+- Loss: 6.0627
+- Bleu: 0.0253
+- Gen Len: 5.1184
 ## Model description
@@ -55,11 +55,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:------:|:----:|:---------------:|:------:|:-------:|
-| 7.381         | 0.9992 | 636  | 7.0427          | 0.0009 | 4.4319  |
-| 6.5076        | 2.0    | 1273 | 6.4710          | 0.1178 | 12.0321 |
-| 6.0729        | 2.9992 | 1909 | 6.2192          | 0.1097 | 8.2657  |
-| 5.8922        | 4.0    | 2546 | 6.1254          | 0.1464 | 9.1024  |
-| 5.7405        | 4.9961 | 3180 | 6.0838          | 0.1564 | 7.7876  |
+| 7.4078        | 0.9992 | 636  | 7.0700          | 0.0    | 3.9948  |
+| 6.525         | 2.0    | 1273 | 6.4606          | 0.0225 | 6.3877  |
+| 6.082         | 2.9992 | 1909 | 6.2044          | 0.0103 | 4.9663  |
+| 5.8782        | 4.0    | 2546 | 6.1041          | 0.0248 | 5.9008  |
+| 5.756         | 4.9961 | 3180 | 6.0627          | 0.0253 | 5.1184  |
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eaec370b26e0baba405b59911ef9b1b3b4ebaf4e23efc5558e94f51d22fb5c29
+oid sha256:a21690d680740ed313ee98e2983df7a120317f7b6c97bd0e5aaaea74e8724e9e
 size 109582088
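
The model.safetensors entry is a Git LFS pointer, so the diff shows only a new `oid` with an unchanged `size`. As a rough sketch of how to read that, the snippet below parses both pointers and compares them. `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling, and it assumes the simple `key value` line layout visible in the hunk.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the two sides of the model.safetensors hunk.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:eaec370b26e0baba405b59911ef9b1b3b4ebaf4e23efc5558e94f51d22fb5c29
size 109582088
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:a21690d680740ed313ee98e2983df7a120317f7b6c97bd0e5aaaea74e8724e9e
size 109582088
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# A different digest with the same byte size is consistent with retrained
# weights for an unchanged architecture (same tensor shapes and dtypes).
weights_changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]
print(f"weights changed: {weights_changed}, same size: {same_size}")
```

The same reading applies to training_args.bin, whose pointer also changes `oid` while keeping `size 5368`.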

runs/Nov11_17-56-19_0d13c4d4bf27/events.out.tfevents.1731347780.0d13c4d4bf27.458.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:a98feeac48e1035f1f96e56030abcc0cd4f7e7f48268b965732b4e36eb00e684
+size 20281

runs/Nov11_18-38-16_0d13c4d4bf27/events.out.tfevents.1731350296.0d13c4d4bf27.458.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:40e280280ff45a05d7e0680a411332044d8b64f23535267e5b13d26eb4ba011a
+size 20281

tokenizer.json CHANGED

@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 128,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 128
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 0,
-    "pad_type_id": 0,
-    "pad_token": "<pad>"
-  },
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:755599dcde2e330c7f603634715d07f5e72b285fb4719a689c6fad2302af3f3e
+oid sha256:ae8ab13b397906dd96ac35412ceecffae90fb25805059f75fab3fc9de21dec11
 size 5368
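
A note on the tokenizer.json hunk in this commit: it replaces the serialized `truncation` and `padding` blocks (fixed length 128, `<pad>` with id 0) with `null`, so the file itself no longer carries a default length limit or padding rule. The sketch below is one way to check what a given tokenizer.json declares, using only the standard `json` module. `describe_tokenizer_limits` is a hypothetical helper, and the two JSON strings are trimmed to the fields visible in the diff; the real file contains more keys.

```python
import json


def describe_tokenizer_limits(tokenizer_json: str) -> dict:
    """Summarize the truncation/padding settings serialized in a tokenizer.json string."""
    cfg = json.loads(tokenizer_json)
    truncation = cfg.get("truncation")  # None when the JSON value is null or absent
    padding = cfg.get("padding")
    return {
        "truncates": truncation is not None,
        "max_length": truncation["max_length"] if truncation else None,
        "pads": padding is not None,
        "pad_token": padding["pad_token"] if padding else None,
    }


# Trimmed copies of the old and new sides of the hunk (assumption: other keys omitted).
OLD_CONFIG = json.dumps({
    "version": "1.0",
    "truncation": {"direction": "Right", "max_length": 128,
                   "strategy": "LongestFirst", "stride": 0},
    "padding": {"strategy": {"Fixed": 128}, "direction": "Right",
                "pad_to_multiple_of": None, "pad_id": 0,
                "pad_type_id": 0, "pad_token": "<pad>"},
})
NEW_CONFIG = json.dumps({"version": "1.0", "truncation": None, "padding": None})

old_limits = describe_tokenizer_limits(OLD_CONFIG)
new_limits = describe_tokenizer_limits(NEW_CONFIG)
print("old:", old_limits)
print("new:", new_limits)
```

With the new file, any maximum length or padding presumably has to be passed explicitly when the tokenizer is called, rather than being inherited from the serialized settings.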