avramesh/ft-attempt1

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3855
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 4.0204        | 0.9231 | 3    | 3.4582          |
-| 3.8317        | 1.8462 | 6    | 3.2753          |
-| 3.6194        | 2.7692 | 9    | 3.1305          |
-| 2.5725        | 4.0    | 13   | 2.9379          |
-| 3.2435        | 4.9231 | 16   | 2.7861          |
-| 3.0346        | 5.8462 | 19   | 2.6554          |
-| 2.8818        | 6.7692 | 22   | 2.5453          |
-| 2.0646        | 8.0    | 26   | 2.4340          |
-| 2.6678        | 8.9231 | 29   | 2.3907          |
-| 1.8536        | 9.2308 | 30   | 2.3855          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.3938
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 4.0211        | 0.9231 | 3    | 3.4617          |
+| 3.8387        | 1.8462 | 6    | 3.2827          |
+| 3.6289        | 2.7692 | 9    | 3.1373          |
+| 2.5801        | 4.0    | 13   | 2.9469          |
+| 3.2557        | 4.9231 | 16   | 2.7959          |
+| 3.0471        | 5.8462 | 19   | 2.6640          |
+| 2.8925        | 6.7692 | 22   | 2.5527          |
+| 2.0712        | 8.0    | 26   | 2.4415          |
+| 2.6765        | 8.9231 | 29   | 2.3988          |
+| 1.8573        | 9.2308 | 30   | 2.3938          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e6333943ac3447250b6c80e0a602d45b7693b00aa9dc2a91c75d766b761a5c17
 size 8397056

 version https://git-lfs.github.com/spec/v1
+oid sha256:b7ae5cf5a82d62af8b2822c2cf362ede59148c2f98ddc68bc4febdd74c5e3418
 size 8397056

runs/Jul05_17-17-55_palomino/events.out.tfevents.1720199875.palomino.763240.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5b960af4f332e478fc087c64871b6ac6495fcd9390b355ee5f917f13e1d1831e
+size 5851

runs/Jul05_17-18-32_palomino/events.out.tfevents.1720199912.palomino.763441.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:2ef75c067e2bb21a9017fc8b0418ec1ef5d7639ee1940294b31f94cf923251f4
+size 9983

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:613400c88a1864f1cadcdaa20e9a7b355cd6ac9da324e91f927acc5455dc1fbe
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:a55c8d6d0f17977eddfca857e5504b37710103b8264a8d44bd0c3bbb85a12eb3
 size 5112