Darshankumar committed on
Commit 8890078
1 Parent(s): a686d63

End of training

README.md ADDED
@@ -0,0 +1,78 @@
+ ---
+ license: mit
+ base_model: microsoft/git-base
+ tags:
+ - generated_from_trainer
+ datasets:
+ - imagefolder
+ model-index:
+ - name: git-base-pokemon
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # git-base-pokemon
+
+ This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on the imagefolder dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.1481
+ - Wer Score: 7.2150
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 2
+ - eval_batch_size: 2
+ - seed: 42
+ - gradient_accumulation_steps: 2
+ - total_train_batch_size: 4
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 50
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Wer Score |
+ |:-------------:|:-----:|:----:|:---------------:|:---------:|
+ | 0.0359 | 3.12 | 50 | 0.1192 | 0.8131 |
+ | 0.0174 | 6.25 | 100 | 0.1257 | 3.0654 |
+ | 0.0132 | 9.38 | 150 | 0.1283 | 0.7850 |
+ | 0.011 | 12.5 | 200 | 0.1297 | 1.4112 |
+ | 0.0095 | 15.62 | 250 | 0.1332 | 5.1028 |
+ | 0.0083 | 18.75 | 300 | 0.1376 | 5.5701 |
+ | 0.0077 | 21.88 | 350 | 0.1368 | 0.7944 |
+ | 0.0068 | 25.0 | 400 | 0.1366 | 5.6168 |
+ | 0.0061 | 28.12 | 450 | 0.1417 | 4.4299 |
+ | 0.0057 | 31.25 | 500 | 0.1406 | 6.6636 |
+ | 0.0047 | 34.38 | 550 | 0.1438 | 7.3738 |
+ | 0.0038 | 37.5 | 600 | 0.1448 | 7.6262 |
+ | 0.0032 | 40.62 | 650 | 0.1468 | 9.0841 |
+ | 0.0027 | 43.75 | 700 | 0.1473 | 6.8598 |
+ | 0.0024 | 46.88 | 750 | 0.1480 | 7.3178 |
+ | 0.0021 | 50.0 | 800 | 0.1481 | 7.2150 |
+
+
+ ### Framework versions
+
+ - Transformers 4.35.0
+ - Pytorch 2.1.0+cu118
+ - Datasets 2.14.6
+ - Tokenizers 0.14.1
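A note on the Wer Score column in the README above: values well above 1 (for example 7.2150 at step 800) are not necessarily a logging error. Word error rate divides the word-level edit distance by the number of reference words, so it can exceed 1 whenever the generated caption is much longer than the reference. The following is a minimal sketch of that definition using only the standard library. It assumes the usual (substitutions + deletions + insertions) / reference-length formula; the model card does not say which implementation produced the reported scores, and the captions below are invented for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion of a reference word
                    curr[j - 1] + 1,     # insertion of a hypothesis word
                    prev[j - 1] + cost,  # substitution or exact match
                )
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical captions, not taken from the training data.
reference = "a green dragon"
short_guess = "a green lizard"
rambling_guess = "a green dragon " + "with wings " * 5

print(word_error_rate(reference, short_guess))     # one substitution over three words
print(word_error_rate(reference, rambling_guess))  # ten insertions over three words
```

Under this definition, a caption that repeats or rambles past the end of a short reference is penalized once per extra word, which is one plausible way for the score to climb while the training loss keeps falling.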
generation_config.json ADDED
@@ -0,0 +1,7 @@
+ {
+   "_from_model_config": true,
+   "bos_token_id": 101,
+   "eos_token_id": 102,
+   "pad_token_id": 0,
+   "transformers_version": "4.35.0"
+ }
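For anyone checking the special-token ids in generation_config.json, the file is plain JSON and can be inspected without loading the model. This is a small sketch using only the standard library; the helper name `special_token_ids` is made up for illustration. As an assumption not stated in this commit, ids 101, 102 and 0 look like the `[CLS]`, `[SEP]` and `[PAD]` ids of a BERT-style vocabulary, which would be worth confirming against the tokenizer files in the repository.

```python
import json

# Contents of generation_config.json as added in this commit.
GENERATION_CONFIG = """
{
  "_from_model_config": true,
  "bos_token_id": 101,
  "eos_token_id": 102,
  "pad_token_id": 0,
  "transformers_version": "4.35.0"
}
"""


def special_token_ids(raw_json: str) -> dict:
    """Return the bos/eos/pad ids, failing loudly if any of them is missing."""
    config = json.loads(raw_json)
    keys = ("bos_token_id", "eos_token_id", "pad_token_id")
    missing = [key for key in keys if key not in config]
    if missing:
        raise KeyError(f"generation config is missing: {missing}")
    return {key: config[key] for key in keys}


ids = special_token_ids(GENERATION_CONFIG)
print(ids)
```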
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e7c8e099ba49c1ed13348e480cf359d5248dfda00e1adedd7da32bde9e3a971e
+ oid sha256:36c32d63cbd6e6d4c10ff5d4d4a289ec610efd4b69d4bb58377fcf5382641133
  size 706516040
runs/Nov13_12-18-57_8f40eeb4e341/events.out.tfevents.1699878938.8f40eeb4e341.19050.3 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f1d024fd7c39b6437dd2a89bc050177f716313dbbca2eb6231a4638963caf17d
- size 12111
+ oid sha256:2b87159935da34be76d065a9f4a98b890acc828a26fb1ea809fb8ea14e2f336c
+ size 12465
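The two CHANGED entries above are Git LFS pointer files, not the binaries themselves: each records a spec version, a SHA-256 `oid`, and a byte `size`. For model.safetensors the size stays at 706516040 while the oid changes, which is what one would expect when weights are updated without changing the architecture. The sketch below shows one way to parse a pointer and check a downloaded file against it, using only the standard library. The function names are hypothetical, and the `demo` bytes are a stand-in because the real files are not part of this page.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True if the bytes have the size and SHA-256 digest named in the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# New pointer for the TensorBoard event file from this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:2b87159935da34be76d065a9f4a98b890acc828a26fb1ea809fb8ea14e2f336c\n"
    "size 12465\n"
)
print(pointer["algo"], pointer["size"])

# Stand-in bytes to exercise the check; a real check would read the downloaded file.
demo = b"stand-in file contents"
demo_pointer = {
    "version": pointer["version"],
    "algo": "sha256",
    "digest": hashlib.sha256(demo).hexdigest(),
    "size": len(demo),
}
print(matches_pointer(demo, demo_pointer))
```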