gustavomalkomes committed on
Commit feeab0d · verified · 1 Parent(s): 24ddd8b

End of training

README.md CHANGED
@@ -3,6 +3,8 @@ library_name: transformers
  license: apache-2.0
  base_model: google/vit-base-patch16-224-in21k
  tags:
+ - image-classification
+ - vision
  - generated_from_trainer
  model-index:
  - name: vit-base-patch16-224-in21k
@@ -14,7 +16,18 @@ should probably proofread and complete it, then remove this comment. -->
 
  # vit-base-patch16-224-in21k
 
- This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on an unknown dataset.
+ This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the chainyo/rvl-cdip dataset.
+ It achieves the following results on the evaluation set:
+ - eval_loss: 2.7757
+ - eval_model_preparation_time: 0.0077
+ - eval_accuracy: 0.0569
+ - eval_runtime: 242.1935
+ - eval_samples_per_second: 198.189
+ - eval_steps_per_second: 3.097
+ - memory_allocated (GB): 0.68
+ - max_memory_allocated (GB): 0.76
+ - total_memory_available (GB): 126.62
+ - step: 0
 
  ## Model description
 
@@ -41,13 +54,9 @@ The following hyperparameters were used during training:
  - lr_scheduler_type: linear
  - num_epochs: 3.0
 
- ### Training results
-
-
-
  ### Framework versions
 
  - Transformers 4.45.2
- - Pytorch 2.5.0a0+git6bdd7f4
+ - Pytorch 2.4.0a0+git12138a8
  - Datasets 3.1.0
  - Tokenizers 0.20.3
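The evaluation numbers added to the README are recorded at `step: 0`, so it can help to compare them with what uniform random guessing would produce. The following is a minimal sketch, not part of the commit. It assumes the RVL-CDIP labels cover 16 document classes (the commit itself does not state the class count), and `near_chance` with its 15% tolerance is a hypothetical helper chosen only for illustration.

```python
import math

# Assumption: the RVL-CDIP evaluation labels cover 16 document classes.
NUM_CLASSES = 16

# Values copied from the updated README in this commit.
eval_loss = 2.7757
eval_accuracy = 0.0569

# Baselines for a classifier that predicts every class with equal probability.
chance_accuracy = 1.0 / NUM_CLASSES   # expected accuracy of uniform guessing
chance_loss = math.log(NUM_CLASSES)   # cross-entropy of a uniform prediction


def near_chance(value, baseline, rel_tol=0.15):
    """Return True when value is within rel_tol (relative) of baseline.

    The 15% tolerance is an arbitrary illustrative threshold.
    """
    return math.isclose(value, baseline, rel_tol=rel_tol)


looks_untrained = near_chance(eval_accuracy, chance_accuracy) and near_chance(
    eval_loss, chance_loss
)

print(f"chance accuracy {chance_accuracy:.4f} vs reported {eval_accuracy:.4f}")
print(f"chance loss     {chance_loss:.4f} vs reported {eval_loss:.4f}")
print("consistent with chance level:", looks_untrained)
```

Under the 16-class assumption, both reported values sit close to the uniform baselines, which is what one would expect from a classification head evaluated before any training step.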
all_results.json CHANGED
@@ -1,16 +1,11 @@
  {
- "epoch": 3.0,
- "eval_accuracy": 0.06302083333333333,
- "eval_loss": NaN,
- "eval_runtime": 242.9557,
- "eval_samples_per_second": 197.567,
- "eval_steps_per_second": 3.087,
- "max_memory_allocated (GB)": 1.61,
- "memory_allocated (GB)": 1.53,
- "total_flos": 6.324139790696448e+19,
- "total_memory_available (GB)": 126.62,
- "train_loss": 0.0,
- "train_runtime": 0.0058,
- "train_samples_per_second": 140961263.636,
- "train_steps_per_second": 17620222.735
+ "eval_accuracy": 0.056854166666666664,
+ "eval_loss": 2.775695562362671,
+ "eval_model_preparation_time": 0.0077,
+ "eval_runtime": 242.1935,
+ "eval_samples_per_second": 198.189,
+ "eval_steps_per_second": 3.097,
+ "max_memory_allocated (GB)": 0.76,
+ "memory_allocated (GB)": 0.68,
+ "total_memory_available (GB)": 126.62
  }
eval_results.json CHANGED
@@ -1,11 +1,11 @@
  {
- "epoch": 3.0,
- "eval_accuracy": 0.06302083333333333,
- "eval_loss": NaN,
- "eval_runtime": 242.9557,
- "eval_samples_per_second": 197.567,
- "eval_steps_per_second": 3.087,
- "max_memory_allocated (GB)": 1.61,
- "memory_allocated (GB)": 1.53,
+ "eval_accuracy": 0.056854166666666664,
+ "eval_loss": 2.775695562362671,
+ "eval_model_preparation_time": 0.0077,
+ "eval_runtime": 242.1935,
+ "eval_samples_per_second": 198.189,
+ "eval_steps_per_second": 3.097,
+ "max_memory_allocated (GB)": 0.76,
+ "memory_allocated (GB)": 0.68,
  "total_memory_available (GB)": 126.62
  }
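The throughput fields in the new `eval_results.json` allow a rough cross-check of how many samples and steps the evaluation covered. This is a sketch only: the file content is inlined rather than read from disk, the logged rates are rounded so the derived counts are estimates, and the variable names are illustrative rather than taken from the commit.

```python
import json

# New eval_results.json from this commit, inlined so the sketch is self-contained.
NEW_RESULTS = """
{
  "eval_accuracy": 0.056854166666666664,
  "eval_loss": 2.775695562362671,
  "eval_model_preparation_time": 0.0077,
  "eval_runtime": 242.1935,
  "eval_samples_per_second": 198.189,
  "eval_steps_per_second": 3.097,
  "max_memory_allocated (GB)": 0.76,
  "memory_allocated (GB)": 0.68,
  "total_memory_available (GB)": 126.62
}
"""

results = json.loads(NEW_RESULTS)
runtime = results["eval_runtime"]

# rate * runtime recovers the count; rounding absorbs the logging precision.
approx_samples = round(runtime * results["eval_samples_per_second"])
approx_steps = round(runtime * results["eval_steps_per_second"])

# Effective evaluation batch size across all devices (an estimate).
approx_batch_size = approx_samples / approx_steps

# Number of correctly classified samples implied by the accuracy.
approx_correct = round(results["eval_accuracy"] * approx_samples)

print("samples:", approx_samples)
print("steps:", approx_steps)
print("effective batch size:", approx_batch_size)
print("correct predictions:", approx_correct)
```

If the derived sample count and the implied number of correct predictions both come out as whole numbers, that is a reasonable sign the logged fields are mutually consistent.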
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:cee937384821f6a29edd22802024c28f69d8d2f494c9e3f9226139369360880f
+ oid sha256:e19e27533b68b1a92962c45efb7747d6958b8e330b232bef39e45f6353f2b20b
  size 343267040
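The `model.safetensors` change above touches only a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 object id, and the byte size. A minimal sketch of parsing such a pointer and checking a payload against it follows. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the check runs over in-memory bytes for brevity.

```python
import hashlib

# New pointer for model.safetensors, copied from this commit.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:e19e27533b68b1a92962c45efb7747d6958b8e330b232bef39e45f6353f2b20b
size 343267040
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True when the bytes have the size and SHA-256 named by the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    if len(data) != pointer["size"]:
        return False
    return hashlib.sha256(data).hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

For a file the size of this checkpoint (about 343 MB), hashing in chunks from disk would be preferable to loading everything into memory. Note also that the size is unchanged between the old and new pointers while the digest differs, so the weights changed but the tensor layout most likely did not.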
runs/Nov07_16-54-53_gtown-28NZK54/events.out.tfevents.1730998742.gtown-28NZK54.5246.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3d4c7a774dbe16c01f9c94c443f1cf3bf052c9725a754d57251211af0dc94112
+ size 560
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1c918ca48e8707d0ec9cca8c7f991990d16019ca7aef9c462322d1bd3ede7b72
+ oid sha256:4a2d98ff30a5170f30cb47e0160611c448ea2f3c18d9809dca93793ed9059dd5
  size 5112