reddgr commited on
Commit
519ce50
1 Parent(s): e3aed10

Upload TFDistilBertForSequenceClassification

Browse files
Files changed (2) hide show
  1. README.md +64 -65
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -1,65 +1,64 @@
1
- ---
2
- library_name: transformers
3
- license: apache-2.0
4
- base_model: distilbert-base-uncased
5
- tags:
6
- - generated_from_keras_callback
7
- model-index:
8
- - name: tl-test-learn-prompt-classifier
9
- results: []
10
- datasets:
11
- - reddgr/tl-test-learn-prompts
12
- ---
13
-
14
- <!-- This model card has been generated automatically according to the information Keras had access to. You should
15
- probably proofread and complete it, then remove this comment. -->
16
-
17
- # tl-test-learn-prompt-classifier
18
-
19
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
20
- It achieves the following results on the evaluation set:
21
- - Train Loss: 0.1733
22
- - Train Accuracy: 0.9756
23
- - Validation Loss: 0.3006
24
- - Validation Accuracy: 0.8977
25
- - Epoch: 6
26
-
27
- ## Model description
28
-
29
- More information needed
30
-
31
- ## Intended uses & limitations
32
-
33
- More information needed
34
-
35
- ## Training and evaluation data
36
-
37
- More information needed
38
-
39
- ## Training procedure
40
-
41
- ### Training hyperparameters
42
-
43
- The following hyperparameters were used during training:
44
- - optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': False, 'is_legacy_optimizer': False, 'learning_rate': 5e-06, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
45
- - training_precision: float32
46
-
47
- ### Training results
48
-
49
- | Train Loss | Train Accuracy | Validation Loss | Validation Accuracy | Epoch |
50
- |:----------:|:--------------:|:---------------:|:-------------------:|:-----:|
51
- | 0.6870 | 0.5707 | 0.6656 | 0.6136 | 0 |
52
- | 0.6542 | 0.6293 | 0.6289 | 0.6477 | 1 |
53
- | 0.5970 | 0.7902 | 0.5541 | 0.7955 | 2 |
54
- | 0.4936 | 0.8829 | 0.4490 | 0.8523 | 3 |
55
- | 0.3649 | 0.9415 | 0.3775 | 0.875 | 4 |
56
- | 0.2563 | 0.9561 | 0.3254 | 0.8977 | 5 |
57
- | 0.1733 | 0.9756 | 0.3006 | 0.8977 | 6 |
58
-
59
-
60
- ### Framework versions
61
-
62
- - Transformers 4.44.2
63
- - TensorFlow 2.18.0-dev20240717
64
- - Datasets 2.21.0
65
- - Tokenizers 0.19.1
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ base_model: distilbert-base-uncased
5
+ tags:
6
+ - generated_from_keras_callback
7
+ model-index:
8
+ - name: tl-test-learn-prompt-classifier
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information Keras had access to. You should
13
+ probably proofread and complete it, then remove this comment. -->
14
+
15
+ # tl-test-learn-prompt-classifier
16
+
17
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Train Loss: 0.1146
20
+ - Train Accuracy: 0.9871
21
+ - Validation Loss: 0.3037
22
+ - Validation Accuracy: 0.8600
23
+ - Epoch: 7
24
+
25
+ ## Model description
26
+
27
+ More information needed
28
+
29
+ ## Intended uses & limitations
30
+
31
+ More information needed
32
+
33
+ ## Training and evaluation data
34
+
35
+ More information needed
36
+
37
+ ## Training procedure
38
+
39
+ ### Training hyperparameters
40
+
41
+ The following hyperparameters were used during training:
42
+ - optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': False, 'is_legacy_optimizer': False, 'learning_rate': 5e-06, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
43
+ - training_precision: float32
44
+
45
+ ### Training results
46
+
47
+ | Train Loss | Train Accuracy | Validation Loss | Validation Accuracy | Epoch |
48
+ |:----------:|:--------------:|:---------------:|:-------------------:|:-----:|
49
+ | 0.6898 | 0.5431 | 0.6834 | 0.5300 | 0 |
50
+ | 0.6735 | 0.5517 | 0.6590 | 0.5800 | 1 |
51
+ | 0.6284 | 0.7112 | 0.5814 | 0.8700 | 2 |
52
+ | 0.5261 | 0.8707 | 0.4635 | 0.8400 | 3 |
53
+ | 0.3826 | 0.9310 | 0.3632 | 0.8400 | 4 |
54
+ | 0.2473 | 0.9569 | 0.3114 | 0.8500 | 5 |
55
+ | 0.1700 | 0.9741 | 0.2882 | 0.8700 | 6 |
56
+ | 0.1146 | 0.9871 | 0.3037 | 0.8600 | 7 |
57
+
58
+
59
+ ### Framework versions
60
+
61
+ - Transformers 4.44.2
62
+ - TensorFlow 2.18.0-dev20240717
63
+ - Datasets 2.21.0
64
+ - Tokenizers 0.19.1
 
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a74d43b14980d59d36249fa00d6e2161b380d0b5c56ee5677def4d21fa682ba1
3
  size 267955144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8d303255162e68eeb029d18d19adeb17032be80c8327d51ac98242a8e01da5bf
3
  size 267955144