Plim committed on
Commit
e8b3151
1 Parent(s): 505559b

clean start

Files changed (34)
  1. .gitignore +3 -1
  2. .ipynb_checkpoints/README-checkpoint.md +0 -70
  3. .ipynb_checkpoints/run-checkpoint.sh +3 -5
  4. README.md +0 -62
  5. all_results.json +0 -14
  6. config.json +0 -107
  7. eval_results.json +0 -9
  8. preprocessor_config.json +0 -9
  9. pytorch_model.bin +0 -3
  10. run.sh +3 -5
  11. train_results.json +0 -8
  12. trainer_state.json +0 -25
  13. training_args.bin +0 -3
  14. wandb/debug-internal.log +0 -1
  15. wandb/debug.log +0 -1
  16. wandb/latest-run +0 -1
  17. wandb/run-20220130_224738-2uzt3kt1/files/conda-environment.yaml +0 -0
  18. wandb/run-20220130_224738-2uzt3kt1/files/config.yaml +0 -698
  19. wandb/run-20220130_224738-2uzt3kt1/files/output.log +0 -66
  20. wandb/run-20220130_224738-2uzt3kt1/files/requirements.txt +0 -180
  21. wandb/run-20220130_224738-2uzt3kt1/files/wandb-metadata.json +0 -63
  22. wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json +0 -1
  23. wandb/run-20220130_224738-2uzt3kt1/logs/debug-internal.log +0 -210
  24. wandb/run-20220130_224738-2uzt3kt1/logs/debug.log +0 -146
  25. wandb/run-20220130_224738-2uzt3kt1/run-2uzt3kt1.wandb +0 -0
  26. wandb/run-20220130_230018-ktkg6ghu/files/conda-environment.yaml +0 -0
  27. wandb/run-20220130_230018-ktkg6ghu/files/config.yaml +0 -692
  28. wandb/run-20220130_230018-ktkg6ghu/files/output.log +0 -62
  29. wandb/run-20220130_230018-ktkg6ghu/files/requirements.txt +0 -180
  30. wandb/run-20220130_230018-ktkg6ghu/files/wandb-metadata.json +0 -63
  31. wandb/run-20220130_230018-ktkg6ghu/files/wandb-summary.json +0 -1
  32. wandb/run-20220130_230018-ktkg6ghu/logs/debug-internal.log +0 -110
  33. wandb/run-20220130_230018-ktkg6ghu/logs/debug.log +0 -24
  34. wandb/run-20220130_230018-ktkg6ghu/run-ktkg6ghu.wandb +0 -0
.gitignore CHANGED
@@ -1 +1,3 @@
- checkpoint-*/
+ checkpoint-*/
+
+ wandb
.ipynb_checkpoints/README-checkpoint.md DELETED
@@ -1,70 +0,0 @@
- ---
- language:
- - fr
- license: apache-2.0
- tags:
- - automatic-speech-recognition
- - mozilla-foundation/common_voice_7_0
- - generated_from_trainer
- datasets:
- - common_voice
- model-index:
- - name: ''
-   results: []
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- #
-
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - FR dataset.
- It achieves the following results on the evaluation set:
- - Loss: 0.5417
- - Wer: 0.4479
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 7.5e-05
- - train_batch_size: 8
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 4
- - total_train_batch_size: 32
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 2000
- - num_epochs: 0.2
- - mixed_precision_training: Native AMP
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Wer |
- |:-------------:|:-----:|:----:|:---------------:|:------:|
- | 6.9106 | 0.04 | 500 | 6.7171 | 1.0 |
- | 3.0034 | 0.08 | 1000 | 3.0126 | 1.0 |
- | 2.8699 | 0.12 | 1500 | 2.8509 | 0.9817 |
- | 1.629 | 0.16 | 2000 | 0.7787 | 0.5861 |
-
-
- ### Framework versions
-
- - Transformers 4.17.0.dev0
- - Pytorch 1.10.2+cu102
- - Datasets 1.18.2.dev0
- - Tokenizers 0.11.0
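Note on the hyperparameters in the deleted card: `total_train_batch_size` appears to be the product of `train_batch_size`, `gradient_accumulation_steps`, and the device count. The sketch below reproduces the 32 listed here and the 64 listed in the root `README.md`, assuming a single device (the wandb config in this commit records `_n_gpu: 1`). The function name is hypothetical and not part of the training script.

```python
def effective_batch_size(per_device_batch: int, grad_accum_steps: int, n_devices: int = 1) -> int:
    """Number of samples that contribute to one optimizer update."""
    return per_device_batch * grad_accum_steps * n_devices


# Values copied from the two deleted model cards in this commit.
checkpoint_card = effective_batch_size(per_device_batch=8, grad_accum_steps=4)
root_card = effective_batch_size(per_device_batch=8, grad_accum_steps=8)

print(checkpoint_card, root_card)
```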
.ipynb_checkpoints/run-checkpoint.sh CHANGED
@@ -20,14 +20,12 @@ python run_speech_recognition_ctc.py \
  --mask_feature_prob="0.25" \
  --mask_time_length="10" \
  --mask_time_prob="0.75" \
- --max_train_samples="1000" \
- --max_eval_samples="200" \
  --model_name_or_path="facebook/wav2vec2-xls-r-300m" \
- --num_train_epochs="0.4" \
+ --num_train_epochs="2.0" \
  --output_dir="./" \
  --overwrite_output_dir \
- --per_device_train_batch_size="8" \
- --per_device_eval_batch_size="8" \
+ --per_device_train_batch_size="16" \
+ --per_device_eval_batch_size="16" \
  --preprocessing_num_workers="4" \
  --push_to_hub \
  --report_to="wandb" \
README.md DELETED
@@ -1,62 +0,0 @@
- ---
- language:
- - fr
- license: apache-2.0
- tags:
- - automatic-speech-recognition
- - mozilla-foundation/common_voice_7_0
- - generated_from_trainer
- model-index:
- - name: ''
-   results: []
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- #
-
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - FR dataset.
- It achieves the following results on the evaluation set:
- - Loss: 16.9129
- - Wer: 2.3789
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 7.5e-05
- - train_batch_size: 8
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 8
- - total_train_batch_size: 64
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 2000
- - num_epochs: 0.4
- - mixed_precision_training: Native AMP
-
- ### Training results
-
-
-
- ### Framework versions
-
- - Transformers 4.17.0.dev0
- - Pytorch 1.10.2+cu102
- - Datasets 1.18.2.dev0
- - Tokenizers 0.11.0
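Note on `Wer: 2.3789` in the deleted card: a word error rate above 1.0 is possible because insertions count as errors while the denominator is only the number of reference words. The sketch below is a plain word-level edit distance written for illustration; it is not the metric implementation the training script used, and the example sentences are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """(substitutions + deletions + insertions) / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,                       # deletion
                            curr[j - 1] + 1,                   # insertion
                            prev[j - 1] + substitution_cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# A barely trained model that emits far more words than the reference contains
# scores well above 1.0: one substitution plus four insertions over one reference word.
print(word_error_rate("bonjour", "bon jour tout le monde"))
```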
all_results.json DELETED
@@ -1,14 +0,0 @@
- {
-   "epoch": 0.38,
-   "eval_loss": 16.912879943847656,
-   "eval_runtime": 8.6337,
-   "eval_samples": 200,
-   "eval_samples_per_second": 23.165,
-   "eval_steps_per_second": 2.896,
-   "eval_wer": 2.3789039481437833,
-   "train_loss": 13.584136962890625,
-   "train_runtime": 23.2007,
-   "train_samples": 1000,
-   "train_samples_per_second": 17.241,
-   "train_steps_per_second": 0.259
- }
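Note on the throughput fields in the deleted `all_results.json`: they can be cross-checked against the other numbers in this commit. The sketch below assumes that the eval rate is `eval_samples / eval_runtime`, and that the train rate is based on `num_epochs * train_samples` (0.4 × 1000 = 400, with 0.4 taken from the deleted `README.md`) rather than on all 1000 samples. Both are inferences from the figures, not something the diff states. The step counts (6 training steps from `trainer_state.json`, 200 / 8 = 25 eval batches) are likewise taken from elsewhere in the commit.

```python
reported = {
    "eval_runtime": 8.6337,
    "eval_samples": 200,
    "eval_samples_per_second": 23.165,
    "eval_steps_per_second": 2.896,
    "train_runtime": 23.2007,
    "train_samples": 1000,
    "train_samples_per_second": 17.241,
    "train_steps_per_second": 0.259,
}

NUM_EPOCHS = 0.4        # from the deleted README.md
TRAIN_STEPS = 6         # global_step in the deleted trainer_state.json
EVAL_BATCHES = 200 // 8  # eval_samples / eval_batch_size

derived = {
    "eval_samples_per_second": reported["eval_samples"] / reported["eval_runtime"],
    "eval_steps_per_second": EVAL_BATCHES / reported["eval_runtime"],
    "train_samples_per_second": NUM_EPOCHS * reported["train_samples"] / reported["train_runtime"],
    "train_steps_per_second": TRAIN_STEPS / reported["train_runtime"],
}

for key, value in derived.items():
    print(f"{key}: derived {value:.3f}, reported {reported[key]}")
```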
config.json DELETED
@@ -1,107 +0,0 @@
- {
-   "_name_or_path": "facebook/wav2vec2-xls-r-300m",
-   "activation_dropout": 0.1,
-   "adapter_kernel_size": 3,
-   "adapter_stride": 2,
-   "add_adapter": false,
-   "apply_spec_augment": true,
-   "architectures": [
-     "Wav2Vec2ForCTC"
-   ],
-   "attention_dropout": 0.0,
-   "bos_token_id": 1,
-   "classifier_proj_size": 256,
-   "codevector_dim": 768,
-   "contrastive_logits_temperature": 0.1,
-   "conv_bias": true,
-   "conv_dim": [
-     512,
-     512,
-     512,
-     512,
-     512,
-     512,
-     512
-   ],
-   "conv_kernel": [
-     10,
-     3,
-     3,
-     3,
-     3,
-     2,
-     2
-   ],
-   "conv_stride": [
-     5,
-     2,
-     2,
-     2,
-     2,
-     2,
-     2
-   ],
-   "ctc_loss_reduction": "mean",
-   "ctc_zero_infinity": false,
-   "diversity_loss_weight": 0.1,
-   "do_stable_layer_norm": true,
-   "eos_token_id": 2,
-   "feat_extract_activation": "gelu",
-   "feat_extract_dropout": 0.0,
-   "feat_extract_norm": "layer",
-   "feat_proj_dropout": 0.0,
-   "feat_quantizer_dropout": 0.0,
-   "final_dropout": 0.0,
-   "hidden_act": "gelu",
-   "hidden_dropout": 0.0,
-   "hidden_size": 1024,
-   "initializer_range": 0.02,
-   "intermediate_size": 4096,
-   "layer_norm_eps": 1e-05,
-   "layerdrop": 0.0,
-   "mask_feature_length": 64,
-   "mask_feature_min_masks": 0,
-   "mask_feature_prob": 0.25,
-   "mask_time_length": 10,
-   "mask_time_min_masks": 2,
-   "mask_time_prob": 0.75,
-   "model_type": "wav2vec2",
-   "num_adapter_layers": 3,
-   "num_attention_heads": 16,
-   "num_codevector_groups": 2,
-   "num_codevectors_per_group": 320,
-   "num_conv_pos_embedding_groups": 16,
-   "num_conv_pos_embeddings": 128,
-   "num_feat_extract_layers": 7,
-   "num_hidden_layers": 24,
-   "num_negatives": 100,
-   "output_hidden_size": 1024,
-   "pad_token_id": 40,
-   "proj_codevector_dim": 768,
-   "tdnn_dilation": [
-     1,
-     2,
-     3,
-     1,
-     1
-   ],
-   "tdnn_dim": [
-     512,
-     512,
-     512,
-     512,
-     1500
-   ],
-   "tdnn_kernel": [
-     5,
-     3,
-     3,
-     1,
-     1
-   ],
-   "torch_dtype": "float32",
-   "transformers_version": "4.17.0.dev0",
-   "use_weighted_layer_sum": false,
-   "vocab_size": 41,
-   "xvector_output_dim": 512
- }
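Note on `conv_kernel` and `conv_stride` in the deleted `config.json`: together they determine how many encoder frames the model produces per audio clip. The sketch below assumes each convolution is unpadded with dilation 1, so each layer maps a length `L` to `floor((L - kernel) / stride) + 1`, and takes the 16 kHz sampling rate from the deleted `preprocessor_config.json`. The function name is hypothetical.

```python
import math

CONV_KERNEL = [10, 3, 3, 3, 3, 2, 2]  # from config.json
CONV_STRIDE = [5, 2, 2, 2, 2, 2, 2]   # from config.json
SAMPLING_RATE = 16_000                # from preprocessor_config.json


def num_output_frames(num_samples: int) -> int:
    """Frames left after the 7 feature-extractor convolutions (assumed unpadded)."""
    length = num_samples
    for kernel, stride in zip(CONV_KERNEL, CONV_STRIDE):
        length = (length - kernel) // stride + 1
    return length


total_stride = math.prod(CONV_STRIDE)                 # input samples per output frame
frame_shift_ms = 1000 * total_stride / SAMPLING_RATE  # time between consecutive frames

print(total_stride, frame_shift_ms, num_output_frames(SAMPLING_RATE))
```

Under these assumptions one second of audio yields 49 frames spaced 20 ms apart, which is the sequence length the CTC head sees.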
eval_results.json DELETED
@@ -1,9 +0,0 @@
- {
-   "epoch": 0.38,
-   "eval_loss": 16.912879943847656,
-   "eval_runtime": 8.6337,
-   "eval_samples": 200,
-   "eval_samples_per_second": 23.165,
-   "eval_steps_per_second": 2.896,
-   "eval_wer": 2.3789039481437833
- }
preprocessor_config.json DELETED
@@ -1,9 +0,0 @@
- {
-   "do_normalize": true,
-   "feature_extractor_type": "Wav2Vec2FeatureExtractor",
-   "feature_size": 1,
-   "padding_side": "right",
-   "padding_value": 0,
-   "return_attention_mask": true,
-   "sampling_rate": 16000
- }
pytorch_model.bin DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:d1d7bd7ffd2ed6a01faf9f143c79eb1c4a1e163dd1a62f92c26af28850511bcd
- size 1262091761
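Note on the deleted `pytorch_model.bin`: the three lines above are a Git LFS pointer, not the weights themselves. The sketch below parses the pointer text and converts the size. The helper name is hypothetical, and the parameter estimate assumes 4 bytes per weight (`torch_dtype` is `float32` in `config.json`) and ignores any serialization overhead in the file, so it is only a rough upper bound.

```python
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:d1d7bd7ffd2ed6a01faf9f143c79eb1c4a1e163dd1a62f92c26af28850511bcd
size 1262091761
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
size_gib = pointer["size_bytes"] / 2**30
approx_params_millions = pointer["size_bytes"] / 4 / 1e6  # float32 assumption

print(f"{size_gib:.2f} GiB, roughly {approx_params_millions:.0f}M float32 values")
```

The size in GiB is in line with the `1.18G` upload shown in the deleted `output.log`.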
run.sh CHANGED
@@ -20,14 +20,12 @@ python run_speech_recognition_ctc.py \
  --mask_feature_prob="0.25" \
  --mask_time_length="10" \
  --mask_time_prob="0.75" \
- --max_train_samples="1000" \
- --max_eval_samples="200" \
  --model_name_or_path="facebook/wav2vec2-xls-r-300m" \
- --num_train_epochs="0.4" \
+ --num_train_epochs="2.0" \
  --output_dir="./" \
  --overwrite_output_dir \
- --per_device_train_batch_size="8" \
- --per_device_eval_batch_size="8" \
+ --per_device_train_batch_size="16" \
+ --per_device_eval_batch_size="16" \
  --preprocessing_num_workers="4" \
  --push_to_hub \
  --report_to="wandb" \
train_results.json DELETED
@@ -1,8 +0,0 @@
- {
-   "epoch": 0.38,
-   "train_loss": 13.584136962890625,
-   "train_runtime": 23.2007,
-   "train_samples": 1000,
-   "train_samples_per_second": 17.241,
-   "train_steps_per_second": 0.259
- }
trainer_state.json DELETED
@@ -1,25 +0,0 @@
- {
-   "best_metric": null,
-   "best_model_checkpoint": null,
-   "epoch": 0.384,
-   "global_step": 6,
-   "is_hyper_param_search": false,
-   "is_local_process_zero": true,
-   "is_world_process_zero": true,
-   "log_history": [
-     {
-       "epoch": 0.38,
-       "step": 6,
-       "total_flos": 5.41371015650304e+16,
-       "train_loss": 13.584136962890625,
-       "train_runtime": 23.2007,
-       "train_samples_per_second": 17.241,
-       "train_steps_per_second": 0.259
-     }
-   ],
-   "max_steps": 6,
-   "num_train_epochs": 1,
-   "total_flos": 5.41371015650304e+16,
-   "trial_name": null,
-   "trial_params": null
- }
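Note on `max_steps: 6` and `epoch: 0.384` in the deleted `trainer_state.json`: both follow from the settings of the test run (1000 training samples, per-device batch size 8, gradient accumulation 8, 0.4 epochs). The sketch below assumes a single device, no dropped last batch, and that optimizer steps per epoch are the dataloader length floor-divided by the accumulation steps. The formula is inferred from the numbers in this commit and the function name is hypothetical.

```python
import math


def planned_steps(train_samples: int, per_device_batch: int, grad_accum: int, num_epochs: float):
    """Return (optimizer steps, fractional epoch reached) for a fractional-epoch run."""
    batches_per_epoch = math.ceil(train_samples / per_device_batch)
    updates_per_epoch = max(batches_per_epoch // grad_accum, 1)
    max_steps = math.ceil(num_epochs * updates_per_epoch)
    epoch_reached = max_steps * grad_accum / batches_per_epoch
    return max_steps, epoch_reached


# Second run (the state deleted above): 0.4 epochs.
print(planned_steps(1000, 8, 8, 0.4))
# First run (wandb run 2uzt3kt1): 0.2 epochs; its output.log shows 3/3 steps and epoch 0.19.
print(planned_steps(1000, 8, 8, 0.2))
```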
training_args.bin DELETED
@@ -1,3 +0,0 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:a4f6b5e530f353910710ef95150e349ddf6f70545b8095619803d7da46090983
- size 2991
wandb/debug-internal.log DELETED
@@ -1 +0,0 @@
- run-20220130_230018-ktkg6ghu/logs/debug-internal.log
wandb/debug.log DELETED
@@ -1 +0,0 @@
- run-20220130_230018-ktkg6ghu/logs/debug.log
wandb/latest-run DELETED
@@ -1 +0,0 @@
- run-20220130_230018-ktkg6ghu
wandb/run-20220130_224738-2uzt3kt1/files/conda-environment.yaml DELETED
File without changes
wandb/run-20220130_224738-2uzt3kt1/files/config.yaml DELETED
@@ -1,698 +0,0 @@
1
- wandb_version: 1
2
-
3
- _n_gpu:
4
- desc: null
5
- value: 1
6
- _name_or_path:
7
- desc: null
8
- value: facebook/wav2vec2-xls-r-300m
9
- _wandb:
10
- desc: null
11
- value:
12
- cli_version: 0.12.9
13
- framework: huggingface
14
- huggingface_version: 4.17.0.dev0
15
- is_jupyter_run: false
16
- is_kaggle_kernel: false
17
- m:
18
- - 1: train/global_step
19
- 6:
20
- - 3
21
- - 1: train/train_runtime
22
- 5: 1
23
- 6:
24
- - 1
25
- - 1: train/train_samples_per_second
26
- 5: 1
27
- 6:
28
- - 1
29
- - 1: train/train_steps_per_second
30
- 5: 1
31
- 6:
32
- - 1
33
- - 1: train/total_flos
34
- 5: 1
35
- 6:
36
- - 1
37
- - 1: train/train_loss
38
- 5: 1
39
- 6:
40
- - 1
41
- - 1: train/epoch
42
- 5: 1
43
- 6:
44
- - 1
45
- - 1: eval/loss
46
- 5: 1
47
- 6:
48
- - 1
49
- - 1: eval/wer
50
- 5: 1
51
- 6:
52
- - 1
53
- - 1: eval/runtime
54
- 5: 1
55
- 6:
56
- - 1
57
- - 1: eval/samples_per_second
58
- 5: 1
59
- 6:
60
- - 1
61
- - 1: eval/steps_per_second
62
- 5: 1
63
- 6:
64
- - 1
65
- python_version: 3.8.8
66
- start_time: 1643582858
67
- t:
68
- 1:
69
- - 1
70
- - 5
71
- - 11
72
- 2:
73
- - 1
74
- - 5
75
- - 11
76
- 3:
77
- - 1
78
- - 7
79
- - 13
80
- 4: 3.8.8
81
- 5: 0.12.9
82
- 6: 4.17.0.dev0
83
- 8:
84
- - 5
85
- activation_dropout:
86
- desc: null
87
- value: 0.1
88
- adafactor:
89
- desc: null
90
- value: false
91
- adam_beta1:
92
- desc: null
93
- value: 0.9
94
- adam_beta2:
95
- desc: null
96
- value: 0.999
97
- adam_epsilon:
98
- desc: null
99
- value: 1.0e-08
100
- adapter_kernel_size:
101
- desc: null
102
- value: 3
103
- adapter_stride:
104
- desc: null
105
- value: 2
106
- add_adapter:
107
- desc: null
108
- value: false
109
- add_cross_attention:
110
- desc: null
111
- value: false
112
- apply_spec_augment:
113
- desc: null
114
- value: true
115
- architectures:
116
- desc: null
117
- value:
118
- - Wav2Vec2ForPreTraining
119
- attention_dropout:
120
- desc: null
121
- value: 0.0
122
- bad_words_ids:
123
- desc: null
124
- value: null
125
- bf16:
126
- desc: null
127
- value: false
128
- bf16_full_eval:
129
- desc: null
130
- value: false
131
- bos_token_id:
132
- desc: null
133
- value: 1
134
- chunk_size_feed_forward:
135
- desc: null
136
- value: 0
137
- classifier_proj_size:
138
- desc: null
139
- value: 256
140
- codevector_dim:
141
- desc: null
142
- value: 768
143
- contrastive_logits_temperature:
144
- desc: null
145
- value: 0.1
146
- conv_bias:
147
- desc: null
148
- value: true
149
- conv_dim:
150
- desc: null
151
- value:
152
- - 512
153
- - 512
154
- - 512
155
- - 512
156
- - 512
157
- - 512
158
- - 512
159
- conv_kernel:
160
- desc: null
161
- value:
162
- - 10
163
- - 3
164
- - 3
165
- - 3
166
- - 3
167
- - 2
168
- - 2
169
- conv_stride:
170
- desc: null
171
- value:
172
- - 5
173
- - 2
174
- - 2
175
- - 2
176
- - 2
177
- - 2
178
- - 2
179
- cross_attention_hidden_size:
180
- desc: null
181
- value: null
182
- ctc_loss_reduction:
183
- desc: null
184
- value: mean
185
- ctc_zero_infinity:
186
- desc: null
187
- value: false
188
- dataloader_drop_last:
189
- desc: null
190
- value: false
191
- dataloader_num_workers:
192
- desc: null
193
- value: 0
194
- dataloader_pin_memory:
195
- desc: null
196
- value: true
197
- ddp_bucket_cap_mb:
198
- desc: null
199
- value: None
200
- ddp_find_unused_parameters:
201
- desc: null
202
- value: None
203
- debug:
204
- desc: null
205
- value: '[]'
206
- decoder_start_token_id:
207
- desc: null
208
- value: null
209
- deepspeed:
210
- desc: null
211
- value: None
212
- disable_tqdm:
213
- desc: null
214
- value: false
215
- diversity_loss_weight:
216
- desc: null
217
- value: 0.1
218
- diversity_penalty:
219
- desc: null
220
- value: 0.0
221
- do_eval:
222
- desc: null
223
- value: true
224
- do_predict:
225
- desc: null
226
- value: false
227
- do_sample:
228
- desc: null
229
- value: false
230
- do_stable_layer_norm:
231
- desc: null
232
- value: true
233
- do_train:
234
- desc: null
235
- value: true
236
- early_stopping:
237
- desc: null
238
- value: false
239
- encoder_no_repeat_ngram_size:
240
- desc: null
241
- value: 0
242
- eos_token_id:
243
- desc: null
244
- value: 2
245
- eval_accumulation_steps:
246
- desc: null
247
- value: None
248
- eval_batch_size:
249
- desc: null
250
- value: 8
251
- eval_steps:
252
- desc: null
253
- value: 500
254
- evaluation_strategy:
255
- desc: null
256
- value: steps
257
- feat_extract_activation:
258
- desc: null
259
- value: gelu
260
- feat_extract_dropout:
261
- desc: null
262
- value: 0.0
263
- feat_extract_norm:
264
- desc: null
265
- value: layer
266
- feat_proj_dropout:
267
- desc: null
268
- value: 0.0
269
- feat_quantizer_dropout:
270
- desc: null
271
- value: 0.0
272
- final_dropout:
273
- desc: null
274
- value: 0.0
275
- finetuning_task:
276
- desc: null
277
- value: null
278
- forced_bos_token_id:
279
- desc: null
280
- value: null
281
- forced_eos_token_id:
282
- desc: null
283
- value: null
284
- fp16:
285
- desc: null
286
- value: true
287
- fp16_backend:
288
- desc: null
289
- value: auto
290
- fp16_full_eval:
291
- desc: null
292
- value: false
293
- fp16_opt_level:
294
- desc: null
295
- value: O1
296
- gradient_accumulation_steps:
297
- desc: null
298
- value: 8
299
- gradient_checkpointing:
300
- desc: null
301
- value: true
302
- greater_is_better:
303
- desc: null
304
- value: false
305
- group_by_length:
306
- desc: null
307
- value: true
308
- half_precision_backend:
309
- desc: null
310
- value: amp
311
- hidden_act:
312
- desc: null
313
- value: gelu
314
- hidden_dropout:
315
- desc: null
316
- value: 0.0
317
- hidden_size:
318
- desc: null
319
- value: 1024
320
- hub_model_id:
321
- desc: null
322
- value: None
323
- hub_strategy:
324
- desc: null
325
- value: every_save
326
- hub_token:
327
- desc: null
328
- value: <HUB_TOKEN>
329
- id2label:
330
- desc: null
331
- value:
332
- '0': LABEL_0
333
- '1': LABEL_1
334
- ignore_data_skip:
335
- desc: null
336
- value: false
337
- initializer_range:
338
- desc: null
339
- value: 0.02
340
- intermediate_size:
341
- desc: null
342
- value: 4096
343
- is_decoder:
344
- desc: null
345
- value: false
346
- is_encoder_decoder:
347
- desc: null
348
- value: false
349
- label2id:
350
- desc: null
351
- value:
352
- LABEL_0: 0
353
- LABEL_1: 1
354
- label_names:
355
- desc: null
356
- value: None
357
- label_smoothing_factor:
358
- desc: null
359
- value: 0.0
360
- layer_norm_eps:
361
- desc: null
362
- value: 1.0e-05
363
- layerdrop:
364
- desc: null
365
- value: 0.0
366
- learning_rate:
367
- desc: null
368
- value: 7.5e-05
369
- length_column_name:
370
- desc: null
371
- value: input_length
372
- length_penalty:
373
- desc: null
374
- value: 1.0
375
- load_best_model_at_end:
376
- desc: null
377
- value: true
378
- local_rank:
379
- desc: null
380
- value: -1
381
- log_level:
382
- desc: null
383
- value: -1
384
- log_level_replica:
385
- desc: null
386
- value: -1
387
- log_on_each_node:
388
- desc: null
389
- value: true
390
- logging_dir:
391
- desc: null
392
- value: ./runs/Jan30_22-46-41_job-3261699b-76eb-4c28-8419-66a66c5c9199
393
- logging_first_step:
394
- desc: null
395
- value: false
396
- logging_nan_inf_filter:
397
- desc: null
398
- value: true
399
- logging_steps:
400
- desc: null
401
- value: 100
402
- logging_strategy:
403
- desc: null
404
- value: steps
405
- lr_scheduler_type:
406
- desc: null
407
- value: linear
408
- mask_feature_length:
409
- desc: null
410
- value: 64
411
- mask_feature_min_masks:
412
- desc: null
413
- value: 0
414
- mask_feature_prob:
415
- desc: null
416
- value: 0.25
417
- mask_time_length:
418
- desc: null
419
- value: 10
420
- mask_time_min_masks:
421
- desc: null
422
- value: 2
423
- mask_time_prob:
424
- desc: null
425
- value: 0.75
426
- max_grad_norm:
427
- desc: null
428
- value: 1.0
429
- max_length:
430
- desc: null
431
- value: 20
432
- max_steps:
433
- desc: null
434
- value: -1
435
- metric_for_best_model:
436
- desc: null
437
- value: loss
438
- min_length:
439
- desc: null
440
- value: 0
441
- model_type:
442
- desc: null
443
- value: wav2vec2
444
- mp_parameters:
445
- desc: null
446
- value: ''
447
- no_cuda:
448
- desc: null
449
- value: false
450
- no_repeat_ngram_size:
451
- desc: null
452
- value: 0
453
- num_adapter_layers:
454
- desc: null
455
- value: 3
456
- num_attention_heads:
457
- desc: null
458
- value: 16
459
- num_beam_groups:
460
- desc: null
461
- value: 1
462
- num_beams:
463
- desc: null
464
- value: 1
465
- num_codevector_groups:
466
- desc: null
467
- value: 2
468
- num_codevectors_per_group:
469
- desc: null
470
- value: 320
471
- num_conv_pos_embedding_groups:
472
- desc: null
473
- value: 16
474
- num_conv_pos_embeddings:
475
- desc: null
476
- value: 128
477
- num_feat_extract_layers:
478
- desc: null
479
- value: 7
480
- num_hidden_layers:
481
- desc: null
482
- value: 24
483
- num_negatives:
484
- desc: null
485
- value: 100
486
- num_return_sequences:
487
- desc: null
488
- value: 1
489
- num_train_epochs:
490
- desc: null
491
- value: 0.2
492
- optim:
493
- desc: null
494
- value: adamw_hf
495
- output_attentions:
496
- desc: null
497
- value: false
498
- output_dir:
499
- desc: null
500
- value: ./
501
- output_hidden_size:
502
- desc: null
503
- value: 1024
504
- output_hidden_states:
505
- desc: null
506
- value: false
507
- output_scores:
508
- desc: null
509
- value: false
510
- overwrite_output_dir:
511
- desc: null
512
- value: true
513
- pad_token_id:
514
- desc: null
515
- value: 40
516
- past_index:
517
- desc: null
518
- value: -1
519
- per_device_eval_batch_size:
520
- desc: null
521
- value: 8
522
- per_device_train_batch_size:
523
- desc: null
524
- value: 8
525
- per_gpu_eval_batch_size:
526
- desc: null
527
- value: None
528
- per_gpu_train_batch_size:
529
- desc: null
530
- value: None
531
- prediction_loss_only:
532
- desc: null
533
- value: false
534
- prefix:
535
- desc: null
536
- value: null
537
- problem_type:
538
- desc: null
539
- value: null
540
- proj_codevector_dim:
541
- desc: null
542
- value: 768
543
- pruned_heads:
544
- desc: null
545
- value: {}
546
- push_to_hub:
547
- desc: null
548
- value: true
549
- push_to_hub_model_id:
550
- desc: null
551
- value: None
552
- push_to_hub_organization:
553
- desc: null
554
- value: None
555
- push_to_hub_token:
556
- desc: null
557
- value: <PUSH_TO_HUB_TOKEN>
558
- remove_invalid_values:
559
- desc: null
560
- value: false
561
- remove_unused_columns:
562
- desc: null
563
- value: true
564
- repetition_penalty:
565
- desc: null
566
- value: 1.0
567
- report_to:
568
- desc: null
569
- value: '[''wandb'']'
570
- resume_from_checkpoint:
571
- desc: null
572
- value: None
573
- return_dict:
574
- desc: null
575
- value: true
576
- return_dict_in_generate:
577
- desc: null
578
- value: false
579
- run_name:
580
- desc: null
581
- value: ./
582
- save_on_each_node:
583
- desc: null
584
- value: false
585
- save_steps:
586
- desc: null
587
- value: 500
588
- save_strategy:
589
- desc: null
590
- value: steps
591
- save_total_limit:
592
- desc: null
593
- value: 3
594
- seed:
595
- desc: null
596
- value: 42
597
- sep_token_id:
598
- desc: null
599
- value: null
600
- sharded_ddp:
601
- desc: null
602
- value: '[]'
603
- skip_memory_metrics:
604
- desc: null
605
- value: true
606
- task_specific_params:
607
- desc: null
608
- value: null
609
- tdnn_dilation:
610
- desc: null
611
- value:
612
- - 1
613
- - 2
614
- - 3
615
- - 1
616
- - 1
617
- tdnn_dim:
618
- desc: null
619
- value:
620
- - 512
621
- - 512
622
- - 512
623
- - 512
624
- - 1500
625
- tdnn_kernel:
626
- desc: null
627
- value:
628
- - 5
629
- - 3
630
- - 3
631
- - 1
632
- - 1
633
- temperature:
634
- desc: null
635
- value: 1.0
636
- tf32:
637
- desc: null
638
- value: None
639
- tie_encoder_decoder:
640
- desc: null
641
- value: false
642
- tie_word_embeddings:
643
- desc: null
644
- value: true
645
- tokenizer_class:
646
- desc: null
647
- value: null
648
- top_k:
649
- desc: null
650
- value: 50
651
- top_p:
652
- desc: null
653
- value: 1.0
654
- torch_dtype:
655
- desc: null
656
- value: float32
657
- torchscript:
658
- desc: null
659
- value: false
660
- tpu_metrics_debug:
661
- desc: null
662
- value: false
663
- tpu_num_cores:
664
- desc: null
665
- value: None
666
- train_batch_size:
667
- desc: null
668
- value: 8
669
- transformers_version:
670
- desc: null
671
- value: 4.17.0.dev0
672
- use_bfloat16:
673
- desc: null
674
- value: false
675
- use_legacy_prediction_loop:
676
- desc: null
677
- value: false
678
- use_weighted_layer_sum:
679
- desc: null
680
- value: false
681
- vocab_size:
682
- desc: null
683
- value: 41
684
- warmup_ratio:
685
- desc: null
686
- value: 0.0
687
- warmup_steps:
688
- desc: null
689
- value: 2000
690
- weight_decay:
691
- desc: null
692
- value: 0.0
693
- xpu_backend:
694
- desc: null
695
- value: None
696
- xvector_output_dim:
697
- desc: null
698
- value: 512
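Note on `start_time: 1643582858` in the deleted wandb `config.yaml`: read as a Unix timestamp in UTC, it matches the timestamp embedded in the run directory name `run-20220130_224738-2uzt3kt1`. The sketch below performs the conversion. That the directory name uses the same clock and time zone is an observation from this one run, not a documented guarantee.

```python
from datetime import datetime, timezone

START_TIME = 1643582858                   # start_time from config.yaml
RUN_DIR = "run-20220130_224738-2uzt3kt1"  # directory name from this commit

started = datetime.fromtimestamp(START_TIME, tz=timezone.utc)
stamp_from_epoch = started.strftime("%Y%m%d_%H%M%S")
stamp_from_dir = RUN_DIR.split("-")[1]

print(started.isoformat(), stamp_from_epoch == stamp_from_dir)
```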
wandb/run-20220130_224738-2uzt3kt1/files/output.log DELETED
@@ -1,66 +0,0 @@
1
-
2
-
3
-
4
- 67%|██████████████████████████████████████████████████████████████████████████████████████████ | 2/3 [00:07<00:03, 3.88s/it]
5
- 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:10<00:00, 3.23s/it]
6
- Training completed. Do not forget to share your model on huggingface.co/models =)
7
- 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:10<00:00, 3.48s/it]
8
- Saving model checkpoint to ./
9
- Configuration saved in ./config.json
10
- Model weights saved in ./pytorch_model.bin
11
- Configuration saved in ./preprocessor_config.json
12
- Saving model checkpoint to ./
13
- Configuration saved in ./config.json
14
- Model weights saved in ./pytorch_model.bin
15
- Configuration saved in ./preprocessor_config.json
16
- Upload file pytorch_model.bin: 0%| | 3.39k/1.18G [00:00<?, ?B/s]
17
- Upload file training_args.bin: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:00<?, ?B/s]
18
- 01/30/2022 22:49:01 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
19
- 1d17287..8ac44c4 main -> main0%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:00<?, ?B/s]
20
- Upload file pytorch_model.bin: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████| 1.18G/1.18G [00:42<00:00, 29.7MB/s]
21
- Upload file training_args.bin: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:42<?, ?B/s]
22
- Dropping the following result as it does not have all the necessary fields:██████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:42<?, ?B/s]
23
- {}
24
- 01/30/2022 22:49:07 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
25
- 8ac44c4..77260d3 main -> main
26
- To https://huggingface.co/Plim/xls-r-300m-fr
27
- 8ac44c4..77260d3 main -> main
28
- The following columns in the evaluation set don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.
29
- ***** Running Evaluation *****
30
- Num examples = 200
31
- Batch size = 8
32
- 0%| | 0/25 [00:00<?, ?it/s]
33
- ***** train metrics *****
34
- epoch = 0.19
35
- train_loss = 12.4969
36
- train_runtime = 0:00:12.89
37
- train_samples = 1000
38
- train_samples_per_second = 15.512
39
- train_steps_per_second = 0.233
40
-
41
-
42
-
43
-
44
- 96%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▋ | 24/25 [00:07<00:00, 2.74it/s]
45
- ***** eval metrics *****
46
- epoch = 0.19
47
- eval_loss = 16.9132
48
- eval_runtime = 0:00:08.67
49
- eval_samples = 200
50
- eval_samples_per_second = 23.067
51
- eval_steps_per_second = 2.883
52
- 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 25/25 [00:08<00:00, 3.02it/s]
53
- Saving model checkpoint to ./
54
- Configuration saved in ./config.json
55
- Model weights saved in ./pytorch_model.bin
56
- Configuration saved in ./preprocessor_config.json
57
- 01/30/2022 22:49:41 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
58
- 77260d3..45cb5d4 main -> main
59
- To https://huggingface.co/Plim/xls-r-300m-fr
60
- 77260d3..45cb5d4 main -> main
61
- Dropping the following result as it does not have all the necessary fields:
62
- {}
63
- 01/30/2022 22:49:47 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
64
- 45cb5d4..1fb68dc main -> main
65
- To https://huggingface.co/Plim/xls-r-300m-fr
66
- 45cb5d4..1fb68dc main -> main
wandb/run-20220130_224738-2uzt3kt1/files/requirements.txt DELETED
@@ -1,180 +0,0 @@
- aiohttp==3.8.1
- aiosignal==1.2.0
- analytics-python==1.4.0
- anyio==3.5.0
- appdirs==1.4.4
- argon2-cffi-bindings==21.2.0
- argon2-cffi==21.3.0
- asgiref==3.5.0
- asttokens==2.0.5
- async-timeout==4.0.2
- attrs==21.4.0
- audioread==2.1.9
- backcall==0.2.0
- backoff==1.10.0
- bcrypt==3.2.0
- beautifulsoup4==4.9.3
- black==21.12b0
- bleach==4.1.0
- brotlipy==0.7.0
- certifi==2020.12.5
- cffi==1.14.3
- chardet==3.0.4
- charset-normalizer==2.0.10
- click==8.0.3
- conda-build==3.21.4
- conda-package-handling==1.7.2
- conda==4.9.2
- configparser==5.2.0
- cryptography==3.2.1
- cycler==0.11.0
- datasets==1.18.2.dev0
- debugpy==1.5.1
- decorator==4.4.2
- defusedxml==0.7.1
- dill==0.3.4
- dnspython==2.1.0
- docker-pycreds==0.4.0
- entrypoints==0.3
- executing==0.8.2
- fastapi==0.73.0
- ffmpy==0.3.0
- filelock==3.0.12
- fonttools==4.29.0
- frozenlist==1.3.0
- fsspec==2022.1.0
- gitdb==4.0.9
- gitpython==3.1.26
- glob2==0.7
- gradio==2.7.5.2
- h11==0.13.0
- huggingface-hub==0.4.0
- idna==2.10
- importlib-resources==5.4.0
- ipykernel==6.7.0
- ipython-genutils==0.2.0
- ipython==8.0.1
- ipywidgets==7.6.3
- jedi==0.17.0
- jinja2==2.11.3
- jiwer==2.3.0
- joblib==1.1.0
- json5==0.9.6
- jsonschema==4.4.0
- jupyter-client==7.1.2
- jupyter-core==4.9.1
- jupyterlab-pygments==0.1.2
- jupyterlab-server==1.2.0
- jupyterlab-widgets==1.0.2
- jupyterlab==2.2.9
- kiwisolver==1.3.2
- libarchive-c==2.9
- librosa==0.8.1
- llvmlite==0.38.0
- markdown2==2.4.2
- markupsafe==1.1.1
- matplotlib-inline==0.1.3
- matplotlib==3.5.1
- mistune==0.8.4
- mkl-fft==1.3.0
- mkl-random==1.1.1
- mkl-service==2.3.0
- monotonic==1.6
- multidict==6.0.2
- multiprocess==0.70.12.2
- mypy-extensions==0.4.3
- nano==0.10.0
- nbclient==0.5.10
- nbconvert==6.4.1
- nbformat==5.1.3
- nest-asyncio==1.5.4
- notebook==6.4.8
- numba==0.55.1
- numpy==1.19.2
- olefile==0.46
95
- packaging==21.3
96
- pandas==1.4.0
97
- pandocfilters==1.5.0
98
- paramiko==2.9.2
99
- parso==0.8.1
100
- pathspec==0.9.0
101
- pathtools==0.1.2
102
- pexpect==4.8.0
103
- pickleshare==0.7.5
104
- pillow==8.1.2
105
- pip==21.3.1
106
- pkginfo==1.7.0
107
- platformdirs==2.4.1
108
- pooch==1.6.0
109
- prometheus-client==0.13.0
110
- promise==2.3
111
- prompt-toolkit==3.0.8
112
- protobuf==3.19.4
113
- psutil==5.8.0
114
- ptyprocess==0.7.0
115
- pure-eval==0.2.2
116
- pyarrow==6.0.1
117
- pycosat==0.6.3
118
- pycparser==2.20
119
- pycryptodome==3.13.0
120
- pydantic==1.9.0
121
- pydub==0.25.1
122
- pygments==2.8.0
123
- pynacl==1.5.0
124
- pyopenssl==19.1.0
125
- pyparsing==3.0.7
126
- pyrsistent==0.18.1
127
- pysocks==1.7.1
128
- python-dateutil==2.8.2
129
- python-etcd==0.4.5
130
- python-levenshtein==0.12.2
131
- python-multipart==0.0.5
132
- pytz==2021.1
133
- pyyaml==5.4.1
134
- pyzmq==22.3.0
135
- regex==2022.1.18
136
- requests==2.24.0
137
- resampy==0.2.2
138
- ruamel-yaml==0.15.87
139
- sacremoses==0.0.47
140
- scikit-learn==1.0.2
141
- scipy==1.7.3
142
- send2trash==1.8.0
143
- sentry-sdk==1.5.4
144
- setuptools==50.3.1.post20201107
145
- shortuuid==1.0.8
146
- six==1.15.0
147
- smmap==5.0.0
148
- sniffio==1.2.0
149
- soundfile==0.10.3.post1
150
- soupsieve==2.2
151
- stack-data==0.1.4
152
- starlette==0.17.1
153
- subprocess32==3.5.4
154
- termcolor==1.1.0
155
- terminado==0.13.1
156
- testpath==0.5.0
157
- threadpoolctl==3.0.0
158
- tokenizers==0.11.4
159
- tomli==1.2.3
160
- torch==1.10.2
161
- torchaudio==0.10.2
162
- torchelastic==0.2.2
163
- torchtext==0.9.1
164
- torchvision==0.9.1
165
- tornado==6.1
166
- tqdm==4.62.3
167
- traitlets==5.1.1
168
- transformers==4.17.0.dev0
169
- typing-extensions==4.0.1
170
- urllib3==1.25.11
171
- uvicorn==0.17.1
172
- wandb==0.12.9
173
- wcwidth==0.2.5
174
- webencodings==0.5.1
175
- wheel==0.35.1
176
- widgetsnbextension==3.5.2
177
- xxhash==2.0.2
178
- yarl==1.7.2
179
- yaspin==2.1.0
180
- zipp==3.7.0
wandb/run-20220130_224738-2uzt3kt1/files/wandb-metadata.json DELETED
@@ -1,63 +0,0 @@
1
- {
2
- "os": "Linux-4.15.0-151-generic-x86_64-with-glibc2.10",
3
- "python": "3.8.8",
4
- "heartbeatAt": "2022-01-30T22:47:39.607019",
5
- "startedAt": "2022-01-30T22:47:38.310593",
6
- "docker": null,
7
- "gpu": "Tesla V100S-PCIE-32GB",
8
- "gpu_count": 1,
9
- "cpu_count": 60,
10
- "cuda": null,
11
- "args": [
12
- "--activation_dropout=0.1",
13
- "--dataset_name=mozilla-foundation/common_voice_7_0",
14
- "--dataset_config_name=fr",
15
- "--eval_steps=500",
16
- "--evaluation_strategy=steps",
17
- "--feat_proj_dropout=0.0",
18
- "--freeze_feature_encoder",
19
- "--fp16",
20
- "--gradient_accumulation_steps=8",
21
- "--gradient_checkpointing",
22
- "--group_by_length",
23
- "--layerdrop=0.0",
24
- "--learning_rate=7.5e-5",
25
- "--length_column_name=input_length",
26
- "--load_best_model_at_end",
27
- "--logging_steps=100",
28
- "--mask_feature_length=64",
29
- "--mask_feature_prob=0.25",
30
- "--mask_time_length=10",
31
- "--mask_time_prob=0.75",
32
- "--max_train_samples=1000",
33
- "--max_eval_samples=200",
34
- "--model_name_or_path=facebook/wav2vec2-xls-r-300m",
35
- "--num_train_epochs=0.2",
36
- "--output_dir=./",
37
- "--overwrite_output_dir",
38
- "--per_device_train_batch_size=8",
39
- "--per_device_eval_batch_size=8",
40
- "--preprocessing_num_workers=4",
41
- "--push_to_hub",
42
- "--report_to=wandb",
43
- "--save_steps=500",
44
- "--save_total_limit=3",
45
- "--text_column_name=sentence",
46
- "--use_auth_token",
47
- "--warmup_steps=2000",
48
- "--do_train",
49
- "--do_eval"
50
- ],
51
- "state": "running",
52
- "program": "run_speech_recognition_ctc.py",
53
- "codePath": "run_speech_recognition_ctc.py",
54
- "git": {
55
- "remote": "https://huggingface.co/Plim/xls-r-300m-fr",
56
- "commit": "1d172876193bf100999c8d09d283f8d0894252f2"
57
- },
58
- "email": "lim.pascal93@gmail.com",
59
- "root": "/workspace/xls-r-300m-fr",
60
- "host": "job-3261699b-76eb-4c28-8419-66a66c5c9199",
61
- "username": "ovh",
62
- "executable": "/opt/conda/bin/python"
63
- }
wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json DELETED
@@ -1 +0,0 @@
1
- {"train/train_runtime": 12.893, "train/train_samples_per_second": 15.512, "train/train_steps_per_second": 0.233, "train/total_flos": 2.67196543170048e+16, "train/train_loss": 12.496875762939453, "train/epoch": 0.19, "train/global_step": 3, "_runtime": 100, "_timestamp": 1643582958, "_step": 1, "eval/loss": 16.913198471069336, "eval/wer": 2.3629935179728934, "eval/runtime": 8.6705, "eval/samples_per_second": 23.067, "eval/steps_per_second": 2.883, "_wandb": {"runtime": 133}}
wandb/run-20220130_224738-2uzt3kt1/logs/debug-internal.log DELETED
@@ -1,210 +0,0 @@
1
- 2022-01-30 22:47:39,297 INFO MainThread:23196 [internal.py:wandb_internal():87] W&B internal server running at pid: 23196, started at: 2022-01-30 22:47:39.296970
2
- 2022-01-30 22:47:39,300 INFO WriterThread:23196 [datastore.py:open_for_write():77] open: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/run-2uzt3kt1.wandb
3
- 2022-01-30 22:47:39,301 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: check_version
4
- 2022-01-30 22:47:39,304 DEBUG SenderThread:23196 [sender.py:send():234] send: header
5
- 2022-01-30 22:47:39,304 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: check_version
6
- 2022-01-30 22:47:39,377 DEBUG SenderThread:23196 [sender.py:send():234] send: run
7
- 2022-01-30 22:47:39,597 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: run_start
8
- 2022-01-30 22:47:39,599 INFO SenderThread:23196 [dir_watcher.py:__init__():169] watching files in: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files
9
- 2022-01-30 22:47:39,600 INFO SenderThread:23196 [sender.py:_start_run_threads():804] run started: 2uzt3kt1 with start time 1643582858
10
- 2022-01-30 22:47:39,600 DEBUG SenderThread:23196 [sender.py:send():234] send: summary
11
- 2022-01-30 22:47:39,600 INFO SenderThread:23196 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
12
- 2022-01-30 22:47:39,606 DEBUG HandlerThread:23196 [meta.py:__init__():40] meta init
13
- 2022-01-30 22:47:39,606 DEBUG HandlerThread:23196 [meta.py:__init__():54] meta init done
14
- 2022-01-30 22:47:39,606 DEBUG HandlerThread:23196 [meta.py:probe():214] probe
15
- 2022-01-30 22:47:39,615 DEBUG HandlerThread:23196 [meta.py:_setup_git():204] setup git
16
- 2022-01-30 22:47:39,653 DEBUG HandlerThread:23196 [meta.py:_setup_git():211] setup git done
17
- 2022-01-30 22:47:39,653 DEBUG HandlerThread:23196 [meta.py:_save_pip():58] save pip
18
- 2022-01-30 22:47:39,654 DEBUG HandlerThread:23196 [meta.py:_save_pip():72] save pip done
19
- 2022-01-30 22:47:39,655 DEBUG HandlerThread:23196 [meta.py:_save_conda():79] save conda
20
- 2022-01-30 22:47:40,176 DEBUG HandlerThread:23196 [meta.py:_save_conda():89] save conda done
21
- 2022-01-30 22:47:40,176 DEBUG HandlerThread:23196 [meta.py:probe():252] probe done
22
- 2022-01-30 22:47:40,185 DEBUG SenderThread:23196 [sender.py:send():234] send: files
23
- 2022-01-30 22:47:40,186 INFO SenderThread:23196 [sender.py:_save_file():939] saving file wandb-metadata.json with policy now
24
- 2022-01-30 22:47:40,197 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
25
- 2022-01-30 22:47:40,198 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
26
- 2022-01-30 22:47:40,354 DEBUG SenderThread:23196 [sender.py:send():234] send: config
27
- 2022-01-30 22:47:40,357 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
28
- 2022-01-30 22:47:40,357 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
29
- 2022-01-30 22:47:40,357 WARNING SenderThread:23196 [sender.py:send_metric():897] Seen metric with glob (shouldnt happen)
30
- 2022-01-30 22:47:40,601 INFO Thread-8 :23196 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/requirements.txt
31
- 2022-01-30 22:47:40,601 INFO Thread-8 :23196 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json
32
- 2022-01-30 22:47:40,601 INFO Thread-8 :23196 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
33
- 2022-01-30 22:47:40,601 INFO Thread-8 :23196 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-metadata.json
34
- 2022-01-30 22:47:40,601 INFO Thread-8 :23196 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/conda-environment.yaml
35
- 2022-01-30 22:47:40,709 INFO Thread-11 :23196 [upload_job.py:push():137] Uploaded file /tmp/tmp51rrl_mrwandb/2urk4c1m-wandb-metadata.json
36
- 2022-01-30 22:47:42,601 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
37
- 2022-01-30 22:47:46,604 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
38
- 2022-01-30 22:47:50,606 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
39
- 2022-01-30 22:47:50,676 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
40
- 2022-01-30 22:47:50,677 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
41
- 2022-01-30 22:47:50,677 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
42
- 2022-01-30 22:47:50,677 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
43
- 2022-01-30 22:47:50,677 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
44
- 2022-01-30 22:47:50,677 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
45
- 2022-01-30 22:47:50,678 DEBUG SenderThread:23196 [sender.py:send():234] send: history
46
- 2022-01-30 22:47:50,678 DEBUG SenderThread:23196 [sender.py:send():234] send: summary
47
- 2022-01-30 22:47:50,679 INFO SenderThread:23196 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
48
- 2022-01-30 22:47:51,607 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json
49
- 2022-01-30 22:47:52,608 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
50
- 2022-01-30 22:47:54,610 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
51
- 2022-01-30 22:47:55,488 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
52
- 2022-01-30 22:47:55,488 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
53
- 2022-01-30 22:47:56,611 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
54
- 2022-01-30 22:48:07,838 DEBUG SenderThread:23196 [sender.py:send():234] send: stats
55
- 2022-01-30 22:48:10,621 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/config.yaml
56
- 2022-01-30 22:48:10,741 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
57
- 2022-01-30 22:48:10,742 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
58
- 2022-01-30 22:48:20,628 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
59
- 2022-01-30 22:48:25,903 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
60
- 2022-01-30 22:48:25,903 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
61
- 2022-01-30 22:48:38,112 DEBUG SenderThread:23196 [sender.py:send():234] send: stats
62
- 2022-01-30 22:48:41,064 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
63
- 2022-01-30 22:48:41,065 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
64
- 2022-01-30 22:48:56,227 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
65
- 2022-01-30 22:48:56,228 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
66
- 2022-01-30 22:49:02,656 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
67
- 2022-01-30 22:49:06,659 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
68
- 2022-01-30 22:49:08,329 DEBUG SenderThread:23196 [sender.py:send():234] send: stats
69
- 2022-01-30 22:49:08,660 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
70
- 2022-01-30 22:49:10,661 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
71
- 2022-01-30 22:49:11,388 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
72
- 2022-01-30 22:49:11,389 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
73
- 2022-01-30 22:49:12,663 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
74
- 2022-01-30 22:49:14,664 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
75
- 2022-01-30 22:49:16,666 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
76
- 2022-01-30 22:49:18,553 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
77
- 2022-01-30 22:49:18,554 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
78
- 2022-01-30 22:49:18,554 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
79
- 2022-01-30 22:49:18,554 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
80
- 2022-01-30 22:49:18,554 DEBUG SenderThread:23196 [sender.py:send():234] send: metric
81
- 2022-01-30 22:49:18,555 DEBUG SenderThread:23196 [sender.py:send():234] send: history
82
- 2022-01-30 22:49:18,555 DEBUG SenderThread:23196 [sender.py:send():234] send: summary
83
- 2022-01-30 22:49:18,556 INFO SenderThread:23196 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
84
- 2022-01-30 22:49:18,667 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json
85
- 2022-01-30 22:49:18,667 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
86
- 2022-01-30 22:49:20,670 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
87
- 2022-01-30 22:49:22,672 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
88
- 2022-01-30 22:49:26,645 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
89
- 2022-01-30 22:49:26,645 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
90
- 2022-01-30 22:49:38,549 DEBUG SenderThread:23196 [sender.py:send():234] send: stats
91
- 2022-01-30 22:49:41,813 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: stop_status
92
- 2022-01-30 22:49:41,814 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: stop_status
93
- 2022-01-30 22:49:42,687 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/config.yaml
94
- 2022-01-30 22:49:42,688 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
95
- 2022-01-30 22:49:46,690 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
96
- 2022-01-30 22:49:48,691 INFO Thread-8 :23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
97
- 2022-01-30 22:49:52,753 DEBUG SenderThread:23196 [sender.py:send():234] send: telemetry
98
- 2022-01-30 22:49:52,754 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
99
- 2022-01-30 22:49:52,755 DEBUG SenderThread:23196 [sender.py:send():234] send: exit
100
- 2022-01-30 22:49:52,755 INFO SenderThread:23196 [sender.py:send_exit():366] handling exit code: 0
101
- 2022-01-30 22:49:52,756 INFO SenderThread:23196 [sender.py:send_exit():368] handling runtime: 133
102
- 2022-01-30 22:49:52,757 INFO SenderThread:23196 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
103
- 2022-01-30 22:49:52,757 INFO SenderThread:23196 [sender.py:send_exit():374] send defer
104
- 2022-01-30 22:49:52,757 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
105
- 2022-01-30 22:49:52,759 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
106
- 2022-01-30 22:49:52,759 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 0
107
- 2022-01-30 22:49:52,759 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
108
- 2022-01-30 22:49:52,759 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 0
109
- 2022-01-30 22:49:52,760 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 1
110
- 2022-01-30 22:49:52,760 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
111
- 2022-01-30 22:49:52,760 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 1
112
- 2022-01-30 22:49:52,860 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
113
- 2022-01-30 22:49:52,860 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 1
114
- 2022-01-30 22:49:52,860 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 2
115
- 2022-01-30 22:49:52,862 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
116
- 2022-01-30 22:49:52,862 DEBUG SenderThread:23196 [sender.py:send():234] send: stats
117
- 2022-01-30 22:49:52,863 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
118
- 2022-01-30 22:49:52,864 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
119
- 2022-01-30 22:49:52,864 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 2
120
- 2022-01-30 22:49:52,865 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
121
- 2022-01-30 22:49:52,865 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 2
122
- 2022-01-30 22:49:52,865 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 3
123
- 2022-01-30 22:49:52,866 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
124
- 2022-01-30 22:49:52,866 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 3
125
- 2022-01-30 22:49:52,869 DEBUG SenderThread:23196 [sender.py:send():234] send: summary
126
- 2022-01-30 22:49:52,870 INFO SenderThread:23196 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
127
- 2022-01-30 22:49:52,870 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
128
- 2022-01-30 22:49:52,871 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 3
129
- 2022-01-30 22:49:52,871 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 4
130
- 2022-01-30 22:49:52,871 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
131
- 2022-01-30 22:49:52,872 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 4
132
- 2022-01-30 22:49:52,872 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
133
- 2022-01-30 22:49:52,872 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 4
134
- 2022-01-30 22:49:52,966 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
135
- 2022-01-30 22:49:53,114 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 5
136
- 2022-01-30 22:49:53,114 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
137
- 2022-01-30 22:49:53,116 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
138
- 2022-01-30 22:49:53,116 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 5
139
- 2022-01-30 22:49:53,116 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
140
- 2022-01-30 22:49:53,116 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 5
141
- 2022-01-30 22:49:53,116 INFO SenderThread:23196 [dir_watcher.py:finish():283] shutting down directory watcher
142
- 2022-01-30 22:49:53,218 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
143
- 2022-01-30 22:49:53,695 INFO SenderThread:23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/config.yaml
144
- 2022-01-30 22:49:53,696 INFO SenderThread:23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
145
- 2022-01-30 22:49:53,696 INFO SenderThread:23196 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json
146
- 2022-01-30 22:49:53,697 INFO SenderThread:23196 [dir_watcher.py:finish():313] scan: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files
147
- 2022-01-30 22:49:53,697 INFO SenderThread:23196 [dir_watcher.py:finish():327] scan save: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/conda-environment.yaml conda-environment.yaml
148
- 2022-01-30 22:49:53,698 INFO SenderThread:23196 [dir_watcher.py:finish():327] scan save: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-metadata.json wandb-metadata.json
149
- 2022-01-30 22:49:53,698 INFO SenderThread:23196 [dir_watcher.py:finish():327] scan save: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log output.log
150
- 2022-01-30 22:49:53,698 INFO SenderThread:23196 [dir_watcher.py:finish():327] scan save: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/requirements.txt requirements.txt
151
- 2022-01-30 22:49:53,699 INFO SenderThread:23196 [dir_watcher.py:finish():327] scan save: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/config.yaml config.yaml
152
- 2022-01-30 22:49:53,700 INFO SenderThread:23196 [dir_watcher.py:finish():327] scan save: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json wandb-summary.json
153
- 2022-01-30 22:49:53,700 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 6
154
- 2022-01-30 22:49:53,701 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
155
- 2022-01-30 22:49:53,713 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
156
- 2022-01-30 22:49:53,720 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 6
157
- 2022-01-30 22:49:53,724 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
158
- 2022-01-30 22:49:53,729 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 6
159
- 2022-01-30 22:49:53,729 INFO SenderThread:23196 [file_pusher.py:finish():177] shutting down file pusher
160
- 2022-01-30 22:49:53,804 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
161
- 2022-01-30 22:49:53,804 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
162
- 2022-01-30 22:49:53,909 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
163
- 2022-01-30 22:49:53,910 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
164
- 2022-01-30 22:49:54,015 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
165
- 2022-01-30 22:49:54,016 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
166
- 2022-01-30 22:49:54,119 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
167
- 2022-01-30 22:49:54,119 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
168
- 2022-01-30 22:49:54,223 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
169
- 2022-01-30 22:49:54,223 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
170
- 2022-01-30 22:49:54,253 INFO Thread-14 :23196 [upload_job.py:push():137] Uploaded file /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/config.yaml
171
- 2022-01-30 22:49:54,257 INFO Thread-13 :23196 [upload_job.py:push():137] Uploaded file /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/requirements.txt
172
- 2022-01-30 22:49:54,291 INFO Thread-15 :23196 [upload_job.py:push():137] Uploaded file /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/wandb-summary.json
173
- 2022-01-30 22:49:54,327 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
174
- 2022-01-30 22:49:54,327 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
175
- 2022-01-30 22:49:54,339 INFO Thread-12 :23196 [upload_job.py:push():137] Uploaded file /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/files/output.log
176
- 2022-01-30 22:49:54,431 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
177
- 2022-01-30 22:49:54,431 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
178
- 2022-01-30 22:49:54,534 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
179
- 2022-01-30 22:49:54,535 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
180
- 2022-01-30 22:49:54,540 INFO Thread-7 :23196 [sender.py:transition_state():387] send defer: 7
181
- 2022-01-30 22:49:54,541 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
182
- 2022-01-30 22:49:54,541 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 7
183
- 2022-01-30 22:49:54,542 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
184
- 2022-01-30 22:49:54,542 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 7
185
- 2022-01-30 22:49:54,638 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
186
- 2022-01-30 22:49:56,342 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 8
187
- 2022-01-30 22:49:56,343 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
188
- 2022-01-30 22:49:56,344 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
189
- 2022-01-30 22:49:56,345 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 8
190
- 2022-01-30 22:49:56,345 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
191
- 2022-01-30 22:49:56,345 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 8
192
- 2022-01-30 22:49:56,346 INFO SenderThread:23196 [sender.py:transition_state():387] send defer: 9
193
- 2022-01-30 22:49:56,347 DEBUG SenderThread:23196 [sender.py:send():234] send: final
194
- 2022-01-30 22:49:56,348 DEBUG SenderThread:23196 [sender.py:send():234] send: footer
195
- 2022-01-30 22:49:56,348 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: defer
196
- 2022-01-30 22:49:56,348 INFO HandlerThread:23196 [handler.py:handle_request_defer():147] handle defer: 9
197
- 2022-01-30 22:49:56,349 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: defer
198
- 2022-01-30 22:49:56,349 INFO SenderThread:23196 [sender.py:send_request_defer():383] handle sender defer: 9
199
- 2022-01-30 22:49:56,447 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: poll_exit
200
- 2022-01-30 22:49:56,447 DEBUG SenderThread:23196 [sender.py:send_request():248] send_request: poll_exit
201
- 2022-01-30 22:49:56,448 INFO SenderThread:23196 [file_pusher.py:join():182] waiting for file pusher
202
- 2022-01-30 22:49:56,761 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: get_summary
203
- 2022-01-30 22:49:56,763 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: sampled_history
204
- 2022-01-30 22:49:56,766 DEBUG HandlerThread:23196 [handler.py:handle_request():130] handle_request: shutdown
205
- 2022-01-30 22:49:56,766 INFO HandlerThread:23196 [handler.py:finish():731] shutting down handler
206
- 2022-01-30 22:49:57,348 INFO WriterThread:23196 [datastore.py:close():281] close: /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/run-2uzt3kt1.wandb
207
- 2022-01-30 22:49:57,758 INFO SenderThread:23196 [sender.py:finish():1070] shutting down sender
208
- 2022-01-30 22:49:57,759 INFO SenderThread:23196 [file_pusher.py:finish():177] shutting down file pusher
209
- 2022-01-30 22:49:57,759 INFO SenderThread:23196 [file_pusher.py:join():182] waiting for file pusher
210
- 2022-01-30 22:49:57,763 INFO MainThread:23196 [internal.py:handle_exit():77] Internal process exited
wandb/run-20220130_224738-2uzt3kt1/logs/debug.log DELETED
@@ -1,146 +0,0 @@
1
- 2022-01-30 22:47:38,315 INFO MainThread:22602 [wandb_setup.py:_flush():71] setting env: {}
2
- 2022-01-30 22:47:38,315 INFO MainThread:22602 [wandb_setup.py:_flush():71] setting login settings: {}
3
- 2022-01-30 22:47:38,315 INFO MainThread:22602 [wandb_init.py:_log_setup():371] Logging user logs to /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/logs/debug.log
4
- 2022-01-30 22:47:38,315 INFO MainThread:22602 [wandb_init.py:_log_setup():372] Logging internal logs to /workspace/xls-r-300m-fr/wandb/run-20220130_224738-2uzt3kt1/logs/debug-internal.log
5
- 2022-01-30 22:47:38,316 INFO MainThread:22602 [wandb_init.py:init():404] calling init triggers
6
- 2022-01-30 22:47:38,316 INFO MainThread:22602 [wandb_init.py:init():409] wandb.init called with sweep_config: {}
7
- config: {}
8
- 2022-01-30 22:47:38,316 INFO MainThread:22602 [wandb_init.py:init():460] starting backend
9
- 2022-01-30 22:47:38,316 INFO MainThread:22602 [backend.py:_multiprocessing_setup():99] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
10
- 2022-01-30 22:47:38,420 INFO MainThread:22602 [backend.py:ensure_launched():216] starting backend process...
11
- 2022-01-30 22:47:38,497 INFO MainThread:22602 [backend.py:ensure_launched():221] started backend process with pid: 23196
12
- 2022-01-30 22:47:38,499 INFO MainThread:22602 [wandb_init.py:init():469] backend started and connected
13
- 2022-01-30 22:47:38,508 INFO MainThread:22602 [wandb_init.py:init():533] updated telemetry
14
- 2022-01-30 22:47:38,663 INFO MainThread:22602 [wandb_init.py:init():563] communicating current version
15
- 2022-01-30 22:47:39,375 INFO MainThread:22602 [wandb_init.py:init():568] got version response
16
- 2022-01-30 22:47:39,375 INFO MainThread:22602 [wandb_init.py:init():578] communicating run to backend with 30 second timeout
17
- 2022-01-30 22:47:39,596 INFO MainThread:22602 [wandb_init.py:init():606] starting run threads in backend
18
- 2022-01-30 22:47:40,195 INFO MainThread:22602 [wandb_run.py:_console_start():1810] atexit reg
19
- 2022-01-30 22:47:40,196 INFO MainThread:22602 [wandb_run.py:_redirect():1684] redirect: SettingsConsole.REDIRECT
20
- 2022-01-30 22:47:40,197 INFO MainThread:22602 [wandb_run.py:_redirect():1689] Redirecting console.
21
- 2022-01-30 22:47:40,203 INFO MainThread:22602 [wandb_run.py:_redirect():1745] Redirects installed.
22
- 2022-01-30 22:47:40,203 INFO MainThread:22602 [wandb_init.py:init():633] run started, returning control to user process
23
- 2022-01-30 22:47:40,206 INFO MainThread:22602 [wandb_run.py:_config_callback():956] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'chunk_size_feed_forward': 0, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'architectures': ['Wav2Vec2ForPreTraining'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 40, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'facebook/wav2vec2-xls-r-300m', 'transformers_version': '4.17.0.dev0', 'feat_extract_dropout': 0.0, 'model_type': 'wav2vec2', 'num_feat_extract_layers': 7, 'hidden_size': 1024, 'feat_extract_norm': 'layer', 'feat_extract_activation': 'gelu', 'conv_dim': [512, 512, 512, 512, 512, 512, 512], 'conv_stride': [5, 2, 2, 2, 2, 2, 2], 'conv_kernel': [10, 3, 3, 3, 3, 2, 2], 'conv_bias': True, 'num_conv_pos_embeddings': 128, 'num_conv_pos_embedding_groups': 16, 'num_hidden_layers': 24, 'intermediate_size': 4096, 'hidden_act': 'gelu', 'num_attention_heads': 16, 'hidden_dropout': 0.0, 'attention_dropout': 0.0, 'activation_dropout': 0.1, 
'feat_proj_dropout': 0.0, 'final_dropout': 0.0, 'layerdrop': 0.0, 'layer_norm_eps': 1e-05, 'initializer_range': 0.02, 'vocab_size': 41, 'do_stable_layer_norm': True, 'use_weighted_layer_sum': False, 'apply_spec_augment': True, 'mask_time_prob': 0.75, 'mask_time_length': 10, 'mask_time_min_masks': 2, 'mask_feature_prob': 0.25, 'mask_feature_length': 64, 'mask_feature_min_masks': 0, 'num_codevectors_per_group': 320, 'num_codevector_groups': 2, 'contrastive_logits_temperature': 0.1, 'feat_quantizer_dropout': 0.0, 'num_negatives': 100, 'codevector_dim': 768, 'proj_codevector_dim': 768, 'diversity_loss_weight': 0.1, 'ctc_loss_reduction': 'mean', 'ctc_zero_infinity': False, 'add_adapter': False, 'adapter_kernel_size': 3, 'adapter_stride': 2, 'num_adapter_layers': 3, 'output_hidden_size': 1024, 'classifier_proj_size': 256, 'tdnn_dim': [512, 512, 512, 512, 1500], 'tdnn_kernel': [5, 3, 3, 1, 1], 'tdnn_dilation': [1, 2, 3, 1, 1], 'xvector_output_dim': 512, 'output_dir': './', 'overwrite_output_dir': True, 'do_train': True, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'steps', 'prediction_loss_only': False, 'per_device_train_batch_size': 8, 'per_device_eval_batch_size': 8, 'per_gpu_train_batch_size': 'None', 'per_gpu_eval_batch_size': 'None', 'gradient_accumulation_steps': 8, 'eval_accumulation_steps': 'None', 'learning_rate': 7.5e-05, 'weight_decay': 0.0, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 0.2, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'warmup_ratio': 0.0, 'warmup_steps': 2000, 'log_level': -1, 'log_level_replica': -1, 'log_on_each_node': True, 'logging_dir': './runs/Jan30_22-46-41_job-3261699b-76eb-4c28-8419-66a66c5c9199', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 100, 'logging_nan_inf_filter': True, 'save_strategy': 'steps', 'save_steps': 500, 'save_total_limit': 3, 'save_on_each_node': False, 'no_cuda': False, 'seed': 42, 'bf16': False, 'fp16': 
True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'amp', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': 'None', 'local_rank': -1, 'xpu_backend': 'None', 'tpu_num_cores': 'None', 'tpu_metrics_debug': False, 'debug': '[]', 'dataloader_drop_last': False, 'eval_steps': 500, 'dataloader_num_workers': 0, 'past_index': -1, 'run_name': './', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': 'None', 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'sharded_ddp': '[]', 'deepspeed': 'None', 'label_smoothing_factor': 0.0, 'optim': 'adamw_hf', 'adafactor': False, 'group_by_length': True, 'length_column_name': 'input_length', 'report_to': "['wandb']", 'ddp_find_unused_parameters': 'None', 'ddp_bucket_cap_mb': 'None', 'dataloader_pin_memory': True, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': True, 'resume_from_checkpoint': 'None', 'hub_model_id': 'None', 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'gradient_checkpointing': True, 'fp16_backend': 'auto', 'push_to_hub_model_id': 'None', 'push_to_hub_organization': 'None', 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', '_n_gpu': 1, 'mp_parameters': '', 'train_batch_size': 8, 'eval_batch_size': 8}
24
- 2022-01-30 22:47:40,212 INFO MainThread:22602 [wandb_watch.py:watch():43] Watching
25
- 2022-01-30 22:49:50,144 INFO MainThread:22602 [wandb_run.py:_atexit_cleanup():1780] got exitcode: 0
26
- 2022-01-30 22:49:50,148 INFO MainThread:22602 [wandb_run.py:_restore():1752] restore
27
- 2022-01-30 22:49:52,759 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
28
- wandb_count: 1
29
- }
30
- pusher_stats {
31
- uploaded_bytes: 2099
32
- total_bytes: 2099
33
- }
34
-
35
- 2022-01-30 22:49:52,864 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
36
- wandb_count: 1
37
- }
38
- pusher_stats {
39
- uploaded_bytes: 2099
40
- total_bytes: 2099
41
- }
42
-
43
- 2022-01-30 22:49:53,116 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
44
- wandb_count: 1
45
- }
46
- pusher_stats {
47
- uploaded_bytes: 2099
48
- total_bytes: 2099
49
- }
50
-
51
- 2022-01-30 22:49:53,702 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
52
- wandb_count: 4
53
- }
54
- pusher_stats {
55
- uploaded_bytes: 2099
56
- total_bytes: 21650
57
- }
58
-
59
- 2022-01-30 22:49:53,806 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
60
- wandb_count: 5
61
- }
62
- pusher_stats {
63
- uploaded_bytes: 2099
64
- total_bytes: 22128
65
- }
66
-
67
- 2022-01-30 22:49:53,914 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
68
- wandb_count: 5
69
- }
70
- pusher_stats {
71
- uploaded_bytes: 2099
72
- total_bytes: 22128
73
- }
74
-
75
- 2022-01-30 22:49:54,017 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
76
- wandb_count: 5
77
- }
78
- pusher_stats {
79
- uploaded_bytes: 22128
80
- total_bytes: 22128
81
- }
82
-
83
- 2022-01-30 22:49:54,121 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
84
- wandb_count: 5
85
- }
86
- pusher_stats {
87
- uploaded_bytes: 22128
88
- total_bytes: 22128
89
- }
90
-
91
- 2022-01-30 22:49:54,225 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
92
- wandb_count: 5
93
- }
94
- pusher_stats {
95
- uploaded_bytes: 22128
96
- total_bytes: 22128
97
- }
98
-
99
- 2022-01-30 22:49:54,329 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
100
- wandb_count: 5
101
- }
102
- pusher_stats {
103
- uploaded_bytes: 22128
104
- total_bytes: 22128
105
- }
106
-
107
- 2022-01-30 22:49:54,433 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
108
- wandb_count: 5
109
- }
110
- pusher_stats {
111
- uploaded_bytes: 22128
112
- total_bytes: 22128
113
- }
114
-
115
- 2022-01-30 22:49:54,537 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
116
- wandb_count: 5
117
- }
118
- pusher_stats {
119
- uploaded_bytes: 22128
120
- total_bytes: 22128
121
- }
122
-
123
- 2022-01-30 22:49:56,345 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: file_counts {
124
- wandb_count: 5
125
- }
126
- pusher_stats {
127
- uploaded_bytes: 22128
128
- total_bytes: 22128
129
- }
130
-
131
- 2022-01-30 22:49:56,759 INFO MainThread:22602 [wandb_run.py:_wait_for_finish():1912] got exit ret: done: true
132
- exit_result {
133
- }
134
- file_counts {
135
- wandb_count: 5
136
- }
137
- pusher_stats {
138
- uploaded_bytes: 22128
139
- total_bytes: 22128
140
- }
141
- local_info {
142
- }
143
-
144
- 2022-01-30 22:49:57,908 INFO MainThread:22602 [wandb_run.py:_append_history():2130] rendering history
145
- 2022-01-30 22:49:57,909 INFO MainThread:22602 [wandb_run.py:_append_summary():2085] rendering summary
146
- 2022-01-30 22:49:57,909 INFO MainThread:22602 [wandb_run.py:_append_files():2180] logging synced files
wandb/run-20220130_224738-2uzt3kt1/run-2uzt3kt1.wandb DELETED
Binary file (18.1 kB)
 
wandb/run-20220130_230018-ktkg6ghu/files/conda-environment.yaml DELETED
File without changes
wandb/run-20220130_230018-ktkg6ghu/files/config.yaml DELETED
@@ -1,692 +0,0 @@
1
- wandb_version: 1
2
-
3
- _n_gpu:
4
- desc: null
5
- value: 1
6
- _name_or_path:
7
- desc: null
8
- value: facebook/wav2vec2-xls-r-300m
9
- _wandb:
10
- desc: null
11
- value:
12
- cli_version: 0.12.9
13
- framework: huggingface
14
- huggingface_version: 4.17.0.dev0
15
- is_jupyter_run: false
16
- is_kaggle_kernel: false
17
- m:
18
- - 1: train/global_step
19
- 6:
20
- - 3
21
- - 1: train/train_runtime
22
- 5: 1
23
- 6:
24
- - 1
25
- - 1: train/train_samples_per_second
26
- 5: 1
27
- 6:
28
- - 1
29
- - 1: train/train_steps_per_second
30
- 5: 1
31
- 6:
32
- - 1
33
- - 1: train/total_flos
34
- 5: 1
35
- 6:
36
- - 1
37
- - 1: train/train_loss
38
- 5: 1
39
- 6:
40
- - 1
41
- - 1: train/epoch
42
- 5: 1
43
- 6:
44
- - 1
45
- - 1: eval/loss
46
- 5: 1
47
- 6:
48
- - 1
49
- - 1: eval/wer
50
- 5: 1
51
- 6:
52
- - 1
53
- - 1: eval/runtime
54
- 5: 1
55
- 6:
56
- - 1
57
- - 1: eval/samples_per_second
58
- 5: 1
59
- 6:
60
- - 1
61
- - 1: eval/steps_per_second
62
- 5: 1
63
- 6:
64
- - 1
65
- python_version: 3.8.8
66
- start_time: 1643583619
67
- t:
68
- 1:
69
- - 1
70
- - 5
71
- - 11
72
- 3:
73
- - 13
74
- 4: 3.8.8
75
- 5: 0.12.9
76
- 6: 4.17.0.dev0
77
- 8:
78
- - 5
79
- activation_dropout:
80
- desc: null
81
- value: 0.1
82
- adafactor:
83
- desc: null
84
- value: false
85
- adam_beta1:
86
- desc: null
87
- value: 0.9
88
- adam_beta2:
89
- desc: null
90
- value: 0.999
91
- adam_epsilon:
92
- desc: null
93
- value: 1.0e-08
94
- adapter_kernel_size:
95
- desc: null
96
- value: 3
97
- adapter_stride:
98
- desc: null
99
- value: 2
100
- add_adapter:
101
- desc: null
102
- value: false
103
- add_cross_attention:
104
- desc: null
105
- value: false
106
- apply_spec_augment:
107
- desc: null
108
- value: true
109
- architectures:
110
- desc: null
111
- value:
112
- - Wav2Vec2ForPreTraining
113
- attention_dropout:
114
- desc: null
115
- value: 0.0
116
- bad_words_ids:
117
- desc: null
118
- value: null
119
- bf16:
120
- desc: null
121
- value: false
122
- bf16_full_eval:
123
- desc: null
124
- value: false
125
- bos_token_id:
126
- desc: null
127
- value: 1
128
- chunk_size_feed_forward:
129
- desc: null
130
- value: 0
131
- classifier_proj_size:
132
- desc: null
133
- value: 256
134
- codevector_dim:
135
- desc: null
136
- value: 768
137
- contrastive_logits_temperature:
138
- desc: null
139
- value: 0.1
140
- conv_bias:
141
- desc: null
142
- value: true
143
- conv_dim:
144
- desc: null
145
- value:
146
- - 512
147
- - 512
148
- - 512
149
- - 512
150
- - 512
151
- - 512
152
- - 512
153
- conv_kernel:
154
- desc: null
155
- value:
156
- - 10
157
- - 3
158
- - 3
159
- - 3
160
- - 3
161
- - 2
162
- - 2
163
- conv_stride:
164
- desc: null
165
- value:
166
- - 5
167
- - 2
168
- - 2
169
- - 2
170
- - 2
171
- - 2
172
- - 2
173
- cross_attention_hidden_size:
174
- desc: null
175
- value: null
176
- ctc_loss_reduction:
177
- desc: null
178
- value: mean
179
- ctc_zero_infinity:
180
- desc: null
181
- value: false
182
- dataloader_drop_last:
183
- desc: null
184
- value: false
185
- dataloader_num_workers:
186
- desc: null
187
- value: 0
188
- dataloader_pin_memory:
189
- desc: null
190
- value: true
191
- ddp_bucket_cap_mb:
192
- desc: null
193
- value: None
194
- ddp_find_unused_parameters:
195
- desc: null
196
- value: None
197
- debug:
198
- desc: null
199
- value: '[]'
200
- decoder_start_token_id:
201
- desc: null
202
- value: null
203
- deepspeed:
204
- desc: null
205
- value: None
206
- disable_tqdm:
207
- desc: null
208
- value: false
209
- diversity_loss_weight:
210
- desc: null
211
- value: 0.1
212
- diversity_penalty:
213
- desc: null
214
- value: 0.0
215
- do_eval:
216
- desc: null
217
- value: true
218
- do_predict:
219
- desc: null
220
- value: false
221
- do_sample:
222
- desc: null
223
- value: false
224
- do_stable_layer_norm:
225
- desc: null
226
- value: true
227
- do_train:
228
- desc: null
229
- value: true
230
- early_stopping:
231
- desc: null
232
- value: false
233
- encoder_no_repeat_ngram_size:
234
- desc: null
235
- value: 0
236
- eos_token_id:
237
- desc: null
238
- value: 2
239
- eval_accumulation_steps:
240
- desc: null
241
- value: None
242
- eval_batch_size:
243
- desc: null
244
- value: 8
245
- eval_steps:
246
- desc: null
247
- value: 500
248
- evaluation_strategy:
249
- desc: null
250
- value: steps
251
- feat_extract_activation:
252
- desc: null
253
- value: gelu
254
- feat_extract_dropout:
255
- desc: null
256
- value: 0.0
257
- feat_extract_norm:
258
- desc: null
259
- value: layer
260
- feat_proj_dropout:
261
- desc: null
262
- value: 0.0
263
- feat_quantizer_dropout:
264
- desc: null
265
- value: 0.0
266
- final_dropout:
267
- desc: null
268
- value: 0.0
269
- finetuning_task:
270
- desc: null
271
- value: null
272
- forced_bos_token_id:
273
- desc: null
274
- value: null
275
- forced_eos_token_id:
276
- desc: null
277
- value: null
278
- fp16:
279
- desc: null
280
- value: true
281
- fp16_backend:
282
- desc: null
283
- value: auto
284
- fp16_full_eval:
285
- desc: null
286
- value: false
287
- fp16_opt_level:
288
- desc: null
289
- value: O1
290
- gradient_accumulation_steps:
291
- desc: null
292
- value: 8
293
- gradient_checkpointing:
294
- desc: null
295
- value: true
296
- greater_is_better:
297
- desc: null
298
- value: false
299
- group_by_length:
300
- desc: null
301
- value: true
302
- half_precision_backend:
303
- desc: null
304
- value: amp
305
- hidden_act:
306
- desc: null
307
- value: gelu
308
- hidden_dropout:
309
- desc: null
310
- value: 0.0
311
- hidden_size:
312
- desc: null
313
- value: 1024
314
- hub_model_id:
315
- desc: null
316
- value: None
317
- hub_strategy:
318
- desc: null
319
- value: every_save
320
- hub_token:
321
- desc: null
322
- value: <HUB_TOKEN>
323
- id2label:
324
- desc: null
325
- value:
326
- '0': LABEL_0
327
- '1': LABEL_1
328
- ignore_data_skip:
329
- desc: null
330
- value: false
331
- initializer_range:
332
- desc: null
333
- value: 0.02
334
- intermediate_size:
335
- desc: null
336
- value: 4096
337
- is_decoder:
338
- desc: null
339
- value: false
340
- is_encoder_decoder:
341
- desc: null
342
- value: false
343
- label2id:
344
- desc: null
345
- value:
346
- LABEL_0: 0
347
- LABEL_1: 1
348
- label_names:
349
- desc: null
350
- value: None
351
- label_smoothing_factor:
352
- desc: null
353
- value: 0.0
354
- layer_norm_eps:
355
- desc: null
356
- value: 1.0e-05
357
- layerdrop:
358
- desc: null
359
- value: 0.0
360
- learning_rate:
361
- desc: null
362
- value: 7.5e-05
363
- length_column_name:
364
- desc: null
365
- value: input_length
366
- length_penalty:
367
- desc: null
368
- value: 1.0
369
- load_best_model_at_end:
370
- desc: null
371
- value: true
372
- local_rank:
373
- desc: null
374
- value: -1
375
- log_level:
376
- desc: null
377
- value: -1
378
- log_level_replica:
379
- desc: null
380
- value: -1
381
- log_on_each_node:
382
- desc: null
383
- value: true
384
- logging_dir:
385
- desc: null
386
- value: ./runs/Jan30_22-59-56_job-3261699b-76eb-4c28-8419-66a66c5c9199
387
- logging_first_step:
388
- desc: null
389
- value: false
390
- logging_nan_inf_filter:
391
- desc: null
392
- value: true
393
- logging_steps:
394
- desc: null
395
- value: 100
396
- logging_strategy:
397
- desc: null
398
- value: steps
399
- lr_scheduler_type:
400
- desc: null
401
- value: linear
402
- mask_feature_length:
403
- desc: null
404
- value: 64
405
- mask_feature_min_masks:
406
- desc: null
407
- value: 0
408
- mask_feature_prob:
409
- desc: null
410
- value: 0.25
411
- mask_time_length:
412
- desc: null
413
- value: 10
414
- mask_time_min_masks:
415
- desc: null
416
- value: 2
417
- mask_time_prob:
418
- desc: null
419
- value: 0.75
420
- max_grad_norm:
421
- desc: null
422
- value: 1.0
423
- max_length:
424
- desc: null
425
- value: 20
426
- max_steps:
427
- desc: null
428
- value: -1
429
- metric_for_best_model:
430
- desc: null
431
- value: loss
432
- min_length:
433
- desc: null
434
- value: 0
435
- model_type:
436
- desc: null
437
- value: wav2vec2
438
- mp_parameters:
439
- desc: null
440
- value: ''
441
- no_cuda:
442
- desc: null
443
- value: false
444
- no_repeat_ngram_size:
445
- desc: null
446
- value: 0
447
- num_adapter_layers:
448
- desc: null
449
- value: 3
450
- num_attention_heads:
451
- desc: null
452
- value: 16
453
- num_beam_groups:
454
- desc: null
455
- value: 1
456
- num_beams:
457
- desc: null
458
- value: 1
459
- num_codevector_groups:
460
- desc: null
461
- value: 2
462
- num_codevectors_per_group:
463
- desc: null
464
- value: 320
465
- num_conv_pos_embedding_groups:
466
- desc: null
467
- value: 16
468
- num_conv_pos_embeddings:
469
- desc: null
470
- value: 128
471
- num_feat_extract_layers:
472
- desc: null
473
- value: 7
474
- num_hidden_layers:
475
- desc: null
476
- value: 24
477
- num_negatives:
478
- desc: null
479
- value: 100
480
- num_return_sequences:
481
- desc: null
482
- value: 1
483
- num_train_epochs:
484
- desc: null
485
- value: 0.4
486
- optim:
487
- desc: null
488
- value: adamw_hf
489
- output_attentions:
490
- desc: null
491
- value: false
492
- output_dir:
493
- desc: null
494
- value: ./
495
- output_hidden_size:
496
- desc: null
497
- value: 1024
498
- output_hidden_states:
499
- desc: null
500
- value: false
501
- output_scores:
502
- desc: null
503
- value: false
504
- overwrite_output_dir:
505
- desc: null
506
- value: true
507
- pad_token_id:
508
- desc: null
509
- value: 40
510
- past_index:
511
- desc: null
512
- value: -1
513
- per_device_eval_batch_size:
514
- desc: null
515
- value: 8
516
- per_device_train_batch_size:
517
- desc: null
518
- value: 8
519
- per_gpu_eval_batch_size:
520
- desc: null
521
- value: None
522
- per_gpu_train_batch_size:
523
- desc: null
524
- value: None
525
- prediction_loss_only:
526
- desc: null
527
- value: false
528
- prefix:
529
- desc: null
530
- value: null
531
- problem_type:
532
- desc: null
533
- value: null
534
- proj_codevector_dim:
535
- desc: null
536
- value: 768
537
- pruned_heads:
538
- desc: null
539
- value: {}
540
- push_to_hub:
541
- desc: null
542
- value: true
543
- push_to_hub_model_id:
544
- desc: null
545
- value: None
546
- push_to_hub_organization:
547
- desc: null
548
- value: None
549
- push_to_hub_token:
550
- desc: null
551
- value: <PUSH_TO_HUB_TOKEN>
552
- remove_invalid_values:
553
- desc: null
554
- value: false
555
- remove_unused_columns:
556
- desc: null
557
- value: true
558
- repetition_penalty:
559
- desc: null
560
- value: 1.0
561
- report_to:
562
- desc: null
563
- value: '[''wandb'']'
564
- resume_from_checkpoint:
565
- desc: null
566
- value: None
567
- return_dict:
568
- desc: null
569
- value: true
570
- return_dict_in_generate:
571
- desc: null
572
- value: false
573
- run_name:
574
- desc: null
575
- value: ./
576
- save_on_each_node:
577
- desc: null
578
- value: false
579
- save_steps:
580
- desc: null
581
- value: 500
582
- save_strategy:
583
- desc: null
584
- value: steps
585
- save_total_limit:
586
- desc: null
587
- value: 3
588
- seed:
589
- desc: null
590
- value: 42
591
- sep_token_id:
592
- desc: null
593
- value: null
594
- sharded_ddp:
595
- desc: null
596
- value: '[]'
597
- skip_memory_metrics:
598
- desc: null
599
- value: true
600
- task_specific_params:
601
- desc: null
602
- value: null
603
- tdnn_dilation:
604
- desc: null
605
- value:
606
- - 1
607
- - 2
608
- - 3
609
- - 1
610
- - 1
611
- tdnn_dim:
612
- desc: null
613
- value:
614
- - 512
615
- - 512
616
- - 512
617
- - 512
618
- - 1500
619
- tdnn_kernel:
620
- desc: null
621
- value:
622
- - 5
623
- - 3
624
- - 3
625
- - 1
626
- - 1
627
- temperature:
628
- desc: null
629
- value: 1.0
630
- tf32:
631
- desc: null
632
- value: None
633
- tie_encoder_decoder:
634
- desc: null
635
- value: false
636
- tie_word_embeddings:
637
- desc: null
638
- value: true
639
- tokenizer_class:
640
- desc: null
641
- value: null
642
- top_k:
643
- desc: null
644
- value: 50
645
- top_p:
646
- desc: null
647
- value: 1.0
648
- torch_dtype:
649
- desc: null
650
- value: float32
651
- torchscript:
652
- desc: null
653
- value: false
654
- tpu_metrics_debug:
655
- desc: null
656
- value: false
657
- tpu_num_cores:
658
- desc: null
659
- value: None
660
- train_batch_size:
661
- desc: null
662
- value: 8
663
- transformers_version:
664
- desc: null
665
- value: 4.17.0.dev0
666
- use_bfloat16:
667
- desc: null
668
- value: false
669
- use_legacy_prediction_loop:
670
- desc: null
671
- value: false
672
- use_weighted_layer_sum:
673
- desc: null
674
- value: false
675
- vocab_size:
676
- desc: null
677
- value: 41
678
- warmup_ratio:
679
- desc: null
680
- value: 0.0
681
- warmup_steps:
682
- desc: null
683
- value: 2000
684
- weight_decay:
685
- desc: null
686
- value: 0.0
687
- xpu_backend:
688
- desc: null
689
- value: None
690
- xvector_output_dim:
691
- desc: null
692
- value: 512
wandb/run-20220130_230018-ktkg6ghu/files/output.log DELETED
@@ -1,62 +0,0 @@
1
-
2
-
3
-
4
-
5
-
6
-
7
- 83%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████▌ | 5/6 [00:18<00:03, 3.64s/it]
8
- 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [00:20<00:00, 3.23s/it]
9
- Training completed. Do not forget to share your model on huggingface.co/models =)
10
- 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [00:20<00:00, 3.47s/it]
11
- Saving model checkpoint to ./
12
- Configuration saved in ./config.json
13
- Model weights saved in ./pytorch_model.bin
14
- Configuration saved in ./preprocessor_config.json
15
- Saving model checkpoint to ./
16
- Configuration saved in ./config.json
17
- Model weights saved in ./pytorch_model.bin
18
- Configuration saved in ./preprocessor_config.json
19
- Upload file pytorch_model.bin: 1%|▋ | 8.55M/1.18G [00:01<02:22, 8.82MB/s]
20
- Upload file training_args.bin: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:00<?, ?B/s]
21
- 01/30/2022 23:02:55 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
22
- 1fb68dc..ab9abf3 main -> main0%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:00<?, ?B/s]
23
- Upload file pytorch_model.bin: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████| 1.18G/1.18G [00:50<00:00, 25.0MB/s]
24
- Upload file training_args.bin: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:50<?, ?B/s]
25
- Dropping the following result as it does not have all the necessary fields:██████████████████████████████████████████████████████████████████| 2.92k/2.92k [00:50<?, ?B/s]
26
- {}
27
- 01/30/2022 23:03:03 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
28
- ab9abf3..565d898 main -> main
29
- To https://huggingface.co/Plim/xls-r-300m-fr
30
- ab9abf3..565d898 main -> main
31
- ***** train metrics *****
32
- epoch = 0.38
33
- train_loss = 13.5841
34
- train_runtime = 0:00:23.20
35
- train_samples = 1000
36
- train_samples_per_second = 17.241
37
- train_steps_per_second = 0.259
38
- 01/30/2022 23:03:06 - INFO - __main__ - *** Evaluate ***
39
- The following columns in the evaluation set don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.
40
- ***** Running Evaluation *****
41
- Num examples = 200
42
- Batch size = 8
43
-
44
-
45
-
46
-
47
- 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 25/25 [00:08<00:00, 2.79it/s]
48
- ***** eval metrics *****
49
- epoch = 0.38
50
- eval_loss = 16.9129
51
- eval_runtime = 0:00:08.63
52
- eval_samples = 200
53
- eval_samples_per_second = 23.165
54
- eval_steps_per_second = 2.896
55
- 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 25/25 [00:08<00:00, 3.03it/s]
56
- Saving model checkpoint to ./
57
- Configuration saved in ./config.json
58
- Model weights saved in ./pytorch_model.bin
59
- Configuration saved in ./preprocessor_config.json
60
- 01/30/2022 23:03:39 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-300m-fr
61
- 565d898..1fb6b5d main -> main
62
- To https://huggingface.co/Plim/xls-r-300m-fr
 
wandb/run-20220130_230018-ktkg6ghu/files/requirements.txt DELETED
@@ -1,180 +0,0 @@
1
- aiohttp==3.8.1
2
- aiosignal==1.2.0
3
- analytics-python==1.4.0
4
- anyio==3.5.0
5
- appdirs==1.4.4
6
- argon2-cffi-bindings==21.2.0
7
- argon2-cffi==21.3.0
8
- asgiref==3.5.0
9
- asttokens==2.0.5
10
- async-timeout==4.0.2
11
- attrs==21.4.0
12
- audioread==2.1.9
13
- backcall==0.2.0
14
- backoff==1.10.0
15
- bcrypt==3.2.0
16
- beautifulsoup4==4.9.3
17
- black==21.12b0
18
- bleach==4.1.0
19
- brotlipy==0.7.0
20
- certifi==2020.12.5
21
- cffi==1.14.3
22
- chardet==3.0.4
23
- charset-normalizer==2.0.10
24
- click==8.0.3
25
- conda-build==3.21.4
26
- conda-package-handling==1.7.2
27
- conda==4.9.2
28
- configparser==5.2.0
29
- cryptography==3.2.1
30
- cycler==0.11.0
31
- datasets==1.18.2.dev0
32
- debugpy==1.5.1
33
- decorator==4.4.2
34
- defusedxml==0.7.1
35
- dill==0.3.4
36
- dnspython==2.1.0
37
- docker-pycreds==0.4.0
38
- entrypoints==0.3
39
- executing==0.8.2
40
- fastapi==0.73.0
41
- ffmpy==0.3.0
42
- filelock==3.0.12
43
- fonttools==4.29.0
44
- frozenlist==1.3.0
45
- fsspec==2022.1.0
46
- gitdb==4.0.9
47
- gitpython==3.1.26
48
- glob2==0.7
49
- gradio==2.7.5.2
50
- h11==0.13.0
51
- huggingface-hub==0.4.0
52
- idna==2.10
53
- importlib-resources==5.4.0
54
- ipykernel==6.7.0
55
- ipython-genutils==0.2.0
56
- ipython==8.0.1
57
- ipywidgets==7.6.3
58
- jedi==0.17.0
59
- jinja2==2.11.3
60
- jiwer==2.3.0
61
- joblib==1.1.0
62
- json5==0.9.6
63
- jsonschema==4.4.0
64
- jupyter-client==7.1.2
65
- jupyter-core==4.9.1
66
- jupyterlab-pygments==0.1.2
67
- jupyterlab-server==1.2.0
68
- jupyterlab-widgets==1.0.2
69
- jupyterlab==2.2.9
70
- kiwisolver==1.3.2
71
- libarchive-c==2.9
72
- librosa==0.8.1
73
- llvmlite==0.38.0
74
- markdown2==2.4.2
75
- markupsafe==1.1.1
76
- matplotlib-inline==0.1.3
77
- matplotlib==3.5.1
78
- mistune==0.8.4
79
- mkl-fft==1.3.0
80
- mkl-random==1.1.1
81
- mkl-service==2.3.0
82
- monotonic==1.6
83
- multidict==6.0.2
84
- multiprocess==0.70.12.2
85
- mypy-extensions==0.4.3
86
- nano==0.10.0
87
- nbclient==0.5.10
88
- nbconvert==6.4.1
89
- nbformat==5.1.3
90
- nest-asyncio==1.5.4
91
- notebook==6.4.8
92
- numba==0.55.1
93
- numpy==1.19.2
94
- olefile==0.46
95
- packaging==21.3
96
- pandas==1.4.0
97
- pandocfilters==1.5.0
98
- paramiko==2.9.2
99
- parso==0.8.1
100
- pathspec==0.9.0
101
- pathtools==0.1.2
102
- pexpect==4.8.0
103
- pickleshare==0.7.5
104
- pillow==8.1.2
105
- pip==21.3.1
106
- pkginfo==1.7.0
107
- platformdirs==2.4.1
108
- pooch==1.6.0
109
- prometheus-client==0.13.0
110
- promise==2.3
111
- prompt-toolkit==3.0.8
112
- protobuf==3.19.4
113
- psutil==5.8.0
114
- ptyprocess==0.7.0
115
- pure-eval==0.2.2
116
- pyarrow==6.0.1
117
- pycosat==0.6.3
118
- pycparser==2.20
119
- pycryptodome==3.13.0
120
- pydantic==1.9.0
121
- pydub==0.25.1
122
- pygments==2.8.0
123
- pynacl==1.5.0
124
- pyopenssl==19.1.0
125
- pyparsing==3.0.7
126
- pyrsistent==0.18.1
127
- pysocks==1.7.1
128
- python-dateutil==2.8.2
129
- python-etcd==0.4.5
130
- python-levenshtein==0.12.2
131
- python-multipart==0.0.5
132
- pytz==2021.1
133
- pyyaml==5.4.1
134
- pyzmq==22.3.0
135
- regex==2022.1.18
136
- requests==2.24.0
137
- resampy==0.2.2
138
- ruamel-yaml==0.15.87
139
- sacremoses==0.0.47
140
- scikit-learn==1.0.2
141
- scipy==1.7.3
142
- send2trash==1.8.0
143
- sentry-sdk==1.5.4
144
- setuptools==50.3.1.post20201107
145
- shortuuid==1.0.8
146
- six==1.15.0
147
- smmap==5.0.0
148
- sniffio==1.2.0
149
- soundfile==0.10.3.post1
150
- soupsieve==2.2
151
- stack-data==0.1.4
152
- starlette==0.17.1
153
- subprocess32==3.5.4
154
- termcolor==1.1.0
155
- terminado==0.13.1
156
- testpath==0.5.0
157
- threadpoolctl==3.0.0
158
- tokenizers==0.11.4
159
- tomli==1.2.3
160
- torch==1.10.2
161
- torchaudio==0.10.2
162
- torchelastic==0.2.2
163
- torchtext==0.9.1
164
- torchvision==0.9.1
165
- tornado==6.1
166
- tqdm==4.62.3
167
- traitlets==5.1.1
168
- transformers==4.17.0.dev0
169
- typing-extensions==4.0.1
170
- urllib3==1.25.11
171
- uvicorn==0.17.1
172
- wandb==0.12.9
173
- wcwidth==0.2.5
174
- webencodings==0.5.1
175
- wheel==0.35.1
176
- widgetsnbextension==3.5.2
177
- xxhash==2.0.2
178
- yarl==1.7.2
179
- yaspin==2.1.0
180
- zipp==3.7.0
 
wandb/run-20220130_230018-ktkg6ghu/files/wandb-metadata.json DELETED
@@ -1,63 +0,0 @@
1
- {
2
- "os": "Linux-4.15.0-151-generic-x86_64-with-glibc2.10",
3
- "python": "3.8.8",
4
- "heartbeatAt": "2022-01-30T23:00:20.181812",
5
- "startedAt": "2022-01-30T23:00:18.929948",
6
- "docker": null,
7
- "gpu": "Tesla V100S-PCIE-32GB",
8
- "gpu_count": 1,
9
- "cpu_count": 60,
10
- "cuda": null,
11
- "args": [
12
- "--activation_dropout=0.1",
13
- "--dataset_name=mozilla-foundation/common_voice_7_0",
14
- "--dataset_config_name=fr",
15
- "--eval_steps=500",
16
- "--evaluation_strategy=steps",
17
- "--feat_proj_dropout=0.0",
18
- "--freeze_feature_encoder",
19
- "--fp16",
20
- "--gradient_accumulation_steps=8",
21
- "--gradient_checkpointing",
22
- "--group_by_length",
23
- "--layerdrop=0.0",
24
- "--learning_rate=7.5e-5",
25
- "--length_column_name=input_length",
26
- "--load_best_model_at_end",
27
- "--logging_steps=100",
28
- "--mask_feature_length=64",
29
- "--mask_feature_prob=0.25",
30
- "--mask_time_length=10",
31
- "--mask_time_prob=0.75",
32
- "--max_train_samples=1000",
33
- "--max_eval_samples=200",
34
- "--model_name_or_path=facebook/wav2vec2-xls-r-300m",
35
- "--num_train_epochs=0.4",
36
- "--output_dir=./",
37
- "--overwrite_output_dir",
38
- "--per_device_train_batch_size=8",
39
- "--per_device_eval_batch_size=8",
40
- "--preprocessing_num_workers=4",
41
- "--push_to_hub",
42
- "--report_to=wandb",
43
- "--save_steps=500",
44
- "--save_total_limit=3",
45
- "--text_column_name=sentence",
46
- "--use_auth_token",
47
- "--warmup_steps=2000",
48
- "--do_train",
49
- "--do_eval"
50
- ],
51
- "state": "running",
52
- "program": "run_speech_recognition_ctc.py",
53
- "codePath": "run_speech_recognition_ctc.py",
54
- "git": {
55
- "remote": "https://huggingface.co/Plim/xls-r-300m-fr",
56
- "commit": "1fb68dc4e7aab3ec7e3f3b252fb785ff9e047418"
57
- },
58
- "email": "lim.pascal93@gmail.com",
59
- "root": "/workspace/xls-r-300m-fr",
60
- "host": "job-3261699b-76eb-4c28-8419-66a66c5c9199",
61
- "username": "ovh",
62
- "executable": "/opt/conda/bin/python"
63
- }
 
wandb/run-20220130_230018-ktkg6ghu/files/wandb-summary.json DELETED
@@ -1 +0,0 @@
1
- {"train/train_runtime": 23.2007, "train/train_samples_per_second": 17.241, "train/train_steps_per_second": 0.259, "train/total_flos": 5.41371015650304e+16, "train/train_loss": 13.584136962890625, "train/epoch": 0.38, "train/global_step": 6, "_runtime": 176, "_timestamp": 1643583795, "_step": 1, "eval/loss": 16.912879943847656, "eval/wer": 2.3789039481437833, "eval/runtime": 8.6337, "eval/samples_per_second": 23.165, "eval/steps_per_second": 2.896}
 
 
wandb/run-20220130_230018-ktkg6ghu/logs/debug-internal.log DELETED
@@ -1,110 +0,0 @@
1
- 2022-01-30 23:00:19,867 INFO MainThread:28776 [internal.py:wandb_internal():87] W&B internal server running at pid: 28776, started at: 2022-01-30 23:00:19.867526
2
- 2022-01-30 23:00:19,871 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: check_version
3
- 2022-01-30 23:00:19,872 INFO WriterThread:28776 [datastore.py:open_for_write():77] open: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/run-ktkg6ghu.wandb
4
- 2022-01-30 23:00:19,875 DEBUG SenderThread:28776 [sender.py:send():234] send: header
5
- 2022-01-30 23:00:19,876 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: check_version
6
- 2022-01-30 23:00:19,950 DEBUG SenderThread:28776 [sender.py:send():234] send: run
7
- 2022-01-30 23:00:20,171 INFO SenderThread:28776 [dir_watcher.py:__init__():169] watching files in: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files
8
- 2022-01-30 23:00:20,171 INFO SenderThread:28776 [sender.py:_start_run_threads():804] run started: ktkg6ghu with start time 1643583619
9
- 2022-01-30 23:00:20,171 DEBUG SenderThread:28776 [sender.py:send():234] send: summary
10
- 2022-01-30 23:00:20,172 INFO SenderThread:28776 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
11
- 2022-01-30 23:00:20,173 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: run_start
12
- 2022-01-30 23:00:20,181 DEBUG HandlerThread:28776 [meta.py:__init__():40] meta init
13
- 2022-01-30 23:00:20,181 DEBUG HandlerThread:28776 [meta.py:__init__():54] meta init done
14
- 2022-01-30 23:00:20,181 DEBUG HandlerThread:28776 [meta.py:probe():214] probe
15
- 2022-01-30 23:00:20,189 DEBUG HandlerThread:28776 [meta.py:_setup_git():204] setup git
16
- 2022-01-30 23:00:20,223 DEBUG HandlerThread:28776 [meta.py:_setup_git():211] setup git done
17
- 2022-01-30 23:00:20,223 DEBUG HandlerThread:28776 [meta.py:_save_pip():58] save pip
18
- 2022-01-30 23:00:20,224 DEBUG HandlerThread:28776 [meta.py:_save_pip():72] save pip done
19
- 2022-01-30 23:00:20,224 DEBUG HandlerThread:28776 [meta.py:_save_conda():79] save conda
20
- 2022-01-30 23:00:20,778 DEBUG HandlerThread:28776 [meta.py:_save_conda():89] save conda done
21
- 2022-01-30 23:00:20,778 DEBUG HandlerThread:28776 [meta.py:probe():252] probe done
22
- 2022-01-30 23:00:20,787 DEBUG SenderThread:28776 [sender.py:send():234] send: files
23
- 2022-01-30 23:00:20,787 INFO SenderThread:28776 [sender.py:_save_file():939] saving file wandb-metadata.json with policy now
24
- 2022-01-30 23:00:20,797 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
25
- 2022-01-30 23:00:20,798 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
26
- 2022-01-30 23:00:20,961 DEBUG SenderThread:28776 [sender.py:send():234] send: config
27
- 2022-01-30 23:00:20,963 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
28
- 2022-01-30 23:00:20,963 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
29
- 2022-01-30 23:00:20,963 WARNING SenderThread:28776 [sender.py:send_metric():897] Seen metric with glob (shouldnt happen)
30
- 2022-01-30 23:00:21,176 INFO Thread-8 :28776 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/wandb-summary.json
31
- 2022-01-30 23:00:21,176 INFO Thread-8 :28776 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/conda-environment.yaml
32
- 2022-01-30 23:00:21,177 INFO Thread-8 :28776 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
33
- 2022-01-30 23:00:21,177 INFO Thread-8 :28776 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/requirements.txt
34
- 2022-01-30 23:00:21,177 INFO Thread-8 :28776 [dir_watcher.py:_on_file_created():217] file/dir created: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/wandb-metadata.json
35
- 2022-01-30 23:00:21,331 INFO Thread-11 :28776 [upload_job.py:push():137] Uploaded file /tmp/tmpoltwert_wandb/8zi886rt-wandb-metadata.json
36
- 2022-01-30 23:00:23,173 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
37
- 2022-01-30 23:00:27,175 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
38
- 2022-01-30 23:00:31,177 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
39
- 2022-01-30 23:00:33,178 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
40
- 2022-01-30 23:00:36,074 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
41
- 2022-01-30 23:00:36,075 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
42
- 2022-01-30 23:00:37,180 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
43
- 2022-01-30 23:00:41,182 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
44
- 2022-01-30 23:00:41,623 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
45
- 2022-01-30 23:00:41,623 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
46
- 2022-01-30 23:00:41,623 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
47
- 2022-01-30 23:00:41,623 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
48
- 2022-01-30 23:00:41,623 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
49
- 2022-01-30 23:00:41,624 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
50
- 2022-01-30 23:00:41,624 DEBUG SenderThread:28776 [sender.py:send():234] send: history
51
- 2022-01-30 23:00:41,624 DEBUG SenderThread:28776 [sender.py:send():234] send: summary
52
- 2022-01-30 23:00:41,624 INFO SenderThread:28776 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
53
- 2022-01-30 23:00:42,183 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/wandb-summary.json
54
- 2022-01-30 23:00:43,184 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
55
- 2022-01-30 23:00:45,185 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
56
- 2022-01-30 23:00:47,187 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
57
- 2022-01-30 23:00:48,450 DEBUG SenderThread:28776 [sender.py:send():234] send: stats
58
- 2022-01-30 23:00:51,190 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/config.yaml
59
- 2022-01-30 23:00:51,326 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
60
- 2022-01-30 23:00:51,327 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
61
- 2022-01-30 23:01:06,483 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
62
- 2022-01-30 23:01:06,484 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
63
- 2022-01-30 23:01:18,718 DEBUG SenderThread:28776 [sender.py:send():234] send: stats
64
- 2022-01-30 23:01:21,647 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
65
- 2022-01-30 23:01:21,648 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
66
- 2022-01-30 23:01:36,814 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
67
- 2022-01-30 23:01:36,814 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
68
- 2022-01-30 23:01:48,941 DEBUG SenderThread:28776 [sender.py:send():234] send: stats
69
- 2022-01-30 23:01:51,982 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
70
- 2022-01-30 23:01:51,982 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
71
- 2022-01-30 23:02:07,146 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
72
- 2022-01-30 23:02:07,146 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
73
- 2022-01-30 23:02:07,242 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
74
- 2022-01-30 23:02:19,172 DEBUG SenderThread:28776 [sender.py:send():234] send: stats
75
- 2022-01-30 23:02:22,319 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
76
- 2022-01-30 23:02:22,319 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
77
- 2022-01-30 23:02:37,476 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
78
- 2022-01-30 23:02:37,476 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
79
- 2022-01-30 23:02:49,382 DEBUG SenderThread:28776 [sender.py:send():234] send: stats
80
- 2022-01-30 23:02:52,639 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
81
- 2022-01-30 23:02:52,640 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
82
- 2022-01-30 23:02:57,275 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
83
- 2022-01-30 23:02:59,276 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
84
- 2022-01-30 23:03:05,280 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
85
- 2022-01-30 23:03:07,282 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
86
- 2022-01-30 23:03:07,846 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
87
- 2022-01-30 23:03:07,847 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
88
- 2022-01-30 23:03:10,283 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
89
- 2022-01-30 23:03:12,285 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
90
- 2022-01-30 23:03:14,286 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
91
- 2022-01-30 23:03:15,041 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
92
- 2022-01-30 23:03:15,042 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
93
- 2022-01-30 23:03:15,042 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
94
- 2022-01-30 23:03:15,042 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
95
- 2022-01-30 23:03:15,042 DEBUG SenderThread:28776 [sender.py:send():234] send: metric
96
- 2022-01-30 23:03:15,043 DEBUG SenderThread:28776 [sender.py:send():234] send: history
97
- 2022-01-30 23:03:15,043 DEBUG SenderThread:28776 [sender.py:send():234] send: summary
98
- 2022-01-30 23:03:15,044 INFO SenderThread:28776 [sender.py:_save_file():939] saving file wandb-summary.json with policy end
99
- 2022-01-30 23:03:15,287 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/wandb-summary.json
100
- 2022-01-30 23:03:16,288 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
101
- 2022-01-30 23:03:17,289 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
102
- 2022-01-30 23:03:18,289 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
103
- 2022-01-30 23:03:19,619 DEBUG SenderThread:28776 [sender.py:send():234] send: stats
104
- 2022-01-30 23:03:23,106 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
105
- 2022-01-30 23:03:23,107 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
106
- 2022-01-30 23:03:23,294 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/config.yaml
107
- 2022-01-30 23:03:38,276 DEBUG HandlerThread:28776 [handler.py:handle_request():130] handle_request: stop_status
108
- 2022-01-30 23:03:38,277 DEBUG SenderThread:28776 [sender.py:send_request():248] send_request: stop_status
109
- 2022-01-30 23:03:41,306 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
110
- 2022-01-30 23:03:42,306 INFO Thread-8 :28776 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/files/output.log
 
wandb/run-20220130_230018-ktkg6ghu/logs/debug.log DELETED
@@ -1,24 +0,0 @@
1
- 2022-01-30 23:00:18,934 INFO MainThread:28546 [wandb_setup.py:_flush():71] setting env: {'project': 'xls-r-300-fr'}
2
- 2022-01-30 23:00:18,934 INFO MainThread:28546 [wandb_setup.py:_flush():71] setting login settings: {}
3
- 2022-01-30 23:00:18,935 INFO MainThread:28546 [wandb_init.py:_log_setup():371] Logging user logs to /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/logs/debug.log
4
- 2022-01-30 23:00:18,935 INFO MainThread:28546 [wandb_init.py:_log_setup():372] Logging internal logs to /workspace/xls-r-300m-fr/wandb/run-20220130_230018-ktkg6ghu/logs/debug-internal.log
5
- 2022-01-30 23:00:18,935 INFO MainThread:28546 [wandb_init.py:init():404] calling init triggers
6
- 2022-01-30 23:00:18,935 INFO MainThread:28546 [wandb_init.py:init():409] wandb.init called with sweep_config: {}
7
- config: {}
8
- 2022-01-30 23:00:18,935 INFO MainThread:28546 [wandb_init.py:init():460] starting backend
9
- 2022-01-30 23:00:18,935 INFO MainThread:28546 [backend.py:_multiprocessing_setup():99] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
10
- 2022-01-30 23:00:19,011 INFO MainThread:28546 [backend.py:ensure_launched():216] starting backend process...
11
- 2022-01-30 23:00:19,071 INFO MainThread:28546 [backend.py:ensure_launched():221] started backend process with pid: 28776
12
- 2022-01-30 23:00:19,073 INFO MainThread:28546 [wandb_init.py:init():469] backend started and connected
13
- 2022-01-30 23:00:19,081 INFO MainThread:28546 [wandb_init.py:init():533] updated telemetry
14
- 2022-01-30 23:00:19,256 INFO MainThread:28546 [wandb_init.py:init():563] communicating current version
15
- 2022-01-30 23:00:19,948 INFO MainThread:28546 [wandb_init.py:init():568] got version response
16
- 2022-01-30 23:00:19,948 INFO MainThread:28546 [wandb_init.py:init():578] communicating run to backend with 30 second timeout
17
- 2022-01-30 23:00:20,172 INFO MainThread:28546 [wandb_init.py:init():606] starting run threads in backend
18
- 2022-01-30 23:00:20,795 INFO MainThread:28546 [wandb_run.py:_console_start():1810] atexit reg
19
- 2022-01-30 23:00:20,796 INFO MainThread:28546 [wandb_run.py:_redirect():1684] redirect: SettingsConsole.REDIRECT
20
- 2022-01-30 23:00:20,797 INFO MainThread:28546 [wandb_run.py:_redirect():1689] Redirecting console.
21
- 2022-01-30 23:00:20,803 INFO MainThread:28546 [wandb_run.py:_redirect():1745] Redirects installed.
22
- 2022-01-30 23:00:20,803 INFO MainThread:28546 [wandb_init.py:init():633] run started, returning control to user process
23
- 2022-01-30 23:00:20,805 INFO MainThread:28546 [wandb_run.py:_config_callback():956] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'chunk_size_feed_forward': 0, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'architectures': ['Wav2Vec2ForPreTraining'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': 1, 'pad_token_id': 40, 'eos_token_id': 2, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'facebook/wav2vec2-xls-r-300m', 'transformers_version': '4.17.0.dev0', 'feat_extract_dropout': 0.0, 'model_type': 'wav2vec2', 'num_feat_extract_layers': 7, 'hidden_size': 1024, 'feat_extract_norm': 'layer', 'feat_extract_activation': 'gelu', 'conv_dim': [512, 512, 512, 512, 512, 512, 512], 'conv_stride': [5, 2, 2, 2, 2, 2, 2], 'conv_kernel': [10, 3, 3, 3, 3, 2, 2], 'conv_bias': True, 'num_conv_pos_embeddings': 128, 'num_conv_pos_embedding_groups': 16, 'num_hidden_layers': 24, 'intermediate_size': 4096, 'hidden_act': 'gelu', 'num_attention_heads': 16, 'hidden_dropout': 0.0, 'attention_dropout': 0.0, 'activation_dropout': 0.1, 
'feat_proj_dropout': 0.0, 'final_dropout': 0.0, 'layerdrop': 0.0, 'layer_norm_eps': 1e-05, 'initializer_range': 0.02, 'vocab_size': 41, 'do_stable_layer_norm': True, 'use_weighted_layer_sum': False, 'apply_spec_augment': True, 'mask_time_prob': 0.75, 'mask_time_length': 10, 'mask_time_min_masks': 2, 'mask_feature_prob': 0.25, 'mask_feature_length': 64, 'mask_feature_min_masks': 0, 'num_codevectors_per_group': 320, 'num_codevector_groups': 2, 'contrastive_logits_temperature': 0.1, 'feat_quantizer_dropout': 0.0, 'num_negatives': 100, 'codevector_dim': 768, 'proj_codevector_dim': 768, 'diversity_loss_weight': 0.1, 'ctc_loss_reduction': 'mean', 'ctc_zero_infinity': False, 'add_adapter': False, 'adapter_kernel_size': 3, 'adapter_stride': 2, 'num_adapter_layers': 3, 'output_hidden_size': 1024, 'classifier_proj_size': 256, 'tdnn_dim': [512, 512, 512, 512, 1500], 'tdnn_kernel': [5, 3, 3, 1, 1], 'tdnn_dilation': [1, 2, 3, 1, 1], 'xvector_output_dim': 512, 'output_dir': './', 'overwrite_output_dir': True, 'do_train': True, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'steps', 'prediction_loss_only': False, 'per_device_train_batch_size': 8, 'per_device_eval_batch_size': 8, 'per_gpu_train_batch_size': 'None', 'per_gpu_eval_batch_size': 'None', 'gradient_accumulation_steps': 8, 'eval_accumulation_steps': 'None', 'learning_rate': 7.5e-05, 'weight_decay': 0.0, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 0.4, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'warmup_ratio': 0.0, 'warmup_steps': 2000, 'log_level': -1, 'log_level_replica': -1, 'log_on_each_node': True, 'logging_dir': './runs/Jan30_22-59-56_job-3261699b-76eb-4c28-8419-66a66c5c9199', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 100, 'logging_nan_inf_filter': True, 'save_strategy': 'steps', 'save_steps': 500, 'save_total_limit': 3, 'save_on_each_node': False, 'no_cuda': False, 'seed': 42, 'bf16': False, 'fp16': 
True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'amp', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': 'None', 'local_rank': -1, 'xpu_backend': 'None', 'tpu_num_cores': 'None', 'tpu_metrics_debug': False, 'debug': '[]', 'dataloader_drop_last': False, 'eval_steps': 500, 'dataloader_num_workers': 0, 'past_index': -1, 'run_name': './', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': 'None', 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'sharded_ddp': '[]', 'deepspeed': 'None', 'label_smoothing_factor': 0.0, 'optim': 'adamw_hf', 'adafactor': False, 'group_by_length': True, 'length_column_name': 'input_length', 'report_to': "['wandb']", 'ddp_find_unused_parameters': 'None', 'ddp_bucket_cap_mb': 'None', 'dataloader_pin_memory': True, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': True, 'resume_from_checkpoint': 'None', 'hub_model_id': 'None', 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'gradient_checkpointing': True, 'fp16_backend': 'auto', 'push_to_hub_model_id': 'None', 'push_to_hub_organization': 'None', 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', '_n_gpu': 1, 'mp_parameters': '', 'train_batch_size': 8, 'eval_batch_size': 8}
24
- 2022-01-30 23:00:20,811 INFO MainThread:28546 [wandb_watch.py:watch():43] Watching
 
wandb/run-20220130_230018-ktkg6ghu/run-ktkg6ghu.wandb DELETED
Binary file (16.2 kB)