siuze committed on
Commit
4f0b12e
1 Parent(s): 1ae6bdf

Update model

README.md ADDED
@@ -0,0 +1,329 @@
+ ---
+ tags:
+ - espnet
+ - audio
+ - automatic-speech-recognition
+ language: foc
+ datasets:
+ - mini_an4
+ license: cc-by-4.0
+ ---
+
+ ## ESPnet2 ASR model
+
+ ### `siuze/FOC-yngping`
+
+ This model was trained by siuze using the mini_an4 recipe in [espnet](https://github.com/espnet/espnet/).
+
+ ### Demo: How to use in ESPnet2
+
+ Follow the [ESPnet installation instructions](https://espnet.github.io/espnet/installation.html)
+ if you haven't done that already.
+
+ ```bash
+ cd espnet
+ git checkout 52160d6ed337e9dec74dd59695fec1548042e0b2
+ pip install -e .
+ cd egs2/mini_an4/foc
+ ./run.sh --skip_data_prep false --skip_train true --download_model siuze/FOC-yngping
+ ```
+
+ <!-- Generated by scripts/utils/show_asr_result.sh -->
+ # RESULTS
+ ## Environments
+ - date: `Wed Apr 19 20:30:35 CST 2023`
+ - python version: `3.8.16 | packaged by conda-forge | (default, Feb 1 2023, 16:01:55) [GCC 11.3.0]`
+ - espnet version: `espnet 202301`
+ - pytorch version: `pytorch 1.10.0`
+ - Git hash: `52160d6ed337e9dec74dd59695fec1548042e0b2`
+ - Commit date: `Thu Mar 16 21:37:39 2023 +0000`
+
+ ## exp/asr_train_asr_transformer_raw_foc_char
+ ### WER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |inference_asr_model_valid.acc.ave/test|500|1083|74.5|24.9|0.6|0.4|25.9|41.2|
+
+ ### CER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |inference_asr_model_valid.acc.ave/test|500|6377|93.8|4.4|1.8|1.9|8.1|41.2|
+
+ ### TER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+
+ ## ASR config
+
+ <details><summary>expand</summary>
+
+ ```
+ config: conf/train_asr_transformer.yaml
+ print_config: false
+ log_level: INFO
+ dry_run: false
+ iterator_type: sequence
+ output_dir: exp/asr_train_asr_transformer_raw_foc_char
+ ngpu: 1
+ seed: 0
+ num_workers: 1
+ num_att_plot: 3
+ dist_backend: nccl
+ dist_init_method: env://
+ dist_world_size: null
+ dist_rank: null
+ local_rank: 0
+ dist_master_addr: null
+ dist_master_port: null
+ dist_launcher: null
+ multiprocessing_distributed: false
+ unused_parameters: false
+ sharded_ddp: false
+ cudnn_enabled: true
+ cudnn_benchmark: false
+ cudnn_deterministic: true
+ collect_stats: false
+ write_collected_feats: false
+ max_epoch: 60
+ patience: null
+ val_scheduler_criterion:
+ - valid
+ - loss
+ early_stopping_criterion:
+ - valid
+ - loss
+ - min
+ best_model_criterion:
+ -   - valid
+     - acc
+     - max
+ keep_nbest_models: 10
+ nbest_averaging_interval: 0
+ grad_clip: 5.0
+ grad_clip_type: 2.0
+ grad_noise: false
+ accum_grad: 8
+ no_forward_run: false
+ resume: true
+ train_dtype: float32
+ use_amp: false
+ log_interval: null
+ use_matplotlib: true
+ use_tensorboard: true
+ create_graph_in_tensorboard: false
+ use_wandb: false
+ wandb_project: null
+ wandb_id: null
+ wandb_entity: null
+ wandb_name: null
+ wandb_model_log_interval: -1
+ detect_anomaly: false
+ pretrain_path: null
+ init_param:
+ - /home/pro-c/yewei/espnet/egs2/mini_an4/asr1/exp/asr_train_asr_transformer_raw_can_char/valid.acc.best.pth
+ ignore_init_mismatch: true
+ freeze_param: []
+ num_iters_per_epoch: null
+ batch_size: 16
+ valid_batch_size: null
+ batch_bins: 1000000
+ valid_batch_bins: null
+ train_shape_file:
+ - exp/asr_stats_raw_foc_char/train/speech_shape
+ - exp/asr_stats_raw_foc_char/train/text_shape.char
+ valid_shape_file:
+ - exp/asr_stats_raw_foc_char/valid/speech_shape
+ - exp/asr_stats_raw_foc_char/valid/text_shape.char
+ batch_type: folded
+ valid_batch_type: null
+ fold_length:
+ - 80000
+ - 150
+ sort_in_batch: descending
+ sort_batch: descending
+ multiple_iterator: false
+ chunk_length: 500
+ chunk_shift_ratio: 0.5
+ num_cache_chunks: 1024
+ chunk_excluded_key_prefixes: []
+ train_data_path_and_name_and_type:
+ -   - dump/raw/train/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/train/text
+     - text
+     - text
+ valid_data_path_and_name_and_type:
+ -   - dump/raw/dev/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/dev/text
+     - text
+     - text
+ allow_variable_data_keys: false
+ max_cache_size: 0.0
+ max_cache_fd: 32
+ valid_max_cache_size: null
+ exclude_weight_decay: false
+ exclude_weight_decay_conf: {}
+ optim: adam
+ optim_conf:
+     lr: 0.005
+ scheduler: warmuplr
+ scheduler_conf:
+     warmup_steps: 30000
+ token_list:
+ - <blank>
+ - <unk>
+ - <space>
+ - '3'
+ - '2'
+ - '5'
+ - g
+ - o
+ - a
+ - n
+ - i
+ - '4'
+ - u
+ - e
+ - k
+ - '1'
+ - j
+ - y
+ - z
+ - s
+ - h
+ - d
+ - m
+ - l
+ - c
+ - b
+ - f
+ - t
+ - w
+ - p
+ - r
+ - x
+ - v
+ - q
+ - <sos/eos>
+ init: xavier_uniform
+ input_size: null
+ ctc_conf:
+     dropout_rate: 0.0
+     ctc_type: builtin
+     reduce: true
+     ignore_nan_grad: null
+     zero_infinity: true
+ joint_net_conf: null
+ use_preprocessor: true
+ token_type: char
+ bpemodel: null
+ non_linguistic_symbols: null
+ cleaner: null
+ g2p: null
+ speech_volume_normalize: null
+ rir_scp: null
+ rir_apply_prob: 1.0
+ noise_scp: null
+ noise_apply_prob: 1.0
+ noise_db_range: '13_15'
+ short_noise_thres: 0.5
+ aux_ctc_tasks: []
+ frontend: default
+ frontend_conf:
+     fs: 16k
+ specaug: specaug
+ specaug_conf:
+     apply_time_warp: true
+     time_warp_window: 5
+     time_warp_mode: bicubic
+     apply_freq_mask: true
+     freq_mask_width_range:
+     - 0
+     - 27
+     num_freq_mask: 2
+     apply_time_mask: true
+     time_mask_width_ratio_range:
+     - 0.0
+     - 0.05
+     num_time_mask: 10
+ normalize: global_mvn
+ normalize_conf:
+     stats_file: exp/asr_stats_raw_foc_char/train/feats_stats.npz
+ model: espnet
+ model_conf:
+     ctc_weight: 0.3
+     lsm_weight: 0.1
+     length_normalized_loss: false
+ preencoder: null
+ preencoder_conf: {}
+ encoder: transformer
+ encoder_conf:
+     output_size: 256
+     attention_heads: 4
+     linear_units: 2048
+     num_blocks: 12
+     dropout_rate: 0.1
+     positional_dropout_rate: 0.1
+     attention_dropout_rate: 0.0
+     input_layer: conv2d
+     normalize_before: true
+ postencoder: null
+ postencoder_conf: {}
+ decoder: transformer
+ decoder_conf:
+     attention_heads: 4
+     linear_units: 2048
+     num_blocks: 6
+     dropout_rate: 0.1
+     positional_dropout_rate: 0.1
+     self_attention_dropout_rate: 0.0
+     src_attention_dropout_rate: 0.0
+ preprocessor: default
+ preprocessor_conf: {}
+ required:
+ - output_dir
+ - token_list
+ version: '202301'
+ distributed: false
+ ```
+
+ </details>
+
+
+
+ ### Citing ESPnet
+
+ ```BibTex
+ @inproceedings{watanabe2018espnet,
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
+ title={{ESPnet}: End-to-End Speech Processing Toolkit},
+ year={2018},
+ booktitle={Proceedings of Interspeech},
+ pages={2207--2211},
+ doi={10.21437/Interspeech.2018-1456},
+ url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
+ }
+
+
+
+
+ ```
+
+ or arXiv:
+
+ ```bibtex
+ @misc{watanabe2018espnet,
+ title={ESPnet: End-to-End Speech Processing Toolkit},
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
+ year={2018},
+ eprint={1804.00015},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL}
+ }
+ ```
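Note on `model_conf` in the README's ASR config: `ctc_weight: 0.3` sets how the CTC and attention-decoder losses are interpolated in a hybrid CTC/attention model. The following is a minimal sketch of that weighting only, not ESPnet's own code; the function name `hybrid_loss` and the numeric loss values are hypothetical and chosen purely for illustration.

```python
def hybrid_loss(loss_ctc: float, loss_att: float, ctc_weight: float = 0.3) -> float:
    """Interpolate a CTC loss and an attention loss with the given weight.

    ctc_weight defaults to 0.3, the value listed under model_conf.
    """
    if not 0.0 <= ctc_weight <= 1.0:
        raise ValueError("ctc_weight must lie in [0, 1]")
    return ctc_weight * loss_ctc + (1.0 - ctc_weight) * loss_att


# Hypothetical per-batch loss values, not taken from the training logs.
combined = hybrid_loss(loss_ctc=10.0, loss_att=20.0)
print(f"combined loss: {combined:.2f}")
```

With `ctc_weight` at 0.3 the attention branch carries 70% of the objective; setting it to 1.0 or 0.0 would reduce the sketch to pure CTC or pure attention training respectively.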
exp/asr_stats_raw_foc_char/train/feats_stats.npz ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7ff098f3723cfd380039890a9e1d5f52c26856fa88e1f77c7dffe2eba1a3c7df
+ size 1402
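Note: the three lines added for `feats_stats.npz` are a Git LFS pointer, not the statistics file itself. The repository stores only the spec version, the SHA-256 object ID, and the byte size of the real object. A small sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of Git LFS or ESPnet.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer content copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7ff098f3723cfd380039890a9e1d5f52c26856fa88e1f77c7dffe2eba1a3c7df
size 1402
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algorithm"], info["size_bytes"])
```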
exp/asr_train_asr_transformer_raw_foc_char/RESULTS.md ADDED
@@ -0,0 +1,27 @@
+ <!-- Generated by scripts/utils/show_asr_result.sh -->
+ # RESULTS
+ ## Environments
+ - date: `Wed Apr 19 20:30:35 CST 2023`
+ - python version: `3.8.16 | packaged by conda-forge | (default, Feb 1 2023, 16:01:55) [GCC 11.3.0]`
+ - espnet version: `espnet 202301`
+ - pytorch version: `pytorch 1.10.0`
+ - Git hash: `52160d6ed337e9dec74dd59695fec1548042e0b2`
+ - Commit date: `Thu Mar 16 21:37:39 2023 +0000`
+
+ ## exp/asr_train_asr_transformer_raw_foc_char
+ ### WER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |inference_asr_model_valid.acc.ave/test|500|1083|74.5|24.9|0.6|0.4|25.9|41.2|
+
+ ### CER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |inference_asr_model_valid.acc.ave/test|500|6377|93.8|4.4|1.8|1.9|8.1|41.2|
+
+ ### TER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
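Note on reading the WER and CER rows in RESULTS.md: in this sclite-style layout the columns are percentages of the reference tokens, so `Err` should equal `Sub + Del + Ins`, and `Corr + Sub + Del` should account for all reference tokens. The sketch below re-checks both identities using only the numbers reported in the tables; the helper name `check_row` is an assumption made for this example.

```python
def check_row(corr: float, sub: float, dele: float, ins: float) -> tuple:
    """Return (error rate, reference coverage), both in percent."""
    err = round(sub + dele + ins, 1)        # all edit operations
    coverage = round(corr + sub + dele, 1)  # every reference token is one of these
    return err, coverage


# Values copied from the WER and CER tables above.
rows = {
    "WER": {"corr": 74.5, "sub": 24.9, "dele": 0.6, "ins": 0.4, "reported_err": 25.9},
    "CER": {"corr": 93.8, "sub": 4.4, "dele": 1.8, "ins": 1.9, "reported_err": 8.1},
}

results = {}
for name, row in rows.items():
    err, coverage = check_row(row["corr"], row["sub"], row["dele"], row["ins"])
    results[name] = (err, coverage)
    print(f"{name}: Err={err} (reported {row['reported_err']}), Corr+Sub+Del={coverage}")
```

Because insertions are counted against the reference length, `Err` can exceed `100 - Corr`, which is why the CER row shows 8.1 rather than 6.2.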
exp/asr_train_asr_transformer_raw_foc_char/config.yaml ADDED
@@ -0,0 +1,230 @@
+ config: conf/train_asr_transformer.yaml
+ print_config: false
+ log_level: INFO
+ dry_run: false
+ iterator_type: sequence
+ output_dir: exp/asr_train_asr_transformer_raw_foc_char
+ ngpu: 1
+ seed: 0
+ num_workers: 1
+ num_att_plot: 3
+ dist_backend: nccl
+ dist_init_method: env://
+ dist_world_size: null
+ dist_rank: null
+ local_rank: 0
+ dist_master_addr: null
+ dist_master_port: null
+ dist_launcher: null
+ multiprocessing_distributed: false
+ unused_parameters: false
+ sharded_ddp: false
+ cudnn_enabled: true
+ cudnn_benchmark: false
+ cudnn_deterministic: true
+ collect_stats: false
+ write_collected_feats: false
+ max_epoch: 60
+ patience: null
+ val_scheduler_criterion:
+ - valid
+ - loss
+ early_stopping_criterion:
+ - valid
+ - loss
+ - min
+ best_model_criterion:
+ -   - valid
+     - acc
+     - max
+ keep_nbest_models: 10
+ nbest_averaging_interval: 0
+ grad_clip: 5.0
+ grad_clip_type: 2.0
+ grad_noise: false
+ accum_grad: 8
+ no_forward_run: false
+ resume: true
+ train_dtype: float32
+ use_amp: false
+ log_interval: null
+ use_matplotlib: true
+ use_tensorboard: true
+ create_graph_in_tensorboard: false
+ use_wandb: false
+ wandb_project: null
+ wandb_id: null
+ wandb_entity: null
+ wandb_name: null
+ wandb_model_log_interval: -1
+ detect_anomaly: false
+ pretrain_path: null
+ init_param:
+ - /home/pro-c/yewei/espnet/egs2/mini_an4/asr1/exp/asr_train_asr_transformer_raw_can_char/valid.acc.best.pth
+ ignore_init_mismatch: true
+ freeze_param: []
+ num_iters_per_epoch: null
+ batch_size: 16
+ valid_batch_size: null
+ batch_bins: 1000000
+ valid_batch_bins: null
+ train_shape_file:
+ - exp/asr_stats_raw_foc_char/train/speech_shape
+ - exp/asr_stats_raw_foc_char/train/text_shape.char
+ valid_shape_file:
+ - exp/asr_stats_raw_foc_char/valid/speech_shape
+ - exp/asr_stats_raw_foc_char/valid/text_shape.char
+ batch_type: folded
+ valid_batch_type: null
+ fold_length:
+ - 80000
+ - 150
+ sort_in_batch: descending
+ sort_batch: descending
+ multiple_iterator: false
+ chunk_length: 500
+ chunk_shift_ratio: 0.5
+ num_cache_chunks: 1024
+ chunk_excluded_key_prefixes: []
+ train_data_path_and_name_and_type:
+ -   - dump/raw/train/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/train/text
+     - text
+     - text
+ valid_data_path_and_name_and_type:
+ -   - dump/raw/dev/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/dev/text
+     - text
+     - text
+ allow_variable_data_keys: false
+ max_cache_size: 0.0
+ max_cache_fd: 32
+ valid_max_cache_size: null
+ exclude_weight_decay: false
+ exclude_weight_decay_conf: {}
+ optim: adam
+ optim_conf:
+     lr: 0.005
+ scheduler: warmuplr
+ scheduler_conf:
+     warmup_steps: 30000
+ token_list:
+ - <blank>
+ - <unk>
+ - <space>
+ - '3'
+ - '2'
+ - '5'
+ - g
+ - o
+ - a
+ - n
+ - i
+ - '4'
+ - u
+ - e
+ - k
+ - '1'
+ - j
+ - y
+ - z
+ - s
+ - h
+ - d
+ - m
+ - l
+ - c
+ - b
+ - f
+ - t
+ - w
+ - p
+ - r
+ - x
+ - v
+ - q
+ - <sos/eos>
+ init: xavier_uniform
+ input_size: null
+ ctc_conf:
+     dropout_rate: 0.0
+     ctc_type: builtin
+     reduce: true
+     ignore_nan_grad: null
+     zero_infinity: true
+ joint_net_conf: null
+ use_preprocessor: true
+ token_type: char
+ bpemodel: null
+ non_linguistic_symbols: null
+ cleaner: null
+ g2p: null
+ speech_volume_normalize: null
+ rir_scp: null
+ rir_apply_prob: 1.0
+ noise_scp: null
+ noise_apply_prob: 1.0
+ noise_db_range: '13_15'
+ short_noise_thres: 0.5
+ aux_ctc_tasks: []
+ frontend: default
+ frontend_conf:
+     fs: 16k
+ specaug: specaug
+ specaug_conf:
+     apply_time_warp: true
+     time_warp_window: 5
+     time_warp_mode: bicubic
+     apply_freq_mask: true
+     freq_mask_width_range:
+     - 0
+     - 27
+     num_freq_mask: 2
+     apply_time_mask: true
+     time_mask_width_ratio_range:
+     - 0.0
+     - 0.05
+     num_time_mask: 10
+ normalize: global_mvn
+ normalize_conf:
+     stats_file: exp/asr_stats_raw_foc_char/train/feats_stats.npz
+ model: espnet
+ model_conf:
+     ctc_weight: 0.3
+     lsm_weight: 0.1
+     length_normalized_loss: false
+ preencoder: null
+ preencoder_conf: {}
+ encoder: transformer
+ encoder_conf:
+     output_size: 256
+     attention_heads: 4
+     linear_units: 2048
+     num_blocks: 12
+     dropout_rate: 0.1
+     positional_dropout_rate: 0.1
+     attention_dropout_rate: 0.0
+     input_layer: conv2d
+     normalize_before: true
+ postencoder: null
+ postencoder_conf: {}
+ decoder: transformer
+ decoder_conf:
+     attention_heads: 4
+     linear_units: 2048
+     num_blocks: 6
+     dropout_rate: 0.1
+     positional_dropout_rate: 0.1
+     self_attention_dropout_rate: 0.0
+     src_attention_dropout_rate: 0.0
+ preprocessor: default
+ preprocessor_conf: {}
+ required:
+ - output_dir
+ - token_list
+ version: '202301'
+ distributed: false
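Note on `scheduler: warmuplr` with `lr: 0.005` and `warmup_steps: 30000` in config.yaml: this is a Noam-style schedule that ramps the learning rate linearly up to the configured `lr` at step `warmup_steps` and then decays it with the inverse square root of the step. The sketch below reproduces that shape as a standalone function; it is written for illustration under the assumption that the schedule follows the formula in the comment, and `warmup_lr` is a hypothetical name rather than an ESPnet function.

```python
def warmup_lr(step: int, base_lr: float = 0.005, warmup_steps: int = 30000) -> float:
    """Noam-style warmup schedule.

    lr = base_lr * warmup_steps**0.5 * min(step**-0.5, step * warmup_steps**-1.5)
    """
    if step < 1:
        raise ValueError("step must be >= 1")
    return base_lr * warmup_steps**0.5 * min(step**-0.5, step * warmup_steps**-1.5)


# Sample the schedule before, at, and after the warmup boundary.
for step in (1, 15000, 30000, 120000):
    print(f"step {step:>6}: lr = {warmup_lr(step):.6f}")
```

Under this formula the peak equals the configured `lr` exactly at step 30000, the rate is half the peak halfway through warmup, and it falls back to half the peak again at four times the warmup length.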
exp/asr_train_asr_transformer_raw_foc_char/images/acc.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/backward_time.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/cer.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/cer_ctc.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/forward_time.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/gpu_max_cached_mem_GB.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/iter_time.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/loss.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/loss_att.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/loss_ctc.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/optim0_lr0.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/optim_step_time.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/train_time.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/images/wer.png ADDED
exp/asr_train_asr_transformer_raw_foc_char/valid.acc.ave_10best.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:592b170fe66e194555eeefeee626f81a24611a0b58abf87fbcddb06ae68c467a
+ size 108694309
meta.yaml ADDED
@@ -0,0 +1,9 @@
+ espnet: '202301'
+ files:
+   asr_model_file: exp/asr_train_asr_transformer_raw_foc_char/valid.acc.ave_10best.pth
+ python: "3.8.16 | packaged by conda-forge | (default, Feb 1 2023, 16:01:55) \n[GCC\
+   \ 11.3.0]"
+ timestamp: 1681908736.234121
+ torch: 1.10.0
+ yaml_files:
+   asr_train_config: exp/asr_train_asr_transformer_raw_foc_char/config.yaml
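Note on `timestamp` in meta.yaml: the value is a Unix epoch in seconds. Converting it shows when the model was packed relative to the `date` line in RESULTS.md. The sketch below does the conversion with the standard library; treating the `CST` in the results date as China Standard Time (UTC+8) is an assumption, since the abbreviation is ambiguous.

```python
from datetime import datetime, timedelta, timezone

# Value copied from meta.yaml.
TIMESTAMP = 1681908736.234121

# Interpret the epoch in UTC, then shift to UTC+8
# (assumed meaning of "CST" in the RESULTS.md date line).
packed_utc = datetime.fromtimestamp(TIMESTAMP, tz=timezone.utc)
packed_cst = packed_utc.astimezone(timezone(timedelta(hours=8)))

print("UTC  :", packed_utc.strftime("%Y-%m-%d %H:%M:%S"))
print("UTC+8:", packed_cst.strftime("%Y-%m-%d %H:%M:%S"))
```

Under that assumption the packing time falls about 22 minutes after the `Wed Apr 19 20:30:35 CST 2023` date recorded with the results.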