pkufool commited on
Commit
a916b02
·
1 Parent(s): 362880d

Add bbpe aishell pretrain model

Browse files
data/lang_bbpe_500/L.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c7cea98f81b431dfdc8ead362422e3821a0d8e63c97495bf7b8d8d6531ede56
3
+ size 3492967
data/lang_bbpe_500/LG.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6bcd10f6cc92fe20a3fd36fbc24819a6d4fdcec3ad96d87a479a3dc4a4ed35d4
3
+ size 84837411
data/lang_bbpe_500/Linv.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b0bc307920e29122ab88d99d959105161f91efdd3f1e7568a28aa8b65722a98f
3
+ size 3492967
data/lang_bbpe_500/bbpe.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2303acda314119dbd58fc7e4dbbc9bb0d40de71400386066bd77a6e247fe9b92
3
+ size 246116
data/lang_bbpe_500/tokens.txt ADDED
@@ -0,0 +1,502 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <blk> 0
2
+ <sos/eos> 1
3
+ <unk> 2
4
+ ▁ƌ 3
5
+ ▁ƍ 4
6
+ ▁Ǝ 5
7
+ ▁ƎĽĥ 6
8
+ ▁Ə 7
9
+ ▁Ɛ 8
10
+ ▁ƋŞġ 9
11
+ ĩ 10
12
+ Ŕ 11
13
+ Į 12
14
+ Ş 13
15
+ Ő 14
16
+ ń 15
17
+ ť 16
18
+ ▁ƌŁŎ 17
19
+ ł 18
20
+ Ť 19
21
+ ŗ 20
22
+ Ħ 21
23
+ Ģ 22
24
+ Ī 23
25
+ ▁ƌİ 24
26
+ ī 25
27
+ š 26
28
+ ţ 27
29
+ ļ 28
30
+ œ 29
31
+ ħ 30
32
+ ġ 31
33
+ Ŗ 32
34
+ ▁ƍĻŕ 33
35
+ ▁ƋŞœ 34
36
+ Š 35
37
+ ĺ 36
38
+ Ś 37
39
+ ▁ƌĮĢ 38
40
+ Ň 39
41
+ ŋ 40
42
+ ▁ƋŠŠ 41
43
+ ņ 42
44
+ Ł 43
45
+ Ĵ 44
46
+ ŕ 45
47
+ ř 46
48
+ ģ 47
49
+ ▁ƍŁĪ 48
50
+ ľ 49
51
+ Ŋ 50
52
+ Ō 51
53
+ ▁ƋŠħ 52
54
+ ▁ƋŠĭ 53
55
+ Ĥ 54
56
+ ŝ 55
57
+ ▁ƋŞī 56
58
+ ň 57
59
+ ĵ 58
60
+ Ļ 59
61
+ Ř 60
62
+ ▁ƋŞĮ 61
63
+ ▁ƌľţ 62
64
+ ĸ 63
65
+ ŏ 64
66
+ ō 65
67
+ Ţ 66
68
+ Ĺ 67
69
+ ▁ƌŞģ 68
70
+ Ĭ 69
71
+ Ņ 70
72
+ ĭ 71
73
+ ķ 72
74
+ ▁ƌŊō 73
75
+ ▁ƋŞĽ 74
76
+ ▁Ǝš 75
77
+ Ľ 76
78
+ ▁ƋŞŠ 77
79
+ İ 78
80
+ ▁ƌşŚ 79
81
+ ş 80
82
+ Ŏ 81
83
+ į 82
84
+ ▁ƋŞĪ 83
85
+ ĥ 84
86
+ ı 85
87
+ ▁ƌİĴ 86
88
+ Ĩ 87
89
+ ▁ƌĦ 88
90
+ Ķ 89
91
+ ▁Ƌš 90
92
+ ▁ƋŞŐ 91
93
+ Œ 92
94
+ ▁ƌĨŠ 93
95
+ ▁ƌŁŖ 94
96
+ Ŝ 95
97
+ ▁Əņĭ 96
98
+ ▁ƌĦŒ 97
99
+ ▁ƌĩħ 98
100
+ ▁ƋŢĽ 99
101
+ ▁ƏŚ 100
102
+ ▁ƍĩı 101
103
+ ▁ƌĵĭ 102
104
+ ▁Əťļ 103
105
+ ő 104
106
+ ▁Ƌţ 105
107
+ ▁ƌĩŖ 106
108
+ ▁ƋŠō 107
109
+ Ń 108
110
+ ▁ƌŕş 109
111
+ ▁ƋŠķ 110
112
+ ▁ƋŠ 111
113
+ ▁ƍĺŜ 112
114
+ ▁ƌŁŠ 113
115
+ ▁ƍĬ 114
116
+ ▁ƏĤţ 115
117
+ ▁ƍĩť 116
118
+ ▁ƋŞ 117
119
+ ▁ƍłŋ 118
120
+ ▁Əŕ 119
121
+ ▁ƍĹŖ 120
122
+ ▁Ƌšŋ 121
123
+ ▁ƎļŤ 122
124
+ ▁ƍĺŋ 123
125
+ ▁ƋşĬ 124
126
+ ▁Ƌş 125
127
+ ▁ƏġĦ 126
128
+ ▁ƌŖħ 127
129
+ ▁ƏŌĢ 128
130
+ ▁ƎįŖ 129
131
+ ▁ƌŔŜ 130
132
+ ▁ƌľľ 131
133
+ ▁Əśĥ 132
134
+ ▁ƌŊĽ 133
135
+ ▁ƍŁĩ 134
136
+ ▁ƐľŜ 135
137
+ ś 136
138
+ ▁Ƌşń 137
139
+ ▁ƍĹş 138
140
+ ▁ƌıį 139
141
+ ▁ƏĨ 140
142
+ ▁ƍŁŠ 141
143
+ ▁ƋŞĬ 142
144
+ ▁ƋŠį 143
145
+ ▁Ǝķń 144
146
+ ▁ƌĪĮ 145
147
+ ▁ƍŕķ 146
148
+ ▁ƍĪ 147
149
+ ▁Ǝģş 148
150
+ ▁ƌŢġ 149
151
+ ▁Ɛġ 150
152
+ ▁ƐőĻ 151
153
+ ▁ƌīŎ 152
154
+ ▁Ǝšİ 153
155
+ ▁Əťľ 154
156
+ ▁ƍīŋ 155
157
+ ▁ƎķŎ 156
158
+ ▁Əśľ 157
159
+ ▁ƋţĶ 158
160
+ ▁ƐĨĴ 159
161
+ ▁Ɛļ 160
162
+ ▁Ƌšŝ 161
163
+ ▁ƏţŌ 162
164
+ ▁ƌİŕ 163
165
+ ▁ƏŇő 164
166
+ ▁ƏťĨ 165
167
+ ▁ƋŞĨ 166
168
+ ▁ƌœı 167
169
+ ▁ƍŁŒ 168
170
+ ▁ƌĦŎ 169
171
+ ▁ƍĪĬ 170
172
+ ▁ƌİŞ 171
173
+ ▁ƋţŁ 172
174
+ ▁ƍŁġ 173
175
+ ▁Ǝķś 174
176
+ ▁ƎľŔ 175
177
+ ▁ƌĦő 176
178
+ ▁ƐĤŎ 177
179
+ ▁ƌĦœ 178
180
+ ▁ƌı 179
181
+ ▁ƍĩĴ 180
182
+ ▁ƌīľ 181
183
+ ▁ƌīŅ 182
184
+ ▁ƐłŇ 183
185
+ ▁ƋŞĤ 184
186
+ ▁ƌŠŌ 185
187
+ ▁ƋŢĢ 186
188
+ ▁ƋŠŊ 187
189
+ ▁ƌŔŃ 188
190
+ ▁ƌńį 189
191
+ ▁ƌĦĤ 190
192
+ ▁Ƌşł 191
193
+ ▁ƌŔĽ 192
194
+ ▁ƌŝŘ 193
195
+ ▁ƌŖİ 194
196
+ ▁ƏťĻ 195
197
+ ▁ƌĶĢ 196
198
+ ▁ƌĦŜ 197
199
+ ▁ƌıĭ 198
200
+ ▁ƌŝŋ 199
201
+ ▁Əġĭ 200
202
+ ▁ƎţĴ 201
203
+ ▁ƌıĩ 202
204
+ ▁ƌħĦ 203
205
+ ▁ƌĦř 204
206
+ ▁ƋšĹ 205
207
+ ▁ƋŞį 206
208
+ ▁ƍřĸ 207
209
+ ▁ƌŤĺ 208
210
+ ▁ƎŒ 209
211
+ ▁ƌŖŗ 210
212
+ ▁ƏŔŖ 211
213
+ ▁Ǝıħ 212
214
+ ▁ƌĦŋ 213
215
+ ▁Ɛĸť 214
216
+ ▁ƋŠĬ 215
217
+ ▁ƌıĮ 216
218
+ ▁ƌĸħ 217
219
+ ▁ƌŋř 218
220
+ ▁ƋšŒ 219
221
+ ▁ƐĤţ 220
222
+ ▁ƋŠŒ 221
223
+ ▁ƋŞŊ 222
224
+ ▁ƌşŜ 223
225
+ ▁ƌĩŜ 224
226
+ ▁ƌĭĹ 225
227
+ ▁ƍķť 226
228
+ ▁ƌşř 227
229
+ ▁ƍŁń 228
230
+ ▁ƏņŎ 229
231
+ ▁ƐĨĮ 230
232
+ ▁ƍŒņ 231
233
+ ▁ƋŞš 232
234
+ ▁ƍĮŔ 233
235
+ ▁ƌīņ 234
236
+ ▁Ƌťł 235
237
+ ▁ƐġĽ 236
238
+ ▁ƌĩŏ 237
239
+ ▁ƎľŞ 238
240
+ ▁ƋŤ 239
241
+ ▁ƐĢĶ 240
242
+ ▁ƌŊŏ 241
243
+ ▁ƌŗĸ 242
244
+ ▁ƍİı 243
245
+ ▁ƍīĸ 244
246
+ ▁ƌĭĺ 245
247
+ ▁ƐĨİ 246
248
+ ▁ƐĺŚ 247
249
+ ▁ƌĴĻ 248
250
+ ▁ƌšŠ 249
251
+ ▁ƌľŅ 250
252
+ ▁ƍśŝ 251
253
+ ▁ƌŋţ 252
254
+ ▁ƍĸŖ 253
255
+ ▁ƍĪġ 254
256
+ ▁ƎŔņ 255
257
+ ▁ƌŊĹ 256
258
+ ▁ƎŊŠ 257
259
+ ▁ƌĭŠ 258
260
+ ▁ƍľŚ 259
261
+ ▁ƍġō 260
262
+ ▁ƎœĪ 261
263
+ ▁ƌŠķ 262
264
+ ▁Ƌšį 263
265
+ ▁ƌŇŃ 264
266
+ ▁Əťı 265
267
+ ▁ƏŔņ 266
268
+ ▁ƌţĶ 267
269
+ ▁Ɛņş 268
270
+ ▁ƍķŜ 269
271
+ ▁Ƌťņ 270
272
+ ▁ƍĻį 271
273
+ ▁ƐĻ 272
274
+ ▁Ǝłġ 273
275
+ ▁ƏŔŤ 274
276
+ ▁ƍŖĴ 275
277
+ ▁ƏŖĤ 276
278
+ ▁Əōĥ 277
279
+ ▁ƋţĮ 278
280
+ ▁Əśŝ 279
281
+ ▁ƍĭĢ 280
282
+ ▁ƌœŌ 281
283
+ ▁ƌŤĩ 282
284
+ ▁ƍŅŢ 283
285
+ ▁ƍŘņ 284
286
+ ▁Ƌšī 285
287
+ ▁ƍœň 286
288
+ ▁Əģņ 287
289
+ ▁ƌİŖ 288
290
+ ▁ƌťĤ 289
291
+ ▁ƍĺ 290
292
+ ▁ƎĥŜ 291
293
+ ▁ƌįš 292
294
+ ▁ƌŝ 293
295
+ ▁ƌŔĪ 294
296
+ ▁ƍœŊ 295
297
+ ▁ƏŤ 296
298
+ ▁ƎŤį 297
299
+ ▁ƍŃŁ 298
300
+ ▁ƍĺŅ 299
301
+ ▁ƏĬ 300
302
+ ▁ƐĻń 301
303
+ ▁ƎōĴ 302
304
+ ▁ƎŠť 303
305
+ ▁ƌŌģ 304
306
+ ▁ƌńŠ 305
307
+ ▁ƏŠő 306
308
+ ▁ƐŁġ 307
309
+ ▁ƍĤĦ 308
310
+ ▁ƍŜĩ 309
311
+ ▁ƏťĴ 310
312
+ ▁ƏŚş 311
313
+ ▁ƌİĺ 312
314
+ ▁ƏŤŤ 313
315
+ ▁ƌŞĤ 314
316
+ ▁ƋŞĹ 315
317
+ ▁ƍŅĨ 316
318
+ ▁ƍŁŐ 317
319
+ ▁ƋŞŁ 318
320
+ ▁ƐĢ 319
321
+ ▁ƍŗģ 320
322
+ ▁ƍĢŕ 321
323
+ ▁ƏŕĢ 322
324
+ ▁ƋŠľ 323
325
+ ▁ƍœŋ 324
326
+ ▁ƎŁĬ 325
327
+ ▁ƐĨĭ 326
328
+ ▁ƌıĴ 327
329
+ ▁Ǝķŝ 328
330
+ ▁ƌĮĤ 329
331
+ ▁ƌľŃ 330
332
+ ▁ƐĺŎ 331
333
+ ▁ƍķş 332
334
+ ▁ƌŢİ 333
335
+ ▁Əঠ334
336
+ ▁ƍĥİ 335
337
+ ▁Əś 336
338
+ ▁ƍįŋ 337
339
+ ▁Ǝŗř 338
340
+ ▁Ǝšœ 339
341
+ ▁ƍīġ 340
342
+ ▁Ɛķġ 341
343
+ ▁Ƌţİ 342
344
+ ▁ƎľĴ 343
345
+ ▁ƍŖŚ 344
346
+ ▁ƍġš 345
347
+ ▁ƏĢķ 346
348
+ ▁ƌİī 347
349
+ ▁ƏŔŕ 348
350
+ ▁ƋŠť 349
351
+ ▁ƍĸŚ 350
352
+ ▁ƍĪĶ 351
353
+ ▁ƏĨř 352
354
+ ▁ƋšŜ 353
355
+ ▁Əįŝ 354
356
+ ▁ƌŢŅ 355
357
+ ▁ƌŢŠ 356
358
+ ▁ƐļŇ 357
359
+ ▁Ǝįŕ 358
360
+ ▁ƏġĢ 359
361
+ ▁Əŕŋ 360
362
+ ▁ƌŗġ 361
363
+ ▁ƎĪş 362
364
+ ▁ƐĺŔ 363
365
+ ▁ƏŚœ 364
366
+ ▁ƌķŔ 365
367
+ ▁Ǝķŗ 366
368
+ ▁ƍŋŢ 367
369
+ ▁Əŝŕ 368
370
+ ▁ƌŊĥ 369
371
+ ▁Ǝřš 370
372
+ ▁ƐŇĻ 371
373
+ ▁ƎōŖ 372
374
+ ▁ƍįŎ 373
375
+ ▁ƍĩĻ 374
376
+ ▁ƌţŗ 375
377
+ ▁ƎœĹ 376
378
+ ▁Ǝšń 377
379
+ ▁ƏŔŏ 378
380
+ ▁ƌşť 379
381
+ ▁Ƌşı 380
382
+ ▁ƌħř 381
383
+ ▁ƍŜŎ 382
384
+ ▁ƌŖĴ 383
385
+ ▁ƍĻŤ 384
386
+ ▁ƌŞŞ 385
387
+ ▁ƐŌĹ 386
388
+ ▁ƐĶŜ 387
389
+ ▁ƌŔĭ 388
390
+ ▁ƍĩŝ 389
391
+ ▁ƌŝŗ 390
392
+ ▁ƌĩľ 391
393
+ ▁ƎōĮ 392
394
+ ▁ƎőĬ 393
395
+ ▁ƍńŋ 394
396
+ ▁ƐļĮ 395
397
+ ▁ƍņĩ 396
398
+ ▁Əıŋ 397
399
+ ▁ƎŎĬ 398
400
+ ▁ƎıĤ 399
401
+ ▁ƌŊĨ 400
402
+ ▁ƐġĪ 401
403
+ ▁ƍĻĶ 402
404
+ ▁ƌīť 403
405
+ ▁ƌĨħ 404
406
+ ▁Ƌť 405
407
+ ▁ƍŒŤ 406
408
+ ▁ƎĪĭ 407
409
+ ▁ƌıĥ 408
410
+ ▁ƏŔŊ 409
411
+ ▁ƎšĶ 410
412
+ ▁ƐġŅ 411
413
+ ▁ƏŝĴ 412
414
+ ▁Ǝōŕ 413
415
+ ▁Ƌţį 414
416
+ ▁Ƌšň 415
417
+ ▁ƌœĻ 416
418
+ ▁ƍķŤ 417
419
+ ▁ƏŜĦ 418
420
+ ▁ƌįń 419
421
+ ▁ƌŝŃ 420
422
+ ▁ƌĢĽ 421
423
+ ▁ƌħĮ 422
424
+ ▁ƎŠō 423
425
+ ▁ƋŢŅ 424
426
+ ▁ƌŔŇ 425
427
+ ▁ƎĪŏ 426
428
+ ▁Ɛļŏ 427
429
+ ▁ƍŅŝ 428
430
+ ▁ƏŤĤ 429
431
+ ▁ƍĹĨ 430
432
+ ▁ƌĮĸ 431
433
+ ▁Ɛňį 432
434
+ ▁Ǝĸĭ 433
435
+ ▁ƏţŒ 434
436
+ ▁ƌĴī 435
437
+ ▁ƋŤľ 436
438
+ ▁Əōň 437
439
+ ▁ƏœŌ 438
440
+ ▁ƌŋş 439
441
+ ▁ƎįĨ 440
442
+ ▁ƌĮį 441
443
+ ▁ƐŇĥ 442
444
+ ▁Ɛľħ 443
445
+ ▁ƍįĵ 444
446
+ ▁ƌĴŎ 445
447
+ ▁ƍĭĨ 446
448
+ ▁ƌŕŢ 447
449
+ ▁ƌĮĨ 448
450
+ ▁ƍŕİ 449
451
+ ▁ƎĪĨ 450
452
+ ▁ƌťő 451
453
+ ▁ƏśŖ 452
454
+ ▁ƌŊŚ 453
455
+ ▁ƍŁŕ 454
456
+ ▁ƍŁĮ 455
457
+ ▁ƌŃĬ 456
458
+ ▁ƌĩļ 457
459
+ ▁Əōħ 458
460
+ ▁ƎľŚ 459
461
+ ▁ƎŅķ 460
462
+ ▁ƐłŃ 461
463
+ ▁ƍŠı 462
464
+ ▁ƍĹŕ 463
465
+ ▁ƎŠŌ 464
466
+ ▁ƍĹţ 465
467
+ ▁ƌĩő 466
468
+ ▁ƎŊŤ 467
469
+ ▁ƍŚš 468
470
+ ▁ƐŇħ 469
471
+ ▁ƍķŕ 470
472
+ ▁ƌħľ 471
473
+ ▁ƌļŎ 472
474
+ ▁ƍŎņ 473
475
+ ▁ƐĽŤ 474
476
+ ▁ƌĭš 475
477
+ ▁ƍĤř 476
478
+ ▁ƎįĬ 477
479
+ ▁ƌľŇ 478
480
+ ▁ƍŞŞ 479
481
+ ▁ƍŃĥ 480
482
+ ▁ƐŏŒ 481
483
+ ▁ƍĩĹ 482
484
+ ▁ƍļŠ 483
485
+ ▁ƌŢĸ 484
486
+ ▁ƌŞŌ 485
487
+ ▁ƏŃĮ 486
488
+ ▁ƎĦō 487
489
+ ▁ƎĬı 488
490
+ ▁ƌĮĺ 489
491
+ ▁ƌŘĢ 490
492
+ ▁ƌīŃ 491
493
+ Ɩ 492
494
+ Ɛ 493
495
+ Ə 494
496
+ Ǝ 495
497
+ Ƌ 496
498
+ ƍ 497
499
+ ƌ 498
500
+ ▁ 499
501
+ #0 500
502
+ #1 501
data/lang_bbpe_500/words.txt ADDED
The diff for this file is too large to render. See raw diff
 
exp/cpu_jit.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c55bb653dcef739b33d8f070b6e709a69bcdc89208f5a6b7825f069e9c370f2
3
+ size 358526334
exp/export.sh ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env bash
2
+
3
+ set -x
4
+
5
+ K2_ROOT=/ceph-hw/kangwei/code/k2_release/k2
6
+ ICEFALL=/ceph-hw/kangwei/code2/icefall_bbpe_aishell
7
+
8
+ export PYTHONPATH=$K2_ROOT/k2/python:$PYTHONPATH
9
+ export PYTHONPATH=$K2_ROOT/build/lib:$PYTHONPATH
10
+ export PYTHONPATH=$ICEFALL:$PYTHONPATH
11
+
12
+ export CUDA_VISIBLE_DEVICES=""
13
+
14
+ ./pruned_transducer_stateless7_bbpe/export.py \
15
+ --epoch 49 \
16
+ --avg 28 \
17
+ --bpe-model data/lang_bbpe_500/bbpe.model \
18
+ --exp-dir ./pruned_transducer_stateless7_bbpe/exp
19
+
20
+ ./pruned_transducer_stateless7_bbpe/export.py \
21
+ --epoch 49 \
22
+ --avg 28 \
23
+ --bpe-model data/lang_bbpe_500/bbpe.model \
24
+ --exp-dir ./pruned_transducer_stateless7_bbpe/exp
25
+ --jit 1
exp/log/log-train-2023-03-29-17-07-07-0 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-29-17-07-07-1 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-29-17-07-07-2 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-29-17-07-07-3 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-30-10-10-41-0 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-30-10-10-41-1 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-30-10-10-41-2 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-03-30-10-10-41-3 ADDED
The diff for this file is too large to render. See raw diff
 
exp/log/log-train-2023-04-05-12-04-35-0 ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-04-05 12:04:35,224 INFO [train.py:963] (0/4) Training started
2
+ 2023-04-05 12:04:35,227 INFO [train.py:973] (0/4) Device: cuda:0
3
+ 2023-04-05 12:04:35,230 INFO [train.py:982] (0/4) {'frame_shift_ms': 10.0, 'allowed_excess_duration_ratio': 0.1, 'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.23.4', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '9426c9f730820d291f5dcb06be337662595fa7b4', 'k2-git-date': 'Sun Feb 5 17:35:01 2023', 'lhotse-version': '1.13.0.dev+git.4cbd1bde.clean', 'torch-version': '1.10.0+cu102', 'torch-cuda-available': True, 'torch-cuda-version': '10.2', 'python-version': '3.8', 'icefall-git-branch': 'bbpe', 'icefall-git-sha1': 'a7e0d24-dirty', 'icefall-git-date': 'Tue Mar 28 18:53:54 2023', 'icefall-path': '/ceph-kw/kangwei/code/icefall_bbpe_aishell', 'k2-path': '/ceph-hw/kangwei/code/k2_release/k2/k2/python/k2/__init__.py', 'lhotse-path': '/ceph-hw/kangwei/dev_tools/anaconda3/envs/rnnt2/lib/python3.8/site-packages/lhotse-1.13.0.dev0+git.4cbd1bde.clean-py3.8.egg/lhotse/__init__.py', 'hostname': 'de-74279-k2-train-3-1220120619-7695ff496b-s9n4w', 'IP address': '10.177.6.147'}, 'world_size': 4, 'master_port': 12535, 'tensorboard': True, 'num_epochs': 50, 'start_epoch': 28, 'start_batch': 0, 'exp_dir': PosixPath('pruned_transducer_stateless7_bbpe/exp'), 'bbpe_model': 'data/lang_bbpe_500/bbpe.model', 'base_lr': 0.05, 'lr_batches': 5000, 'lr_epochs': 7.0, 'context_size': 2, 'prune_range': 5, 'lm_scale': 0.25, 'am_scale': 0.0, 'simple_loss_scale': 0.5, 'seed': 42, 'print_diagnostics': False, 'inf_check': False, 'save_every_n': 2000, 'keep_last_k': 30, 'average_period': 200, 'use_fp16': True, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 800, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'blank_id': 0, 'vocab_size': 500}
4
+ 2023-04-05 12:04:35,230 INFO [train.py:984] (0/4) About to create model
5
+ 2023-04-05 12:04:35,880 INFO [zipformer.py:178] (0/4) At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2023-04-05 12:04:35,895 INFO [train.py:988] (0/4) Number of model parameters: 70369391
7
+ 2023-04-05 12:04:36,636 INFO [checkpoint.py:112] (0/4) Loading checkpoint from pruned_transducer_stateless7_bbpe/exp/epoch-27.pt
8
+ 2023-04-05 12:04:37,992 INFO [checkpoint.py:131] (0/4) Loading averaged model
9
+ 2023-04-05 12:04:42,872 INFO [train.py:1003] (0/4) Using DDP
10
+ 2023-04-05 12:04:44,595 INFO [train.py:1020] (0/4) Loading optimizer state dict
11
+ 2023-04-05 12:04:45,328 INFO [train.py:1028] (0/4) Loading scheduler state dict
12
+ 2023-04-05 12:04:45,328 INFO [asr_datamodule.py:365] (0/4) About to get train cuts
13
+ 2023-04-05 12:04:45,331 INFO [asr_datamodule.py:196] (0/4) About to get Musan cuts
exp/log/log-train-2023-04-05-12-04-35-1 ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-04-05 12:04:35,240 INFO [train.py:963] (1/4) Training started
2
+ 2023-04-05 12:04:35,240 INFO [train.py:973] (1/4) Device: cuda:1
3
+ 2023-04-05 12:04:35,243 INFO [train.py:982] (1/4) {'frame_shift_ms': 10.0, 'allowed_excess_duration_ratio': 0.1, 'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.23.4', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '9426c9f730820d291f5dcb06be337662595fa7b4', 'k2-git-date': 'Sun Feb 5 17:35:01 2023', 'lhotse-version': '1.13.0.dev+git.4cbd1bde.clean', 'torch-version': '1.10.0+cu102', 'torch-cuda-available': True, 'torch-cuda-version': '10.2', 'python-version': '3.8', 'icefall-git-branch': 'bbpe', 'icefall-git-sha1': 'a7e0d24-dirty', 'icefall-git-date': 'Tue Mar 28 18:53:54 2023', 'icefall-path': '/ceph-kw/kangwei/code/icefall_bbpe_aishell', 'k2-path': '/ceph-hw/kangwei/code/k2_release/k2/k2/python/k2/__init__.py', 'lhotse-path': '/ceph-hw/kangwei/dev_tools/anaconda3/envs/rnnt2/lib/python3.8/site-packages/lhotse-1.13.0.dev0+git.4cbd1bde.clean-py3.8.egg/lhotse/__init__.py', 'hostname': 'de-74279-k2-train-3-1220120619-7695ff496b-s9n4w', 'IP address': '10.177.6.147'}, 'world_size': 4, 'master_port': 12535, 'tensorboard': True, 'num_epochs': 50, 'start_epoch': 28, 'start_batch': 0, 'exp_dir': PosixPath('pruned_transducer_stateless7_bbpe/exp'), 'bbpe_model': 'data/lang_bbpe_500/bbpe.model', 'base_lr': 0.05, 'lr_batches': 5000, 'lr_epochs': 7.0, 'context_size': 2, 'prune_range': 5, 'lm_scale': 0.25, 'am_scale': 0.0, 'simple_loss_scale': 0.5, 'seed': 42, 'print_diagnostics': False, 'inf_check': False, 'save_every_n': 2000, 'keep_last_k': 30, 'average_period': 200, 'use_fp16': True, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 800, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'blank_id': 0, 'vocab_size': 500}
4
+ 2023-04-05 12:04:35,243 INFO [train.py:984] (1/4) About to create model
5
+ 2023-04-05 12:04:35,898 INFO [zipformer.py:178] (1/4) At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2023-04-05 12:04:35,914 INFO [train.py:988] (1/4) Number of model parameters: 70369391
7
+ 2023-04-05 12:04:35,914 INFO [checkpoint.py:112] (1/4) Loading checkpoint from pruned_transducer_stateless7_bbpe/exp/epoch-27.pt
8
+ 2023-04-05 12:04:42,542 INFO [train.py:1003] (1/4) Using DDP
9
+ 2023-04-05 12:04:44,598 INFO [train.py:1020] (1/4) Loading optimizer state dict
10
+ 2023-04-05 12:04:45,338 INFO [train.py:1028] (1/4) Loading scheduler state dict
11
+ 2023-04-05 12:04:45,338 INFO [asr_datamodule.py:365] (1/4) About to get train cuts
12
+ 2023-04-05 12:04:45,340 INFO [asr_datamodule.py:196] (1/4) About to get Musan cuts
exp/log/log-train-2023-04-05-12-04-35-2 ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-04-05 12:04:35,237 INFO [train.py:963] (2/4) Training started
2
+ 2023-04-05 12:04:35,238 INFO [train.py:973] (2/4) Device: cuda:2
3
+ 2023-04-05 12:04:35,240 INFO [train.py:982] (2/4) {'frame_shift_ms': 10.0, 'allowed_excess_duration_ratio': 0.1, 'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.23.4', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '9426c9f730820d291f5dcb06be337662595fa7b4', 'k2-git-date': 'Sun Feb 5 17:35:01 2023', 'lhotse-version': '1.13.0.dev+git.4cbd1bde.clean', 'torch-version': '1.10.0+cu102', 'torch-cuda-available': True, 'torch-cuda-version': '10.2', 'python-version': '3.8', 'icefall-git-branch': 'bbpe', 'icefall-git-sha1': 'a7e0d24-dirty', 'icefall-git-date': 'Tue Mar 28 18:53:54 2023', 'icefall-path': '/ceph-kw/kangwei/code/icefall_bbpe_aishell', 'k2-path': '/ceph-hw/kangwei/code/k2_release/k2/k2/python/k2/__init__.py', 'lhotse-path': '/ceph-hw/kangwei/dev_tools/anaconda3/envs/rnnt2/lib/python3.8/site-packages/lhotse-1.13.0.dev0+git.4cbd1bde.clean-py3.8.egg/lhotse/__init__.py', 'hostname': 'de-74279-k2-train-3-1220120619-7695ff496b-s9n4w', 'IP address': '10.177.6.147'}, 'world_size': 4, 'master_port': 12535, 'tensorboard': True, 'num_epochs': 50, 'start_epoch': 28, 'start_batch': 0, 'exp_dir': PosixPath('pruned_transducer_stateless7_bbpe/exp'), 'bbpe_model': 'data/lang_bbpe_500/bbpe.model', 'base_lr': 0.05, 'lr_batches': 5000, 'lr_epochs': 7.0, 'context_size': 2, 'prune_range': 5, 'lm_scale': 0.25, 'am_scale': 0.0, 'simple_loss_scale': 0.5, 'seed': 42, 'print_diagnostics': False, 'inf_check': False, 'save_every_n': 2000, 'keep_last_k': 30, 'average_period': 200, 'use_fp16': True, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 800, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'blank_id': 0, 'vocab_size': 500}
4
+ 2023-04-05 12:04:35,240 INFO [train.py:984] (2/4) About to create model
5
+ 2023-04-05 12:04:35,897 INFO [zipformer.py:178] (2/4) At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2023-04-05 12:04:35,913 INFO [train.py:988] (2/4) Number of model parameters: 70369391
7
+ 2023-04-05 12:04:35,913 INFO [checkpoint.py:112] (2/4) Loading checkpoint from pruned_transducer_stateless7_bbpe/exp/epoch-27.pt
8
+ 2023-04-05 12:04:42,067 INFO [train.py:1003] (2/4) Using DDP
9
+ 2023-04-05 12:04:44,599 INFO [train.py:1020] (2/4) Loading optimizer state dict
10
+ 2023-04-05 12:04:45,308 INFO [train.py:1028] (2/4) Loading scheduler state dict
11
+ 2023-04-05 12:04:45,309 INFO [asr_datamodule.py:365] (2/4) About to get train cuts
12
+ 2023-04-05 12:04:45,320 INFO [asr_datamodule.py:196] (2/4) About to get Musan cuts
exp/log/log-train-2023-04-05-12-04-35-3 ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ 2023-04-05 12:04:35,238 INFO [train.py:963] (3/4) Training started
2
+ 2023-04-05 12:04:35,238 INFO [train.py:973] (3/4) Device: cuda:3
3
+ 2023-04-05 12:04:35,240 INFO [train.py:982] (3/4) {'frame_shift_ms': 10.0, 'allowed_excess_duration_ratio': 0.1, 'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.23.4', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '9426c9f730820d291f5dcb06be337662595fa7b4', 'k2-git-date': 'Sun Feb 5 17:35:01 2023', 'lhotse-version': '1.13.0.dev+git.4cbd1bde.clean', 'torch-version': '1.10.0+cu102', 'torch-cuda-available': True, 'torch-cuda-version': '10.2', 'python-version': '3.8', 'icefall-git-branch': 'bbpe', 'icefall-git-sha1': 'a7e0d24-dirty', 'icefall-git-date': 'Tue Mar 28 18:53:54 2023', 'icefall-path': '/ceph-kw/kangwei/code/icefall_bbpe_aishell', 'k2-path': '/ceph-hw/kangwei/code/k2_release/k2/k2/python/k2/__init__.py', 'lhotse-path': '/ceph-hw/kangwei/dev_tools/anaconda3/envs/rnnt2/lib/python3.8/site-packages/lhotse-1.13.0.dev0+git.4cbd1bde.clean-py3.8.egg/lhotse/__init__.py', 'hostname': 'de-74279-k2-train-3-1220120619-7695ff496b-s9n4w', 'IP address': '10.177.6.147'}, 'world_size': 4, 'master_port': 12535, 'tensorboard': True, 'num_epochs': 50, 'start_epoch': 28, 'start_batch': 0, 'exp_dir': PosixPath('pruned_transducer_stateless7_bbpe/exp'), 'bbpe_model': 'data/lang_bbpe_500/bbpe.model', 'base_lr': 0.05, 'lr_batches': 5000, 'lr_epochs': 7.0, 'context_size': 2, 'prune_range': 5, 'lm_scale': 0.25, 'am_scale': 0.0, 'simple_loss_scale': 0.5, 'seed': 42, 'print_diagnostics': False, 'inf_check': False, 'save_every_n': 2000, 'keep_last_k': 30, 'average_period': 200, 'use_fp16': True, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 800, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'blank_id': 0, 'vocab_size': 500}
4
+ 2023-04-05 12:04:35,241 INFO [train.py:984] (3/4) About to create model
5
+ 2023-04-05 12:04:35,898 INFO [zipformer.py:178] (3/4) At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2023-04-05 12:04:35,913 INFO [train.py:988] (3/4) Number of model parameters: 70369391
7
+ 2023-04-05 12:04:35,914 INFO [checkpoint.py:112] (3/4) Loading checkpoint from pruned_transducer_stateless7_bbpe/exp/epoch-27.pt
8
+ 2023-04-05 12:04:44,367 INFO [train.py:1003] (3/4) Using DDP
9
+ 2023-04-05 12:04:44,606 INFO [train.py:1020] (3/4) Loading optimizer state dict
exp/pretrained.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b925f1c6d88ab2546884ce79241152dd35f66f763ed86e261718a2e856770717
3
+ size 281766253
exp/tensorboard/events.out.tfevents.1680080827.de-74279-k2-train-7-1218101249-5d97868c7c-v8ngc.593174.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:261e99ece7c6c068e7d547cf057a7a39e918a1f8e90c8e9338b6f6aae2da6f17
3
+ size 151490
exp/tensorboard/events.out.tfevents.1680142241.de-74279-k2-train-3-1220120619-7695ff496b-s9n4w.337604.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a30b706725ea9f722f40981370cd20e214cd1fb5c933f3f09751523436f2eef1
3
+ size 126190
exp/tensorboard/events.out.tfevents.1680667475.de-74279-k2-train-3-1220120619-7695ff496b-s9n4w.1376633.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2dab2c5ac1687f132fc7d59ad2ae6478045298492c7f3379a5462a481e1814d3
3
+ size 40
test_waves/BAC009S0764W0121.wav ADDED
Binary file (135 kB). View file
 
test_waves/BAC009S0764W0122.wav ADDED
Binary file (132 kB). View file
 
test_waves/BAC009S0764W0123.wav ADDED
Binary file (128 kB). View file
 
test_waves/trans.txt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ BAC009S0764W0121 甚至 出现 交易 几乎 停滞 的 情况
2
+ BAC009S0764W0122 一二 线 城市 虽然 也 处于 调整 中
3
+ BAC009S0764W0123 但 因为 聚集 了 过多 公共 资源