indiejoseph commited on
Commit
6df8852
1 Parent(s): 67af4b5

Upload folder using huggingface_hub

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false
7
+ }
README.md CHANGED
@@ -1,3 +1,127 @@
1
  ---
2
- license: cc-by-4.0
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ pipeline_tag: sentence-similarity
3
+ tags:
4
+ - sentence-transformers
5
+ - feature-extraction
6
+ - sentence-similarity
7
+ - transformers
8
+
9
  ---
10
+
11
+ # {MODEL_NAME}
12
+
13
+ This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
14
+
15
+ <!--- Describe your model here -->
16
+
17
+ ## Usage (Sentence-Transformers)
18
+
19
+ Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
20
+
21
+ ```
22
+ pip install -U sentence-transformers
23
+ ```
24
+
25
+ Then you can use the model like this:
26
+
27
+ ```python
28
+ from sentence_transformers import SentenceTransformer
29
+ sentences = ["This is an example sentence", "Each sentence is converted"]
30
+
31
+ model = SentenceTransformer('{MODEL_NAME}')
32
+ embeddings = model.encode(sentences)
33
+ print(embeddings)
34
+ ```
35
+
36
+
37
+
38
+ ## Usage (HuggingFace Transformers)
39
+ Without [sentence-transformers](https://www.SBERT.net), you can use the model like this: First, you pass your input through the transformer model, then you have to apply the right pooling-operation on-top of the contextualized word embeddings.
40
+
41
+ ```python
42
+ from transformers import AutoTokenizer, AutoModel
43
+ import torch
44
+
45
+
46
+ #Mean Pooling - Take attention mask into account for correct averaging
47
+ def mean_pooling(model_output, attention_mask):
48
+ token_embeddings = model_output[0] #First element of model_output contains all token embeddings
49
+ input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
50
+ return torch.sum(token_embeddings * input_mask_expanded, 1) / torch.clamp(input_mask_expanded.sum(1), min=1e-9)
51
+
52
+
53
+ # Sentences we want sentence embeddings for
54
+ sentences = ['This is an example sentence', 'Each sentence is converted']
55
+
56
+ # Load model from HuggingFace Hub
57
+ tokenizer = AutoTokenizer.from_pretrained('{MODEL_NAME}')
58
+ model = AutoModel.from_pretrained('{MODEL_NAME}')
59
+
60
+ # Tokenize sentences
61
+ encoded_input = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')
62
+
63
+ # Compute token embeddings
64
+ with torch.no_grad():
65
+ model_output = model(**encoded_input)
66
+
67
+ # Perform pooling. In this case, mean pooling.
68
+ sentence_embeddings = mean_pooling(model_output, encoded_input['attention_mask'])
69
+
70
+ print("Sentence embeddings:")
71
+ print(sentence_embeddings)
72
+ ```
73
+
74
+
75
+
76
+ ## Evaluation Results
77
+
78
+ <!--- Describe how your model was evaluated -->
79
+
80
+ For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name={MODEL_NAME})
81
+
82
+
83
+ ## Training
84
+ The model was trained with the parameters:
85
+
86
+ **DataLoader**:
87
+
88
+ `torch.utils.data.dataloader.DataLoader` of length 5745 with parameters:
89
+ ```
90
+ {'batch_size': 64, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
91
+ ```
92
+
93
+ **Loss**:
94
+
95
+ `sentence_transformers.losses.MSELoss.MSELoss`
96
+
97
+ Parameters of the fit()-Method:
98
+ ```
99
+ {
100
+ "epochs": 5,
101
+ "evaluation_steps": 1000,
102
+ "evaluator": "sentence_transformers.evaluation.SequentialEvaluator.SequentialEvaluator",
103
+ "max_grad_norm": 1,
104
+ "optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
105
+ "optimizer_params": {
106
+ "eps": 1e-06,
107
+ "lr": 2e-05
108
+ },
109
+ "scheduler": "WarmupLinear",
110
+ "steps_per_epoch": null,
111
+ "warmup_steps": 10000,
112
+ "weight_decay": 0.01
113
+ }
114
+ ```
115
+
116
+
117
+ ## Full Model Architecture
118
+ ```
119
+ SentenceTransformer(
120
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
121
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False})
122
+ )
123
+ ```
124
+
125
+ ## Citing & Authors
126
+
127
+ <!--- Describe where people can find more information -->
added_tokens.json ADDED
@@ -0,0 +1,502 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "㓟": 21530,
3
+ "㚻": 21315,
4
+ "㞘": 21533,
5
+ "㨃": 21571,
6
+ "㨘": 21409,
7
+ "㩒": 21330,
8
+ "㩧": 21428,
9
+ "㷫": 21197,
10
+ "䁪": 21492,
11
+ "䊦": 21310,
12
+ "䌫": 21605,
13
+ "䴉": 21417,
14
+ "丏": 21622,
15
+ "乸": 21146,
16
+ "亶": 21582,
17
+ "佮": 21305,
18
+ "偲": 21252,
19
+ "僆": 21340,
20
+ "僊": 21584,
21
+ "僞": 21555,
22
+ "儁": 21363,
23
+ "儇": 21346,
24
+ "兗": 21382,
25
+ "冑": 21594,
26
+ "冚": 21140,
27
+ "冧": 21138,
28
+ "剦": 21386,
29
+ "卌": 21385,
30
+ "卽": 21175,
31
+ "厓": 21484,
32
+ "吔": 21286,
33
+ "呔": 21179,
34
+ "咃": 21580,
35
+ "咇": 21439,
36
+ "哣": 21444,
37
+ "唂": 21235,
38
+ "唞": 21181,
39
+ "唥": 21151,
40
+ "唨": 21302,
41
+ "唪": 21159,
42
+ "唻": 21216,
43
+ "啋": 21525,
44
+ "啩": 21198,
45
+ "啹": 21321,
46
+ "喐": 21144,
47
+ "喼": 21283,
48
+ "嗌": 21130,
49
+ "嗍": 21332,
50
+ "嗱": 21189,
51
+ "嘥": 21149,
52
+ "噃": 21145,
53
+ "噅": 21413,
54
+ "噉": 21129,
55
+ "噍": 21371,
56
+ "噏": 21174,
57
+ "嚙": 21270,
58
+ "嚜": 21536,
59
+ "嚡": 21275,
60
+ "嚤": 21427,
61
+ "嚫": 21581,
62
+ "嚿": 21132,
63
+ "囘": 21389,
64
+ "坭": 21280,
65
+ "坼": 21599,
66
+ "埐": 21537,
67
+ "埞": 21196,
68
+ "埲": 21221,
69
+ "堊": 21225,
70
+ "塱": 21195,
71
+ "塹": 21309,
72
+ "塽": 21426,
73
+ "壙": 21514,
74
+ "夀": 21559,
75
+ "奀": 21285,
76
+ "奭": 21423,
77
+ "姖": 21467,
78
+ "娸": 21368,
79
+ "婄": 21493,
80
+ "媺": 21353,
81
+ "嫽": 21379,
82
+ "嬋": 21422,
83
+ "嬲": 21153,
84
+ "孭": 21178,
85
+ "孲": 21204,
86
+ "孻": 21217,
87
+ "尐": 21155,
88
+ "屘": 21193,
89
+ "屙": 21177,
90
+ "岃": 21618,
91
+ "岋": 21578,
92
+ "岜": 21542,
93
+ "崢": 21358,
94
+ "嶠": 21436,
95
+ "巉": 21366,
96
+ "巹": 21625,
97
+ "幗": 21237,
98
+ "幪": 21266,
99
+ "廄": 21397,
100
+ "廩": 21603,
101
+ "廸": 21317,
102
+ "徂": 21148,
103
+ "怐": 21499,
104
+ "惲": 21545,
105
+ "愔": 21607,
106
+ "愨": 21520,
107
+ "慤": 21240,
108
+ "懽": 21573,
109
+ "戇": 21201,
110
+ "戙": 21258,
111
+ "戥": 21188,
112
+ "扠": 21523,
113
+ "扤": 21418,
114
+ "扻": 21527,
115
+ "扽": 21288,
116
+ "抌": 21265,
117
+ "拃": 21182,
118
+ "拏": 21273,
119
+ "挐": 21255,
120
+ "捵": 21359,
121
+ "捹": 21388,
122
+ "捽": 21325,
123
+ "掅": 21503,
124
+ "掕": 21202,
125
+ "掗": 21261,
126
+ "掟": 21161,
127
+ "掯": 21522,
128
+ "掹": 21290,
129
+ "揈": 21251,
130
+ "揗": 21212,
131
+ "揞": 21342,
132
+ "揦": 21229,
133
+ "揳": 21627,
134
+ "揼": 21184,
135
+ "揾": 21165,
136
+ "搣": 21219,
137
+ "搦": 21304,
138
+ "搲": 21257,
139
+ "搾": 21513,
140
+ "摑": 21561,
141
+ "摵": 21446,
142
+ "摷": 21296,
143
+ "摼": 21587,
144
+ "撠": 21333,
145
+ "撳": 21176,
146
+ "撾": 21141,
147
+ "擗": 21488,
148
+ "擝": 21352,
149
+ "擳": 21377,
150
+ "擸": 21218,
151
+ "攋": 21277,
152
+ "攰": 21226,
153
+ "攴": 21442,
154
+ "攷": 21128,
155
+ "旚": 21248,
156
+ "旯": 21616,
157
+ "旼": 21475,
158
+ "昃": 21593,
159
+ "昪": 21604,
160
+ "昰": 21367,
161
+ "昺": 21239,
162
+ "曱": 21247,
163
+ "曺": 21360,
164
+ "朊": 21504,
165
+ "枴": 21432,
166
+ "柊": 21343,
167
+ "栢": 21185,
168
+ "桴": 21544,
169
+ "梘": 21200,
170
+ "棖": 21477,
171
+ "棯": 21551,
172
+ "椏": 21271,
173
+ "椥": 21619,
174
+ "椴": 21464,
175
+ "榎": 21624,
176
+ "樖": 21143,
177
+ "樘": 21623,
178
+ "樨": 21324,
179
+ "橈": 21546,
180
+ "橛": 21167,
181
+ "櫈": 21186,
182
+ "櫟": 21364,
183
+ "櫳": 21365,
184
+ "殮": 21194,
185
+ "殻": 21576,
186
+ "殽": 21612,
187
+ "毬": 21596,
188
+ "氂": 21517,
189
+ "氘": 21506,
190
+ "氚": 21404,
191
+ "氼": 21244,
192
+ "沚": 21345,
193
+ "泂": 21260,
194
+ "淝": 21512,
195
+ "淥": 21263,
196
+ "淰": 21457,
197
+ "淸": 21448,
198
+ "湉": 21509,
199
+ "湼": 21347,
200
+ "溦": 21337,
201
+ "滘": 21160,
202
+ "漖": 21579,
203
+ "潁": 21291,
204
+ "潯": 21531,
205
+ "澂": 21445,
206
+ "澌": 21472,
207
+ "澠": 21592,
208
+ "濰": 21206,
209
+ "瀄": 21460,
210
+ "瀡": 21583,
211
+ "灕": 21314,
212
+ "炆": 21191,
213
+ "炩": 21569,
214
+ "烚": 21230,
215
+ "烴": 21154,
216
+ "焓": 21375,
217
+ "焫": 21264,
218
+ "煇": 21192,
219
+ "煠": 21344,
220
+ "煬": 21355,
221
+ "燶": 21326,
222
+ "牀": 21208,
223
+ "牘": 21539,
224
+ "犂": 21405,
225
+ "犛": 21518,
226
+ "猢": 21282,
227
+ "猻": 21243,
228
+ "獴": 21268,
229
+ "玗": 21485,
230
+ "珓": 21565,
231
+ "琤": 21566,
232
+ "琿": 21552,
233
+ "瑭": 21562,
234
+ "璘": 21440,
235
+ "璠": 21516,
236
+ "璣": 21322,
237
+ "瓘": 21407,
238
+ "瓚": 21535,
239
+ "甂": 21287,
240
+ "甑": 21558,
241
+ "甴": 21267,
242
+ "畧": 21381,
243
+ "疋": 21528,
244
+ "疎": 21620,
245
+ "痾": 21203,
246
+ "癆": 21316,
247
+ "癐": 21209,
248
+ "癩": 21474,
249
+ "睄": 21577,
250
+ "睚": 21540,
251
+ "睺": 21585,
252
+ "睼": 21395,
253
+ "砵": 21166,
254
+ "硃": 21429,
255
+ "硏": 21601,
256
+ "硤": 21170,
257
+ "碲": 21393,
258
+ "礐": 21613,
259
+ "礬": 21410,
260
+ "礮": 21281,
261
+ "禕": 21490,
262
+ "禤": 21408,
263
+ "稈": 21279,
264
+ "穏": 21420,
265
+ "窰": 21187,
266
+ "竈": 21470,
267
+ "竉": 21190,
268
+ "笊": 21621,
269
+ "笪": 21135,
270
+ "篋": 21508,
271
+ "篸": 21615,
272
+ "篾": 21323,
273
+ "簋": 21415,
274
+ "簒": 21241,
275
+ "簕": 21262,
276
+ "糭": 21207,
277
+ "糴": 21378,
278
+ "糶": 21549,
279
+ "紥": 21349,
280
+ "緡": 21276,
281
+ "縉": 21515,
282
+ "縞": 21339,
283
+ "繑": 21491,
284
+ "繙": 21183,
285
+ "繯": 21541,
286
+ "罅": 21164,
287
+ "罉": 21400,
288
+ "罘": 21483,
289
+ "罟": 21220,
290
+ "罨": 21383,
291
+ "羋": 21336,
292
+ "胐": 21289,
293
+ "胵": 21473,
294
+ "脧": 21311,
295
+ "脷": 21136,
296
+ "腍": 21210,
297
+ "膥": 21180,
298
+ "膶": 21223,
299
+ "舘": 21361,
300
+ "苴": 21463,
301
+ "茛": 21295,
302
+ "莨": 21447,
303
+ "菢": 21452,
304
+ "菫": 21500,
305
+ "菴": 21307,
306
+ "葶": 21150,
307
+ "蒴": 21519,
308
+ "蓀": 21611,
309
+ "蔴": 21168,
310
+ "蕓": 21312,
311
+ "薾": 21570,
312
+ "藪": 21301,
313
+ "藶": 21152,
314
+ "藺": 21402,
315
+ "蘄": 21478,
316
+ "蘅": 21563,
317
+ "蚺": 21392,
318
+ "蛉": 21543,
319
+ "蛺": 21233,
320
+ "蜑": 21425,
321
+ "蜞": 21507,
322
+ "蟧": 21494,
323
+ "蠄": 21435,
324
+ "蠏": 21465,
325
+ "裇": 21224,
326
+ "褦": 21211,
327
+ "褸": 21171,
328
+ "觚": 21387,
329
+ "觜": 21294,
330
+ "詏": 21228,
331
+ "諤": 21376,
332
+ "謚": 21172,
333
+ "謳": 21370,
334
+ "谿": 21399,
335
+ "豸": 21521,
336
+ "貍": 21412,
337
+ "贇": 21556,
338
+ "趯": 21297,
339
+ "趲": 21595,
340
+ "趷": 21232,
341
+ "跣": 21306,
342
+ "踎": 21259,
343
+ "踭": 21173,
344
+ "躄": 21137,
345
+ "躝": 21313,
346
+ "軚": 21250,
347
+ "軛": 21357,
348
+ "軫": 21231,
349
+ "輋": 21169,
350
+ "輦": 21602,
351
+ "轤": 21547,
352
+ "迆": 21338,
353
+ "逑": 21610,
354
+ "逳": 21449,
355
+ "郟": 21617,
356
+ "鄕": 21498,
357
+ "鄴": 21450,
358
+ "醂": 21391,
359
+ "釤": 21560,
360
+ "釩": 21588,
361
+ "釹": 21529,
362
+ "鈁": 21511,
363
+ "鈧": 21590,
364
+ "鈮": 21554,
365
+ "鈰": 21534,
366
+ "鈷": 21482,
367
+ "鈸": 21495,
368
+ "鈹": 21586,
369
+ "鈿": 21557,
370
+ "鉈": 21430,
371
+ "鉋": 21591,
372
+ "鉍": 21318,
373
+ "鉎": 21606,
374
+ "鉬": 21348,
375
+ "鉭": 21496,
376
+ "鉸": 21222,
377
+ "銣": 21419,
378
+ "銦": 21394,
379
+ "銨": 21303,
380
+ "銫": 21356,
381
+ "銲": 21489,
382
+ "銻": 21319,
383
+ "銼": 21245,
384
+ "鋇": 21398,
385
+ "鋨": 21548,
386
+ "鋯": 21487,
387
+ "鋹": 21437,
388
+ "錒": 21236,
389
+ "錕": 21572,
390
+ "錡": 21242,
391
+ "鍔": 21471,
392
+ "鍬": 21461,
393
+ "鍶": 21278,
394
+ "鍼": 21550,
395
+ "鎅": 21328,
396
+ "鎘": 21469,
397
+ "鎢": 21411,
398
+ "鎵": 21486,
399
+ "鏇": 21510,
400
+ "鏌": 21396,
401
+ "鏐": 21481,
402
+ "鏵": 21597,
403
+ "鐖": 21403,
404
+ "鐙": 21532,
405
+ "鑌": 21589,
406
+ "鑪": 21574,
407
+ "鑭": 21249,
408
+ "閂": 21163,
409
+ "閆": 21526,
410
+ "閪": 21284,
411
+ "閬": 21380,
412
+ "閭": 21480,
413
+ "闐": 21205,
414
+ "闓": 21466,
415
+ "闞": 21497,
416
+ "隗": 21608,
417
+ "鞮": 21600,
418
+ "韃": 21213,
419
+ "韙": 21454,
420
+ "韞": 21341,
421
+ "韮": 21501,
422
+ "頊": 21384,
423
+ "頴": 21234,
424
+ "顓": 21524,
425
+ "顥": 21308,
426
+ "顳": 21327,
427
+ "颮": 21575,
428
+ "餬": 21254,
429
+ "餸": 21139,
430
+ "馱": 21401,
431
+ "駟": 21456,
432
+ "駢": 21479,
433
+ "騤": 21468,
434
+ "騫": 21156,
435
+ "騮": 21158,
436
+ "騾": 21458,
437
+ "驃": 21335,
438
+ "驄": 21300,
439
+ "驤": 21538,
440
+ "骱": 21567,
441
+ "骹": 21256,
442
+ "髀": 21147,
443
+ "髁": 21614,
444
+ "髙": 21421,
445
+ "髧": 21374,
446
+ "髹": 21334,
447
+ "鬅": 21246,
448
+ "鬭": 21354,
449
+ "魨": 21238,
450
+ "魴": 21434,
451
+ "鮋": 21416,
452
+ "鮓": 21350,
453
+ "鮟": 21134,
454
+ "鮫": 21476,
455
+ "鯁": 21351,
456
+ "鯇": 21455,
457
+ "鯡": 21406,
458
+ "鯥": 21441,
459
+ "鯪": 21227,
460
+ "鯭": 21269,
461
+ "鯷": 21553,
462
+ "鰂": 21157,
463
+ "鰨": 21451,
464
+ "鰹": 21331,
465
+ "鱇": 21133,
466
+ "鱒": 21424,
467
+ "鱟": 21502,
468
+ "鱲": 21199,
469
+ "鳧": 21568,
470
+ "鴒": 21431,
471
+ "鴞": 21215,
472
+ "鴟": 21453,
473
+ "鴣": 21414,
474
+ "鴴": 21214,
475
+ "鵐": 21253,
476
+ "鵞": 21362,
477
+ "鵪": 21505,
478
+ "鵯": 21564,
479
+ "鶇": 21369,
480
+ "鶉": 21373,
481
+ "鶲": 21459,
482
+ "鶺": 21438,
483
+ "鶿": 21299,
484
+ "鷂": 21390,
485
+ "鷄": 21162,
486
+ "鷈": 21293,
487
+ "鷓": 21433,
488
+ "鷸": 21329,
489
+ "鷿": 21292,
490
+ "鸌": 21609,
491
+ "鸏": 21598,
492
+ "鸕": 21298,
493
+ "鸛": 21274,
494
+ "麪": 21131,
495
+ "黐": 21142,
496
+ "鼆": 21272,
497
+ "鼇": 21443,
498
+ "鼩": 21626,
499
+ "龑": 21372,
500
+ "龠": 21462,
501
+ "龢": 21320
502
+ }
config.json ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "indiejoseph/bert-base-cantonese",
3
+ "architectures": [
4
+ "BertModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "classifier_dropout": null,
8
+ "directionality": "bidi",
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 3072,
14
+ "layer_norm_eps": 1e-12,
15
+ "max_position_embeddings": 512,
16
+ "model_type": "bert",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 12,
19
+ "pad_token_id": 0,
20
+ "pooler_fc_size": 768,
21
+ "pooler_num_attention_heads": 12,
22
+ "pooler_num_fc_layers": 3,
23
+ "pooler_size_per_head": 128,
24
+ "pooler_type": "first_token_transform",
25
+ "position_embedding_type": "absolute",
26
+ "torch_dtype": "float32",
27
+ "transformers_version": "4.34.0.dev0",
28
+ "type_vocab_size": 2,
29
+ "use_cache": true,
30
+ "vocab_size": 21628
31
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "2.2.2",
4
+ "transformers": "4.34.0.dev0",
5
+ "pytorch": "2.0.1+cu117"
6
+ }
7
+ }
eval/mse_evaluation_sbert_bilingual_test_results.csv ADDED
@@ -0,0 +1,87 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,MSE
2
+ 0,1000,14.161652326583862
3
+ 0,-1,13.561658561229706
4
+ 1,1000,10.426171869039536
5
+ 1,-1,10.175418853759766
6
+ 2,1000,8.463533967733383
7
+ 2,-1,8.304891735315323
8
+ 3,1000,7.1127355098724365
9
+ 3,-1,7.011887431144714
10
+ 4,1000,6.137048825621605
11
+ 4,-1,6.04826807975769
12
+ 0,1000,5.897531285881996
13
+ 0,-1,5.874970182776451
14
+ 1,1000,5.675851926207542
15
+ 1,-1,5.644375830888748
16
+ 2,1000,5.3504932671785355
17
+ 2,-1,5.322911962866783
18
+ 3,1000,5.003084614872932
19
+ 3,-1,4.979576542973518
20
+ 4,1000,4.652346670627594
21
+ 4,-1,4.632246121764183
22
+ 0,1000,15.276926755905151
23
+ 0,-1,14.773666858673096
24
+ 1,1000,12.426591664552689
25
+ 1,-1,12.224958091974258
26
+ 2,1000,10.940007120370865
27
+ 2,-1,10.81506609916687
28
+ 3,1000,9.898988157510757
29
+ 3,-1,9.807772189378738
30
+ 4,1000,9.155871719121933
31
+ 4,-1,9.088793396949768
32
+ 0,1000,14.144869148731232
33
+ 0,-1,13.555073738098145
34
+ 1,1000,10.409367829561234
35
+ 1,-1,10.17189621925354
36
+ 2,1000,8.456149697303772
37
+ 2,-1,8.295857161283493
38
+ 3,1000,7.113900780677795
39
+ 3,-1,6.9980815052986145
40
+ 4,1000,6.125354394316673
41
+ 4,-1,6.053166836500168
42
+ 0,1000,5.895005539059639
43
+ 0,-1,5.8802105486392975
44
+ 1,1000,5.673418566584587
45
+ 1,-1,5.642145499587059
46
+ 0,1000,5.5615052580833435
47
+ 0,-1,5.546179041266441
48
+ 1,1000,5.383569374680519
49
+ 1,-1,5.351389572024345
50
+ 2,1000,5.117785930633545
51
+ 2,-1,5.089312791824341
52
+ 0,1000,5.007679387927055
53
+ 0,-1,4.9951184540987015
54
+ 1,1000,4.883035644888878
55
+ 1,-1,4.859518259763718
56
+ 0,1000,17.446313798427582
57
+ 0,2000,13.509276509284973
58
+ 0,1000,16.489487886428833
59
+ 0,2000,13.311152160167694
60
+ 0,3000,11.23809814453125
61
+ 0,4000,9.722928702831268
62
+ 0,5000,8.600396662950516
63
+ 0,-1,7.951334118843079
64
+ 1,1000,7.195384055376053
65
+ 1,2000,6.626403331756592
66
+ 1,3000,6.137128546833992
67
+ 1,4000,5.733538791537285
68
+ 1,5000,5.38359172642231
69
+ 1,-1,5.173630639910698
70
+ 2,1000,4.955727607011795
71
+ 2,2000,4.767784848809242
72
+ 2,3000,4.621163010597229
73
+ 2,4000,4.469508305191994
74
+ 2,5000,4.379042237997055
75
+ 2,-1,4.302250221371651
76
+ 3,1000,4.202662035822868
77
+ 3,2000,4.131331667304039
78
+ 3,3000,4.070168361067772
79
+ 3,4000,4.010483622550964
80
+ 3,5000,3.9553701877593994
81
+ 3,-1,3.925058990716934
82
+ 4,1000,3.8913611322641373
83
+ 4,2000,3.8557831197977066
84
+ 4,3000,3.833599016070366
85
+ 4,4000,3.810567408800125
86
+ 4,5000,3.7947382777929306
87
+ 4,-1,3.791682794690132
eval/translation_evaluation_sbert_bilingual_test_results.csv ADDED
@@ -0,0 +1,87 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,src2trg,trg2src
2
+ 0,1000,0.983691431529899,0.9958581413409268
3
+ 0,-1,0.9842091638622832,0.9958581413409268
4
+ 1,1000,0.9958581413409268,0.996634739839503
5
+ 1,-1,0.9961170075071188,0.996634739839503
6
+ 2,1000,0.9961170075071188,0.9968936060056951
7
+ 2,-1,0.9963758736733109,0.9968936060056951
8
+ 3,1000,0.9963758736733109,0.9968936060056951
9
+ 3,-1,0.996634739839503,0.9968936060056951
10
+ 4,1000,0.9963758736733109,0.9968936060056951
11
+ 4,-1,0.9961170075071188,0.9968936060056951
12
+ 0,1000,0.9961170075071188,0.9968936060056951
13
+ 0,-1,0.9963758736733109,0.9968936060056951
14
+ 1,1000,0.996634739839503,0.9968936060056951
15
+ 1,-1,0.9963758736733109,0.9968936060056951
16
+ 2,1000,0.9968936060056951,0.9968936060056951
17
+ 2,-1,0.996634739839503,0.9968936060056951
18
+ 3,1000,0.996634739839503,0.996634739839503
19
+ 3,-1,0.9968936060056951,0.996634739839503
20
+ 4,1000,0.9968936060056951,0.996634739839503
21
+ 4,-1,0.996634739839503,0.9968936060056951
22
+ 0,1000,0.9862800931918199,0.9930106135128138
23
+ 0,-1,0.9886098886875485,0.9950815428423505
24
+ 1,1000,0.996634739839503,0.9968936060056951
25
+ 1,-1,0.996634739839503,0.9968936060056951
26
+ 2,1000,0.996634739839503,0.9968936060056951
27
+ 2,-1,0.996634739839503,0.9968936060056951
28
+ 3,1000,0.9968936060056951,0.9968936060056951
29
+ 3,-1,0.996634739839503,0.9968936060056951
30
+ 4,1000,0.9968936060056951,0.9968936060056951
31
+ 4,-1,0.9968936060056951,0.996634739839503
32
+ 0,1000,0.9839502976960911,0.9958581413409268
33
+ 0,-1,0.9844680300284753,0.9955992751747347
34
+ 1,1000,0.9961170075071188,0.996634739839503
35
+ 1,-1,0.9961170075071188,0.996634739839503
36
+ 2,1000,0.996634739839503,0.9968936060056951
37
+ 2,-1,0.9963758736733109,0.9968936060056951
38
+ 3,1000,0.9963758736733109,0.9968936060056951
39
+ 3,-1,0.9963758736733109,0.9968936060056951
40
+ 4,1000,0.996634739839503,0.9968936060056951
41
+ 4,-1,0.9968936060056951,0.9968936060056951
42
+ 0,1000,0.9963758736733109,0.9968936060056951
43
+ 0,-1,0.9968936060056951,0.9968936060056951
44
+ 1,1000,0.9968936060056951,0.9968936060056951
45
+ 1,-1,0.9968936060056951,0.9968936060056951
46
+ 0,1000,0.996634739839503,0.9968936060056951
47
+ 0,-1,0.9968936060056951,0.996634739839503
48
+ 1,1000,0.996634739839503,0.9968936060056951
49
+ 1,-1,0.9968936060056951,0.996634739839503
50
+ 2,1000,0.996634739839503,0.9968936060056951
51
+ 2,-1,0.9971524721718872,0.996634739839503
52
+ 0,1000,0.996634739839503,0.996634739839503
53
+ 0,-1,0.9968936060056951,0.996634739839503
54
+ 1,1000,0.9968936060056951,0.996634739839503
55
+ 1,-1,0.9971524721718872,0.996634739839503
56
+ 0,1000,0.9855899277037329,0.9868686371907737
57
+ 0,2000,0.989524418433089,0.9905080411154281
58
+ 0,1000,0.9882104435820602,0.9903227391069411
59
+ 0,2000,0.9908139706243553,0.9920911725696321
60
+ 0,3000,0.9928280198457533,0.9931227587562018
61
+ 0,4000,0.9931227587562018,0.9933683745149089
62
+ 0,5000,0.9935157439701331,0.9935648671218745
63
+ 0,-1,0.9934174976666503,0.9940560986392887
64
+ 1,1000,0.9939087291840645,0.9944982070049614
65
+ 1,2000,0.9943017143979958,0.9945473301567028
66
+ 1,3000,0.9943999607014786,0.9945473301567028
67
+ 1,4000,0.9946455764601857,0.9949403153706342
68
+ 1,5000,0.9948420690671513,0.9951859311293413
69
+ 1,-1,0.9948911922188928,0.9952350542810827
70
+ 2,1000,0.9949894385223756,0.995038561674117
71
+ 2,2000,0.9952350542810827,0.9951368079775998
72
+ 2,3000,0.9951368079775998,0.9951859311293413
73
+ 2,4000,0.9951859311293413,0.9949894385223756
74
+ 2,5000,0.9954806700397898,0.9951368079775998
75
+ 2,-1,0.9952350542810827,0.995038561674117
76
+ 3,1000,0.9953333005845655,0.9950876848258584
77
+ 3,2000,0.9954315468880484,0.995284177432824
78
+ 3,3000,0.9956771626467554,0.9951859311293413
79
+ 3,4000,0.9953824237363069,0.9950876848258584
80
+ 3,5000,0.9954806700397898,0.9952350542810827
81
+ 3,-1,0.9954806700397898,0.995284177432824
82
+ 4,1000,0.9956771626467554,0.9951859311293413
83
+ 4,2000,0.9954315468880484,0.9953333005845655
84
+ 4,3000,0.9953824237363069,0.995038561674117
85
+ 4,4000,0.9954315468880484,0.995284177432824
86
+ 4,5000,0.9954315468880484,0.9952350542810827
87
+ 4,-1,0.9953333005845655,0.9952350542810827
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea4cfb34748c3f38517e2167f631bc54f83d0b4d964e9677c375a7af17971a57
3
+ size 410673321
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": "[CLS]",
3
+ "mask_token": "[MASK]",
4
+ "pad_token": "[PAD]",
5
+ "sep_token": "[SEP]",
6
+ "unk_token": "[UNK]"
7
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,4063 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ },
43
+ "21128": {
44
+ "content": "攷",
45
+ "lstrip": false,
46
+ "normalized": true,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": false
50
+ },
51
+ "21129": {
52
+ "content": "噉",
53
+ "lstrip": false,
54
+ "normalized": true,
55
+ "rstrip": false,
56
+ "single_word": false,
57
+ "special": false
58
+ },
59
+ "21130": {
60
+ "content": "嗌",
61
+ "lstrip": false,
62
+ "normalized": true,
63
+ "rstrip": false,
64
+ "single_word": false,
65
+ "special": false
66
+ },
67
+ "21131": {
68
+ "content": "麪",
69
+ "lstrip": false,
70
+ "normalized": true,
71
+ "rstrip": false,
72
+ "single_word": false,
73
+ "special": false
74
+ },
75
+ "21132": {
76
+ "content": "嚿",
77
+ "lstrip": false,
78
+ "normalized": true,
79
+ "rstrip": false,
80
+ "single_word": false,
81
+ "special": false
82
+ },
83
+ "21133": {
84
+ "content": "鱇",
85
+ "lstrip": false,
86
+ "normalized": true,
87
+ "rstrip": false,
88
+ "single_word": false,
89
+ "special": false
90
+ },
91
+ "21134": {
92
+ "content": "鮟",
93
+ "lstrip": false,
94
+ "normalized": true,
95
+ "rstrip": false,
96
+ "single_word": false,
97
+ "special": false
98
+ },
99
+ "21135": {
100
+ "content": "笪",
101
+ "lstrip": false,
102
+ "normalized": true,
103
+ "rstrip": false,
104
+ "single_word": false,
105
+ "special": false
106
+ },
107
+ "21136": {
108
+ "content": "脷",
109
+ "lstrip": false,
110
+ "normalized": true,
111
+ "rstrip": false,
112
+ "single_word": false,
113
+ "special": false
114
+ },
115
+ "21137": {
116
+ "content": "躄",
117
+ "lstrip": false,
118
+ "normalized": true,
119
+ "rstrip": false,
120
+ "single_word": false,
121
+ "special": false
122
+ },
123
+ "21138": {
124
+ "content": "冧",
125
+ "lstrip": false,
126
+ "normalized": true,
127
+ "rstrip": false,
128
+ "single_word": false,
129
+ "special": false
130
+ },
131
+ "21139": {
132
+ "content": "餸",
133
+ "lstrip": false,
134
+ "normalized": true,
135
+ "rstrip": false,
136
+ "single_word": false,
137
+ "special": false
138
+ },
139
+ "21140": {
140
+ "content": "冚",
141
+ "lstrip": false,
142
+ "normalized": true,
143
+ "rstrip": false,
144
+ "single_word": false,
145
+ "special": false
146
+ },
147
+ "21141": {
148
+ "content": "撾",
149
+ "lstrip": false,
150
+ "normalized": true,
151
+ "rstrip": false,
152
+ "single_word": false,
153
+ "special": false
154
+ },
155
+ "21142": {
156
+ "content": "黐",
157
+ "lstrip": false,
158
+ "normalized": true,
159
+ "rstrip": false,
160
+ "single_word": false,
161
+ "special": false
162
+ },
163
+ "21143": {
164
+ "content": "樖",
165
+ "lstrip": false,
166
+ "normalized": true,
167
+ "rstrip": false,
168
+ "single_word": false,
169
+ "special": false
170
+ },
171
+ "21144": {
172
+ "content": "喐",
173
+ "lstrip": false,
174
+ "normalized": true,
175
+ "rstrip": false,
176
+ "single_word": false,
177
+ "special": false
178
+ },
179
+ "21145": {
180
+ "content": "噃",
181
+ "lstrip": false,
182
+ "normalized": true,
183
+ "rstrip": false,
184
+ "single_word": false,
185
+ "special": false
186
+ },
187
+ "21146": {
188
+ "content": "乸",
189
+ "lstrip": false,
190
+ "normalized": true,
191
+ "rstrip": false,
192
+ "single_word": false,
193
+ "special": false
194
+ },
195
+ "21147": {
196
+ "content": "髀",
197
+ "lstrip": false,
198
+ "normalized": true,
199
+ "rstrip": false,
200
+ "single_word": false,
201
+ "special": false
202
+ },
203
+ "21148": {
204
+ "content": "徂",
205
+ "lstrip": false,
206
+ "normalized": true,
207
+ "rstrip": false,
208
+ "single_word": false,
209
+ "special": false
210
+ },
211
+ "21149": {
212
+ "content": "嘥",
213
+ "lstrip": false,
214
+ "normalized": true,
215
+ "rstrip": false,
216
+ "single_word": false,
217
+ "special": false
218
+ },
219
+ "21150": {
220
+ "content": "葶",
221
+ "lstrip": false,
222
+ "normalized": true,
223
+ "rstrip": false,
224
+ "single_word": false,
225
+ "special": false
226
+ },
227
+ "21151": {
228
+ "content": "唥",
229
+ "lstrip": false,
230
+ "normalized": true,
231
+ "rstrip": false,
232
+ "single_word": false,
233
+ "special": false
234
+ },
235
+ "21152": {
236
+ "content": "藶",
237
+ "lstrip": false,
238
+ "normalized": true,
239
+ "rstrip": false,
240
+ "single_word": false,
241
+ "special": false
242
+ },
243
+ "21153": {
244
+ "content": "嬲",
245
+ "lstrip": false,
246
+ "normalized": true,
247
+ "rstrip": false,
248
+ "single_word": false,
249
+ "special": false
250
+ },
251
+ "21154": {
252
+ "content": "烴",
253
+ "lstrip": false,
254
+ "normalized": true,
255
+ "rstrip": false,
256
+ "single_word": false,
257
+ "special": false
258
+ },
259
+ "21155": {
260
+ "content": "尐",
261
+ "lstrip": false,
262
+ "normalized": true,
263
+ "rstrip": false,
264
+ "single_word": false,
265
+ "special": false
266
+ },
267
+ "21156": {
268
+ "content": "騫",
269
+ "lstrip": false,
270
+ "normalized": true,
271
+ "rstrip": false,
272
+ "single_word": false,
273
+ "special": false
274
+ },
275
+ "21157": {
276
+ "content": "鰂",
277
+ "lstrip": false,
278
+ "normalized": true,
279
+ "rstrip": false,
280
+ "single_word": false,
281
+ "special": false
282
+ },
283
+ "21158": {
284
+ "content": "騮",
285
+ "lstrip": false,
286
+ "normalized": true,
287
+ "rstrip": false,
288
+ "single_word": false,
289
+ "special": false
290
+ },
291
+ "21159": {
292
+ "content": "唪",
293
+ "lstrip": false,
294
+ "normalized": true,
295
+ "rstrip": false,
296
+ "single_word": false,
297
+ "special": false
298
+ },
299
+ "21160": {
300
+ "content": "滘",
301
+ "lstrip": false,
302
+ "normalized": true,
303
+ "rstrip": false,
304
+ "single_word": false,
305
+ "special": false
306
+ },
307
+ "21161": {
308
+ "content": "掟",
309
+ "lstrip": false,
310
+ "normalized": true,
311
+ "rstrip": false,
312
+ "single_word": false,
313
+ "special": false
314
+ },
315
+ "21162": {
316
+ "content": "鷄",
317
+ "lstrip": false,
318
+ "normalized": true,
319
+ "rstrip": false,
320
+ "single_word": false,
321
+ "special": false
322
+ },
323
+ "21163": {
324
+ "content": "閂",
325
+ "lstrip": false,
326
+ "normalized": true,
327
+ "rstrip": false,
328
+ "single_word": false,
329
+ "special": false
330
+ },
331
+ "21164": {
332
+ "content": "罅",
333
+ "lstrip": false,
334
+ "normalized": true,
335
+ "rstrip": false,
336
+ "single_word": false,
337
+ "special": false
338
+ },
339
+ "21165": {
340
+ "content": "揾",
341
+ "lstrip": false,
342
+ "normalized": true,
343
+ "rstrip": false,
344
+ "single_word": false,
345
+ "special": false
346
+ },
347
+ "21166": {
348
+ "content": "砵",
349
+ "lstrip": false,
350
+ "normalized": true,
351
+ "rstrip": false,
352
+ "single_word": false,
353
+ "special": false
354
+ },
355
+ "21167": {
356
+ "content": "橛",
357
+ "lstrip": false,
358
+ "normalized": true,
359
+ "rstrip": false,
360
+ "single_word": false,
361
+ "special": false
362
+ },
363
+ "21168": {
364
+ "content": "蔴",
365
+ "lstrip": false,
366
+ "normalized": true,
367
+ "rstrip": false,
368
+ "single_word": false,
369
+ "special": false
370
+ },
371
+ "21169": {
372
+ "content": "輋",
373
+ "lstrip": false,
374
+ "normalized": true,
375
+ "rstrip": false,
376
+ "single_word": false,
377
+ "special": false
378
+ },
379
+ "21170": {
380
+ "content": "硤",
381
+ "lstrip": false,
382
+ "normalized": true,
383
+ "rstrip": false,
384
+ "single_word": false,
385
+ "special": false
386
+ },
387
+ "21171": {
388
+ "content": "褸",
389
+ "lstrip": false,
390
+ "normalized": true,
391
+ "rstrip": false,
392
+ "single_word": false,
393
+ "special": false
394
+ },
395
+ "21172": {
396
+ "content": "謚",
397
+ "lstrip": false,
398
+ "normalized": true,
399
+ "rstrip": false,
400
+ "single_word": false,
401
+ "special": false
402
+ },
403
+ "21173": {
404
+ "content": "踭",
405
+ "lstrip": false,
406
+ "normalized": true,
407
+ "rstrip": false,
408
+ "single_word": false,
409
+ "special": false
410
+ },
411
+ "21174": {
412
+ "content": "噏",
413
+ "lstrip": false,
414
+ "normalized": true,
415
+ "rstrip": false,
416
+ "single_word": false,
417
+ "special": false
418
+ },
419
+ "21175": {
420
+ "content": "卽",
421
+ "lstrip": false,
422
+ "normalized": true,
423
+ "rstrip": false,
424
+ "single_word": false,
425
+ "special": false
426
+ },
427
+ "21176": {
428
+ "content": "撳",
429
+ "lstrip": false,
430
+ "normalized": true,
431
+ "rstrip": false,
432
+ "single_word": false,
433
+ "special": false
434
+ },
435
+ "21177": {
436
+ "content": "屙",
437
+ "lstrip": false,
438
+ "normalized": true,
439
+ "rstrip": false,
440
+ "single_word": false,
441
+ "special": false
442
+ },
443
+ "21178": {
444
+ "content": "孭",
445
+ "lstrip": false,
446
+ "normalized": true,
447
+ "rstrip": false,
448
+ "single_word": false,
449
+ "special": false
450
+ },
451
+ "21179": {
452
+ "content": "呔",
453
+ "lstrip": false,
454
+ "normalized": true,
455
+ "rstrip": false,
456
+ "single_word": false,
457
+ "special": false
458
+ },
459
+ "21180": {
460
+ "content": "膥",
461
+ "lstrip": false,
462
+ "normalized": true,
463
+ "rstrip": false,
464
+ "single_word": false,
465
+ "special": false
466
+ },
467
+ "21181": {
468
+ "content": "唞",
469
+ "lstrip": false,
470
+ "normalized": true,
471
+ "rstrip": false,
472
+ "single_word": false,
473
+ "special": false
474
+ },
475
+ "21182": {
476
+ "content": "拃",
477
+ "lstrip": false,
478
+ "normalized": true,
479
+ "rstrip": false,
480
+ "single_word": false,
481
+ "special": false
482
+ },
483
+ "21183": {
484
+ "content": "繙",
485
+ "lstrip": false,
486
+ "normalized": true,
487
+ "rstrip": false,
488
+ "single_word": false,
489
+ "special": false
490
+ },
491
+ "21184": {
492
+ "content": "揼",
493
+ "lstrip": false,
494
+ "normalized": true,
495
+ "rstrip": false,
496
+ "single_word": false,
497
+ "special": false
498
+ },
499
+ "21185": {
500
+ "content": "栢",
501
+ "lstrip": false,
502
+ "normalized": true,
503
+ "rstrip": false,
504
+ "single_word": false,
505
+ "special": false
506
+ },
507
+ "21186": {
508
+ "content": "櫈",
509
+ "lstrip": false,
510
+ "normalized": true,
511
+ "rstrip": false,
512
+ "single_word": false,
513
+ "special": false
514
+ },
515
+ "21187": {
516
+ "content": "窰",
517
+ "lstrip": false,
518
+ "normalized": true,
519
+ "rstrip": false,
520
+ "single_word": false,
521
+ "special": false
522
+ },
523
+ "21188": {
524
+ "content": "戥",
525
+ "lstrip": false,
526
+ "normalized": true,
527
+ "rstrip": false,
528
+ "single_word": false,
529
+ "special": false
530
+ },
531
+ "21189": {
532
+ "content": "嗱",
533
+ "lstrip": false,
534
+ "normalized": true,
535
+ "rstrip": false,
536
+ "single_word": false,
537
+ "special": false
538
+ },
539
+ "21190": {
540
+ "content": "竉",
541
+ "lstrip": false,
542
+ "normalized": true,
543
+ "rstrip": false,
544
+ "single_word": false,
545
+ "special": false
546
+ },
547
+ "21191": {
548
+ "content": "炆",
549
+ "lstrip": false,
550
+ "normalized": true,
551
+ "rstrip": false,
552
+ "single_word": false,
553
+ "special": false
554
+ },
555
+ "21192": {
556
+ "content": "煇",
557
+ "lstrip": false,
558
+ "normalized": true,
559
+ "rstrip": false,
560
+ "single_word": false,
561
+ "special": false
562
+ },
563
+ "21193": {
564
+ "content": "屘",
565
+ "lstrip": false,
566
+ "normalized": true,
567
+ "rstrip": false,
568
+ "single_word": false,
569
+ "special": false
570
+ },
571
+ "21194": {
572
+ "content": "殮",
573
+ "lstrip": false,
574
+ "normalized": true,
575
+ "rstrip": false,
576
+ "single_word": false,
577
+ "special": false
578
+ },
579
+ "21195": {
580
+ "content": "塱",
581
+ "lstrip": false,
582
+ "normalized": true,
583
+ "rstrip": false,
584
+ "single_word": false,
585
+ "special": false
586
+ },
587
+ "21196": {
588
+ "content": "埞",
589
+ "lstrip": false,
590
+ "normalized": true,
591
+ "rstrip": false,
592
+ "single_word": false,
593
+ "special": false
594
+ },
595
+ "21197": {
596
+ "content": "㷫",
597
+ "lstrip": false,
598
+ "normalized": true,
599
+ "rstrip": false,
600
+ "single_word": false,
601
+ "special": false
602
+ },
603
+ "21198": {
604
+ "content": "啩",
605
+ "lstrip": false,
606
+ "normalized": true,
607
+ "rstrip": false,
608
+ "single_word": false,
609
+ "special": false
610
+ },
611
+ "21199": {
612
+ "content": "鱲",
613
+ "lstrip": false,
614
+ "normalized": true,
615
+ "rstrip": false,
616
+ "single_word": false,
617
+ "special": false
618
+ },
619
+ "21200": {
620
+ "content": "梘",
621
+ "lstrip": false,
622
+ "normalized": true,
623
+ "rstrip": false,
624
+ "single_word": false,
625
+ "special": false
626
+ },
627
+ "21201": {
628
+ "content": "戇",
629
+ "lstrip": false,
630
+ "normalized": true,
631
+ "rstrip": false,
632
+ "single_word": false,
633
+ "special": false
634
+ },
635
+ "21202": {
636
+ "content": "掕",
637
+ "lstrip": false,
638
+ "normalized": true,
639
+ "rstrip": false,
640
+ "single_word": false,
641
+ "special": false
642
+ },
643
+ "21203": {
644
+ "content": "痾",
645
+ "lstrip": false,
646
+ "normalized": true,
647
+ "rstrip": false,
648
+ "single_word": false,
649
+ "special": false
650
+ },
651
+ "21204": {
652
+ "content": "孲",
653
+ "lstrip": false,
654
+ "normalized": true,
655
+ "rstrip": false,
656
+ "single_word": false,
657
+ "special": false
658
+ },
659
+ "21205": {
660
+ "content": "闐",
661
+ "lstrip": false,
662
+ "normalized": true,
663
+ "rstrip": false,
664
+ "single_word": false,
665
+ "special": false
666
+ },
667
+ "21206": {
668
+ "content": "濰",
669
+ "lstrip": false,
670
+ "normalized": true,
671
+ "rstrip": false,
672
+ "single_word": false,
673
+ "special": false
674
+ },
675
+ "21207": {
676
+ "content": "糭",
677
+ "lstrip": false,
678
+ "normalized": true,
679
+ "rstrip": false,
680
+ "single_word": false,
681
+ "special": false
682
+ },
683
+ "21208": {
684
+ "content": "牀",
685
+ "lstrip": false,
686
+ "normalized": true,
687
+ "rstrip": false,
688
+ "single_word": false,
689
+ "special": false
690
+ },
691
+ "21209": {
692
+ "content": "癐",
693
+ "lstrip": false,
694
+ "normalized": true,
695
+ "rstrip": false,
696
+ "single_word": false,
697
+ "special": false
698
+ },
699
+ "21210": {
700
+ "content": "腍",
701
+ "lstrip": false,
702
+ "normalized": true,
703
+ "rstrip": false,
704
+ "single_word": false,
705
+ "special": false
706
+ },
707
+ "21211": {
708
+ "content": "褦",
709
+ "lstrip": false,
710
+ "normalized": true,
711
+ "rstrip": false,
712
+ "single_word": false,
713
+ "special": false
714
+ },
715
+ "21212": {
716
+ "content": "揗",
717
+ "lstrip": false,
718
+ "normalized": true,
719
+ "rstrip": false,
720
+ "single_word": false,
721
+ "special": false
722
+ },
723
+ "21213": {
724
+ "content": "韃",
725
+ "lstrip": false,
726
+ "normalized": true,
727
+ "rstrip": false,
728
+ "single_word": false,
729
+ "special": false
730
+ },
731
+ "21214": {
732
+ "content": "鴴",
733
+ "lstrip": false,
734
+ "normalized": true,
735
+ "rstrip": false,
736
+ "single_word": false,
737
+ "special": false
738
+ },
739
+ "21215": {
740
+ "content": "鴞",
741
+ "lstrip": false,
742
+ "normalized": true,
743
+ "rstrip": false,
744
+ "single_word": false,
745
+ "special": false
746
+ },
747
+ "21216": {
748
+ "content": "唻",
749
+ "lstrip": false,
750
+ "normalized": true,
751
+ "rstrip": false,
752
+ "single_word": false,
753
+ "special": false
754
+ },
755
+ "21217": {
756
+ "content": "孻",
757
+ "lstrip": false,
758
+ "normalized": true,
759
+ "rstrip": false,
760
+ "single_word": false,
761
+ "special": false
762
+ },
763
+ "21218": {
764
+ "content": "擸",
765
+ "lstrip": false,
766
+ "normalized": true,
767
+ "rstrip": false,
768
+ "single_word": false,
769
+ "special": false
770
+ },
771
+ "21219": {
772
+ "content": "搣",
773
+ "lstrip": false,
774
+ "normalized": true,
775
+ "rstrip": false,
776
+ "single_word": false,
777
+ "special": false
778
+ },
779
+ "21220": {
780
+ "content": "罟",
781
+ "lstrip": false,
782
+ "normalized": true,
783
+ "rstrip": false,
784
+ "single_word": false,
785
+ "special": false
786
+ },
787
+ "21221": {
788
+ "content": "埲",
789
+ "lstrip": false,
790
+ "normalized": true,
791
+ "rstrip": false,
792
+ "single_word": false,
793
+ "special": false
794
+ },
795
+ "21222": {
796
+ "content": "鉸",
797
+ "lstrip": false,
798
+ "normalized": true,
799
+ "rstrip": false,
800
+ "single_word": false,
801
+ "special": false
802
+ },
803
+ "21223": {
804
+ "content": "膶",
805
+ "lstrip": false,
806
+ "normalized": true,
807
+ "rstrip": false,
808
+ "single_word": false,
809
+ "special": false
810
+ },
811
+ "21224": {
812
+ "content": "裇",
813
+ "lstrip": false,
814
+ "normalized": true,
815
+ "rstrip": false,
816
+ "single_word": false,
817
+ "special": false
818
+ },
819
+ "21225": {
820
+ "content": "堊",
821
+ "lstrip": false,
822
+ "normalized": true,
823
+ "rstrip": false,
824
+ "single_word": false,
825
+ "special": false
826
+ },
827
+ "21226": {
828
+ "content": "攰",
829
+ "lstrip": false,
830
+ "normalized": true,
831
+ "rstrip": false,
832
+ "single_word": false,
833
+ "special": false
834
+ },
835
+ "21227": {
836
+ "content": "鯪",
837
+ "lstrip": false,
838
+ "normalized": true,
839
+ "rstrip": false,
840
+ "single_word": false,
841
+ "special": false
842
+ },
843
+ "21228": {
844
+ "content": "詏",
845
+ "lstrip": false,
846
+ "normalized": true,
847
+ "rstrip": false,
848
+ "single_word": false,
849
+ "special": false
850
+ },
851
+ "21229": {
852
+ "content": "揦",
853
+ "lstrip": false,
854
+ "normalized": true,
855
+ "rstrip": false,
856
+ "single_word": false,
857
+ "special": false
858
+ },
859
+ "21230": {
860
+ "content": "烚",
861
+ "lstrip": false,
862
+ "normalized": true,
863
+ "rstrip": false,
864
+ "single_word": false,
865
+ "special": false
866
+ },
867
+ "21231": {
868
+ "content": "軫",
869
+ "lstrip": false,
870
+ "normalized": true,
871
+ "rstrip": false,
872
+ "single_word": false,
873
+ "special": false
874
+ },
875
+ "21232": {
876
+ "content": "趷",
877
+ "lstrip": false,
878
+ "normalized": true,
879
+ "rstrip": false,
880
+ "single_word": false,
881
+ "special": false
882
+ },
883
+ "21233": {
884
+ "content": "蛺",
885
+ "lstrip": false,
886
+ "normalized": true,
887
+ "rstrip": false,
888
+ "single_word": false,
889
+ "special": false
890
+ },
891
+ "21234": {
892
+ "content": "頴",
893
+ "lstrip": false,
894
+ "normalized": true,
895
+ "rstrip": false,
896
+ "single_word": false,
897
+ "special": false
898
+ },
899
+ "21235": {
900
+ "content": "唂",
901
+ "lstrip": false,
902
+ "normalized": true,
903
+ "rstrip": false,
904
+ "single_word": false,
905
+ "special": false
906
+ },
907
+ "21236": {
908
+ "content": "錒",
909
+ "lstrip": false,
910
+ "normalized": true,
911
+ "rstrip": false,
912
+ "single_word": false,
913
+ "special": false
914
+ },
915
+ "21237": {
916
+ "content": "幗",
917
+ "lstrip": false,
918
+ "normalized": true,
919
+ "rstrip": false,
920
+ "single_word": false,
921
+ "special": false
922
+ },
923
+ "21238": {
924
+ "content": "魨",
925
+ "lstrip": false,
926
+ "normalized": true,
927
+ "rstrip": false,
928
+ "single_word": false,
929
+ "special": false
930
+ },
931
+ "21239": {
932
+ "content": "昺",
933
+ "lstrip": false,
934
+ "normalized": true,
935
+ "rstrip": false,
936
+ "single_word": false,
937
+ "special": false
938
+ },
939
+ "21240": {
940
+ "content": "慤",
941
+ "lstrip": false,
942
+ "normalized": true,
943
+ "rstrip": false,
944
+ "single_word": false,
945
+ "special": false
946
+ },
947
+ "21241": {
948
+ "content": "簒",
949
+ "lstrip": false,
950
+ "normalized": true,
951
+ "rstrip": false,
952
+ "single_word": false,
953
+ "special": false
954
+ },
955
+ "21242": {
956
+ "content": "錡",
957
+ "lstrip": false,
958
+ "normalized": true,
959
+ "rstrip": false,
960
+ "single_word": false,
961
+ "special": false
962
+ },
963
+ "21243": {
964
+ "content": "猻",
965
+ "lstrip": false,
966
+ "normalized": true,
967
+ "rstrip": false,
968
+ "single_word": false,
969
+ "special": false
970
+ },
971
+ "21244": {
972
+ "content": "氼",
973
+ "lstrip": false,
974
+ "normalized": true,
975
+ "rstrip": false,
976
+ "single_word": false,
977
+ "special": false
978
+ },
979
+ "21245": {
980
+ "content": "銼",
981
+ "lstrip": false,
982
+ "normalized": true,
983
+ "rstrip": false,
984
+ "single_word": false,
985
+ "special": false
986
+ },
987
+ "21246": {
988
+ "content": "鬅",
989
+ "lstrip": false,
990
+ "normalized": true,
991
+ "rstrip": false,
992
+ "single_word": false,
993
+ "special": false
994
+ },
995
+ "21247": {
996
+ "content": "曱",
997
+ "lstrip": false,
998
+ "normalized": true,
999
+ "rstrip": false,
1000
+ "single_word": false,
1001
+ "special": false
1002
+ },
1003
+ "21248": {
1004
+ "content": "旚",
1005
+ "lstrip": false,
1006
+ "normalized": true,
1007
+ "rstrip": false,
1008
+ "single_word": false,
1009
+ "special": false
1010
+ },
1011
+ "21249": {
1012
+ "content": "鑭",
1013
+ "lstrip": false,
1014
+ "normalized": true,
1015
+ "rstrip": false,
1016
+ "single_word": false,
1017
+ "special": false
1018
+ },
1019
+ "21250": {
1020
+ "content": "軚",
1021
+ "lstrip": false,
1022
+ "normalized": true,
1023
+ "rstrip": false,
1024
+ "single_word": false,
1025
+ "special": false
1026
+ },
1027
+ "21251": {
1028
+ "content": "揈",
1029
+ "lstrip": false,
1030
+ "normalized": true,
1031
+ "rstrip": false,
1032
+ "single_word": false,
1033
+ "special": false
1034
+ },
1035
+ "21252": {
1036
+ "content": "偲",
1037
+ "lstrip": false,
1038
+ "normalized": true,
1039
+ "rstrip": false,
1040
+ "single_word": false,
1041
+ "special": false
1042
+ },
1043
+ "21253": {
1044
+ "content": "鵐",
1045
+ "lstrip": false,
1046
+ "normalized": true,
1047
+ "rstrip": false,
1048
+ "single_word": false,
1049
+ "special": false
1050
+ },
1051
+ "21254": {
1052
+ "content": "餬",
1053
+ "lstrip": false,
1054
+ "normalized": true,
1055
+ "rstrip": false,
1056
+ "single_word": false,
1057
+ "special": false
1058
+ },
1059
+ "21255": {
1060
+ "content": "挐",
1061
+ "lstrip": false,
1062
+ "normalized": true,
1063
+ "rstrip": false,
1064
+ "single_word": false,
1065
+ "special": false
1066
+ },
1067
+ "21256": {
1068
+ "content": "骹",
1069
+ "lstrip": false,
1070
+ "normalized": true,
1071
+ "rstrip": false,
1072
+ "single_word": false,
1073
+ "special": false
1074
+ },
1075
+ "21257": {
1076
+ "content": "搲",
1077
+ "lstrip": false,
1078
+ "normalized": true,
1079
+ "rstrip": false,
1080
+ "single_word": false,
1081
+ "special": false
1082
+ },
1083
+ "21258": {
1084
+ "content": "戙",
1085
+ "lstrip": false,
1086
+ "normalized": true,
1087
+ "rstrip": false,
1088
+ "single_word": false,
1089
+ "special": false
1090
+ },
1091
+ "21259": {
1092
+ "content": "踎",
1093
+ "lstrip": false,
1094
+ "normalized": true,
1095
+ "rstrip": false,
1096
+ "single_word": false,
1097
+ "special": false
1098
+ },
1099
+ "21260": {
1100
+ "content": "泂",
1101
+ "lstrip": false,
1102
+ "normalized": true,
1103
+ "rstrip": false,
1104
+ "single_word": false,
1105
+ "special": false
1106
+ },
1107
+ "21261": {
1108
+ "content": "掗",
1109
+ "lstrip": false,
1110
+ "normalized": true,
1111
+ "rstrip": false,
1112
+ "single_word": false,
1113
+ "special": false
1114
+ },
1115
+ "21262": {
1116
+ "content": "簕",
1117
+ "lstrip": false,
1118
+ "normalized": true,
1119
+ "rstrip": false,
1120
+ "single_word": false,
1121
+ "special": false
1122
+ },
1123
+ "21263": {
1124
+ "content": "淥",
1125
+ "lstrip": false,
1126
+ "normalized": true,
1127
+ "rstrip": false,
1128
+ "single_word": false,
1129
+ "special": false
1130
+ },
1131
+ "21264": {
1132
+ "content": "焫",
1133
+ "lstrip": false,
1134
+ "normalized": true,
1135
+ "rstrip": false,
1136
+ "single_word": false,
1137
+ "special": false
1138
+ },
1139
+ "21265": {
1140
+ "content": "抌",
1141
+ "lstrip": false,
1142
+ "normalized": true,
1143
+ "rstrip": false,
1144
+ "single_word": false,
1145
+ "special": false
1146
+ },
1147
+ "21266": {
1148
+ "content": "幪",
1149
+ "lstrip": false,
1150
+ "normalized": true,
1151
+ "rstrip": false,
1152
+ "single_word": false,
1153
+ "special": false
1154
+ },
1155
+ "21267": {
1156
+ "content": "甴",
1157
+ "lstrip": false,
1158
+ "normalized": true,
1159
+ "rstrip": false,
1160
+ "single_word": false,
1161
+ "special": false
1162
+ },
1163
+ "21268": {
1164
+ "content": "獴",
1165
+ "lstrip": false,
1166
+ "normalized": true,
1167
+ "rstrip": false,
1168
+ "single_word": false,
1169
+ "special": false
1170
+ },
1171
+ "21269": {
1172
+ "content": "鯭",
1173
+ "lstrip": false,
1174
+ "normalized": true,
1175
+ "rstrip": false,
1176
+ "single_word": false,
1177
+ "special": false
1178
+ },
1179
+ "21270": {
1180
+ "content": "嚙",
1181
+ "lstrip": false,
1182
+ "normalized": true,
1183
+ "rstrip": false,
1184
+ "single_word": false,
1185
+ "special": false
1186
+ },
1187
+ "21271": {
1188
+ "content": "椏",
1189
+ "lstrip": false,
1190
+ "normalized": true,
1191
+ "rstrip": false,
1192
+ "single_word": false,
1193
+ "special": false
1194
+ },
1195
+ "21272": {
1196
+ "content": "鼆",
1197
+ "lstrip": false,
1198
+ "normalized": true,
1199
+ "rstrip": false,
1200
+ "single_word": false,
1201
+ "special": false
1202
+ },
1203
+ "21273": {
1204
+ "content": "拏",
1205
+ "lstrip": false,
1206
+ "normalized": true,
1207
+ "rstrip": false,
1208
+ "single_word": false,
1209
+ "special": false
1210
+ },
1211
+ "21274": {
1212
+ "content": "鸛",
1213
+ "lstrip": false,
1214
+ "normalized": true,
1215
+ "rstrip": false,
1216
+ "single_word": false,
1217
+ "special": false
1218
+ },
1219
+ "21275": {
1220
+ "content": "嚡",
1221
+ "lstrip": false,
1222
+ "normalized": true,
1223
+ "rstrip": false,
1224
+ "single_word": false,
1225
+ "special": false
1226
+ },
1227
+ "21276": {
1228
+ "content": "緡",
1229
+ "lstrip": false,
1230
+ "normalized": true,
1231
+ "rstrip": false,
1232
+ "single_word": false,
1233
+ "special": false
1234
+ },
1235
+ "21277": {
1236
+ "content": "攋",
1237
+ "lstrip": false,
1238
+ "normalized": true,
1239
+ "rstrip": false,
1240
+ "single_word": false,
1241
+ "special": false
1242
+ },
1243
+ "21278": {
1244
+ "content": "鍶",
1245
+ "lstrip": false,
1246
+ "normalized": true,
1247
+ "rstrip": false,
1248
+ "single_word": false,
1249
+ "special": false
1250
+ },
1251
+ "21279": {
1252
+ "content": "稈",
1253
+ "lstrip": false,
1254
+ "normalized": true,
1255
+ "rstrip": false,
1256
+ "single_word": false,
1257
+ "special": false
1258
+ },
1259
+ "21280": {
1260
+ "content": "坭",
1261
+ "lstrip": false,
1262
+ "normalized": true,
1263
+ "rstrip": false,
1264
+ "single_word": false,
1265
+ "special": false
1266
+ },
1267
+ "21281": {
1268
+ "content": "礮",
1269
+ "lstrip": false,
1270
+ "normalized": true,
1271
+ "rstrip": false,
1272
+ "single_word": false,
1273
+ "special": false
1274
+ },
1275
+ "21282": {
1276
+ "content": "猢",
1277
+ "lstrip": false,
1278
+ "normalized": true,
1279
+ "rstrip": false,
1280
+ "single_word": false,
1281
+ "special": false
1282
+ },
1283
+ "21283": {
1284
+ "content": "喼",
1285
+ "lstrip": false,
1286
+ "normalized": true,
1287
+ "rstrip": false,
1288
+ "single_word": false,
1289
+ "special": false
1290
+ },
1291
+ "21284": {
1292
+ "content": "閪",
1293
+ "lstrip": false,
1294
+ "normalized": true,
1295
+ "rstrip": false,
1296
+ "single_word": false,
1297
+ "special": false
1298
+ },
1299
+ "21285": {
1300
+ "content": "奀",
1301
+ "lstrip": false,
1302
+ "normalized": true,
1303
+ "rstrip": false,
1304
+ "single_word": false,
1305
+ "special": false
1306
+ },
1307
+ "21286": {
1308
+ "content": "吔",
1309
+ "lstrip": false,
1310
+ "normalized": true,
1311
+ "rstrip": false,
1312
+ "single_word": false,
1313
+ "special": false
1314
+ },
1315
+ "21287": {
1316
+ "content": "甂",
1317
+ "lstrip": false,
1318
+ "normalized": true,
1319
+ "rstrip": false,
1320
+ "single_word": false,
1321
+ "special": false
1322
+ },
1323
+ "21288": {
1324
+ "content": "扽",
1325
+ "lstrip": false,
1326
+ "normalized": true,
1327
+ "rstrip": false,
1328
+ "single_word": false,
1329
+ "special": false
1330
+ },
1331
+ "21289": {
1332
+ "content": "胐",
1333
+ "lstrip": false,
1334
+ "normalized": true,
1335
+ "rstrip": false,
1336
+ "single_word": false,
1337
+ "special": false
1338
+ },
1339
+ "21290": {
1340
+ "content": "掹",
1341
+ "lstrip": false,
1342
+ "normalized": true,
1343
+ "rstrip": false,
1344
+ "single_word": false,
1345
+ "special": false
1346
+ },
1347
+ "21291": {
1348
+ "content": "潁",
1349
+ "lstrip": false,
1350
+ "normalized": true,
1351
+ "rstrip": false,
1352
+ "single_word": false,
1353
+ "special": false
1354
+ },
1355
+ "21292": {
1356
+ "content": "鷿",
1357
+ "lstrip": false,
1358
+ "normalized": true,
1359
+ "rstrip": false,
1360
+ "single_word": false,
1361
+ "special": false
1362
+ },
1363
+ "21293": {
1364
+ "content": "鷈",
1365
+ "lstrip": false,
1366
+ "normalized": true,
1367
+ "rstrip": false,
1368
+ "single_word": false,
1369
+ "special": false
1370
+ },
1371
+ "21294": {
1372
+ "content": "觜",
1373
+ "lstrip": false,
1374
+ "normalized": true,
1375
+ "rstrip": false,
1376
+ "single_word": false,
1377
+ "special": false
1378
+ },
1379
+ "21295": {
1380
+ "content": "茛",
1381
+ "lstrip": false,
1382
+ "normalized": true,
1383
+ "rstrip": false,
1384
+ "single_word": false,
1385
+ "special": false
1386
+ },
1387
+ "21296": {
1388
+ "content": "摷",
1389
+ "lstrip": false,
1390
+ "normalized": true,
1391
+ "rstrip": false,
1392
+ "single_word": false,
1393
+ "special": false
1394
+ },
1395
+ "21297": {
1396
+ "content": "趯",
1397
+ "lstrip": false,
1398
+ "normalized": true,
1399
+ "rstrip": false,
1400
+ "single_word": false,
1401
+ "special": false
1402
+ },
1403
+ "21298": {
1404
+ "content": "鸕",
1405
+ "lstrip": false,
1406
+ "normalized": true,
1407
+ "rstrip": false,
1408
+ "single_word": false,
1409
+ "special": false
1410
+ },
1411
+ "21299": {
1412
+ "content": "鶿",
1413
+ "lstrip": false,
1414
+ "normalized": true,
1415
+ "rstrip": false,
1416
+ "single_word": false,
1417
+ "special": false
1418
+ },
1419
+ "21300": {
1420
+ "content": "驄",
1421
+ "lstrip": false,
1422
+ "normalized": true,
1423
+ "rstrip": false,
1424
+ "single_word": false,
1425
+ "special": false
1426
+ },
1427
+ "21301": {
1428
+ "content": "藪",
1429
+ "lstrip": false,
1430
+ "normalized": true,
1431
+ "rstrip": false,
1432
+ "single_word": false,
1433
+ "special": false
1434
+ },
1435
+ "21302": {
1436
+ "content": "唨",
1437
+ "lstrip": false,
1438
+ "normalized": true,
1439
+ "rstrip": false,
1440
+ "single_word": false,
1441
+ "special": false
1442
+ },
1443
+ "21303": {
1444
+ "content": "銨",
1445
+ "lstrip": false,
1446
+ "normalized": true,
1447
+ "rstrip": false,
1448
+ "single_word": false,
1449
+ "special": false
1450
+ },
1451
+ "21304": {
1452
+ "content": "搦",
1453
+ "lstrip": false,
1454
+ "normalized": true,
1455
+ "rstrip": false,
1456
+ "single_word": false,
1457
+ "special": false
1458
+ },
1459
+ "21305": {
1460
+ "content": "佮",
1461
+ "lstrip": false,
1462
+ "normalized": true,
1463
+ "rstrip": false,
1464
+ "single_word": false,
1465
+ "special": false
1466
+ },
1467
+ "21306": {
1468
+ "content": "跣",
1469
+ "lstrip": false,
1470
+ "normalized": true,
1471
+ "rstrip": false,
1472
+ "single_word": false,
1473
+ "special": false
1474
+ },
1475
+ "21307": {
1476
+ "content": "菴",
1477
+ "lstrip": false,
1478
+ "normalized": true,
1479
+ "rstrip": false,
1480
+ "single_word": false,
1481
+ "special": false
1482
+ },
1483
+ "21308": {
1484
+ "content": "顥",
1485
+ "lstrip": false,
1486
+ "normalized": true,
1487
+ "rstrip": false,
1488
+ "single_word": false,
1489
+ "special": false
1490
+ },
1491
+ "21309": {
1492
+ "content": "塹",
1493
+ "lstrip": false,
1494
+ "normalized": true,
1495
+ "rstrip": false,
1496
+ "single_word": false,
1497
+ "special": false
1498
+ },
1499
+ "21310": {
1500
+ "content": "䊦",
1501
+ "lstrip": false,
1502
+ "normalized": true,
1503
+ "rstrip": false,
1504
+ "single_word": false,
1505
+ "special": false
1506
+ },
1507
+ "21311": {
1508
+ "content": "脧",
1509
+ "lstrip": false,
1510
+ "normalized": true,
1511
+ "rstrip": false,
1512
+ "single_word": false,
1513
+ "special": false
1514
+ },
1515
+ "21312": {
1516
+ "content": "蕓",
1517
+ "lstrip": false,
1518
+ "normalized": true,
1519
+ "rstrip": false,
1520
+ "single_word": false,
1521
+ "special": false
1522
+ },
1523
+ "21313": {
1524
+ "content": "躝",
1525
+ "lstrip": false,
1526
+ "normalized": true,
1527
+ "rstrip": false,
1528
+ "single_word": false,
1529
+ "special": false
1530
+ },
1531
+ "21314": {
1532
+ "content": "灕",
1533
+ "lstrip": false,
1534
+ "normalized": true,
1535
+ "rstrip": false,
1536
+ "single_word": false,
1537
+ "special": false
1538
+ },
1539
+ "21315": {
1540
+ "content": "㚻",
1541
+ "lstrip": false,
1542
+ "normalized": true,
1543
+ "rstrip": false,
1544
+ "single_word": false,
1545
+ "special": false
1546
+ },
1547
+ "21316": {
1548
+ "content": "癆",
1549
+ "lstrip": false,
1550
+ "normalized": true,
1551
+ "rstrip": false,
1552
+ "single_word": false,
1553
+ "special": false
1554
+ },
1555
+ "21317": {
1556
+ "content": "廸",
1557
+ "lstrip": false,
1558
+ "normalized": true,
1559
+ "rstrip": false,
1560
+ "single_word": false,
1561
+ "special": false
1562
+ },
1563
+ "21318": {
1564
+ "content": "鉍",
1565
+ "lstrip": false,
1566
+ "normalized": true,
1567
+ "rstrip": false,
1568
+ "single_word": false,
1569
+ "special": false
1570
+ },
1571
+ "21319": {
1572
+ "content": "銻",
1573
+ "lstrip": false,
1574
+ "normalized": true,
1575
+ "rstrip": false,
1576
+ "single_word": false,
1577
+ "special": false
1578
+ },
1579
+ "21320": {
1580
+ "content": "龢",
1581
+ "lstrip": false,
1582
+ "normalized": true,
1583
+ "rstrip": false,
1584
+ "single_word": false,
1585
+ "special": false
1586
+ },
1587
+ "21321": {
1588
+ "content": "啹",
1589
+ "lstrip": false,
1590
+ "normalized": true,
1591
+ "rstrip": false,
1592
+ "single_word": false,
1593
+ "special": false
1594
+ },
1595
+ "21322": {
1596
+ "content": "璣",
1597
+ "lstrip": false,
1598
+ "normalized": true,
1599
+ "rstrip": false,
1600
+ "single_word": false,
1601
+ "special": false
1602
+ },
1603
+ "21323": {
1604
+ "content": "篾",
1605
+ "lstrip": false,
1606
+ "normalized": true,
1607
+ "rstrip": false,
1608
+ "single_word": false,
1609
+ "special": false
1610
+ },
1611
+ "21324": {
1612
+ "content": "樨",
1613
+ "lstrip": false,
1614
+ "normalized": true,
1615
+ "rstrip": false,
1616
+ "single_word": false,
1617
+ "special": false
1618
+ },
1619
+ "21325": {
1620
+ "content": "���",
1621
+ "lstrip": false,
1622
+ "normalized": true,
1623
+ "rstrip": false,
1624
+ "single_word": false,
1625
+ "special": false
1626
+ },
1627
+ "21326": {
1628
+ "content": "燶",
1629
+ "lstrip": false,
1630
+ "normalized": true,
1631
+ "rstrip": false,
1632
+ "single_word": false,
1633
+ "special": false
1634
+ },
1635
+ "21327": {
1636
+ "content": "顳",
1637
+ "lstrip": false,
1638
+ "normalized": true,
1639
+ "rstrip": false,
1640
+ "single_word": false,
1641
+ "special": false
1642
+ },
1643
+ "21328": {
1644
+ "content": "鎅",
1645
+ "lstrip": false,
1646
+ "normalized": true,
1647
+ "rstrip": false,
1648
+ "single_word": false,
1649
+ "special": false
1650
+ },
1651
+ "21329": {
1652
+ "content": "鷸",
1653
+ "lstrip": false,
1654
+ "normalized": true,
1655
+ "rstrip": false,
1656
+ "single_word": false,
1657
+ "special": false
1658
+ },
1659
+ "21330": {
1660
+ "content": "㩒",
1661
+ "lstrip": false,
1662
+ "normalized": true,
1663
+ "rstrip": false,
1664
+ "single_word": false,
1665
+ "special": false
1666
+ },
1667
+ "21331": {
1668
+ "content": "鰹",
1669
+ "lstrip": false,
1670
+ "normalized": true,
1671
+ "rstrip": false,
1672
+ "single_word": false,
1673
+ "special": false
1674
+ },
1675
+ "21332": {
1676
+ "content": "嗍",
1677
+ "lstrip": false,
1678
+ "normalized": true,
1679
+ "rstrip": false,
1680
+ "single_word": false,
1681
+ "special": false
1682
+ },
1683
+ "21333": {
1684
+ "content": "撠",
1685
+ "lstrip": false,
1686
+ "normalized": true,
1687
+ "rstrip": false,
1688
+ "single_word": false,
1689
+ "special": false
1690
+ },
1691
+ "21334": {
1692
+ "content": "髹",
1693
+ "lstrip": false,
1694
+ "normalized": true,
1695
+ "rstrip": false,
1696
+ "single_word": false,
1697
+ "special": false
1698
+ },
1699
+ "21335": {
1700
+ "content": "驃",
1701
+ "lstrip": false,
1702
+ "normalized": true,
1703
+ "rstrip": false,
1704
+ "single_word": false,
1705
+ "special": false
1706
+ },
1707
+ "21336": {
1708
+ "content": "羋",
1709
+ "lstrip": false,
1710
+ "normalized": true,
1711
+ "rstrip": false,
1712
+ "single_word": false,
1713
+ "special": false
1714
+ },
1715
+ "21337": {
1716
+ "content": "溦",
1717
+ "lstrip": false,
1718
+ "normalized": true,
1719
+ "rstrip": false,
1720
+ "single_word": false,
1721
+ "special": false
1722
+ },
1723
+ "21338": {
1724
+ "content": "迆",
1725
+ "lstrip": false,
1726
+ "normalized": true,
1727
+ "rstrip": false,
1728
+ "single_word": false,
1729
+ "special": false
1730
+ },
1731
+ "21339": {
1732
+ "content": "縞",
1733
+ "lstrip": false,
1734
+ "normalized": true,
1735
+ "rstrip": false,
1736
+ "single_word": false,
1737
+ "special": false
1738
+ },
1739
+ "21340": {
1740
+ "content": "僆",
1741
+ "lstrip": false,
1742
+ "normalized": true,
1743
+ "rstrip": false,
1744
+ "single_word": false,
1745
+ "special": false
1746
+ },
1747
+ "21341": {
1748
+ "content": "韞",
1749
+ "lstrip": false,
1750
+ "normalized": true,
1751
+ "rstrip": false,
1752
+ "single_word": false,
1753
+ "special": false
1754
+ },
1755
+ "21342": {
1756
+ "content": "揞",
1757
+ "lstrip": false,
1758
+ "normalized": true,
1759
+ "rstrip": false,
1760
+ "single_word": false,
1761
+ "special": false
1762
+ },
1763
+ "21343": {
1764
+ "content": "柊",
1765
+ "lstrip": false,
1766
+ "normalized": true,
1767
+ "rstrip": false,
1768
+ "single_word": false,
1769
+ "special": false
1770
+ },
1771
+ "21344": {
1772
+ "content": "煠",
1773
+ "lstrip": false,
1774
+ "normalized": true,
1775
+ "rstrip": false,
1776
+ "single_word": false,
1777
+ "special": false
1778
+ },
1779
+ "21345": {
1780
+ "content": "沚",
1781
+ "lstrip": false,
1782
+ "normalized": true,
1783
+ "rstrip": false,
1784
+ "single_word": false,
1785
+ "special": false
1786
+ },
1787
+ "21346": {
1788
+ "content": "儇",
1789
+ "lstrip": false,
1790
+ "normalized": true,
1791
+ "rstrip": false,
1792
+ "single_word": false,
1793
+ "special": false
1794
+ },
1795
+ "21347": {
1796
+ "content": "湼",
1797
+ "lstrip": false,
1798
+ "normalized": true,
1799
+ "rstrip": false,
1800
+ "single_word": false,
1801
+ "special": false
1802
+ },
1803
+ "21348": {
1804
+ "content": "鉬",
1805
+ "lstrip": false,
1806
+ "normalized": true,
1807
+ "rstrip": false,
1808
+ "single_word": false,
1809
+ "special": false
1810
+ },
1811
+ "21349": {
1812
+ "content": "紥",
1813
+ "lstrip": false,
1814
+ "normalized": true,
1815
+ "rstrip": false,
1816
+ "single_word": false,
1817
+ "special": false
1818
+ },
1819
+ "21350": {
1820
+ "content": "鮓",
1821
+ "lstrip": false,
1822
+ "normalized": true,
1823
+ "rstrip": false,
1824
+ "single_word": false,
1825
+ "special": false
1826
+ },
1827
+ "21351": {
1828
+ "content": "鯁",
1829
+ "lstrip": false,
1830
+ "normalized": true,
1831
+ "rstrip": false,
1832
+ "single_word": false,
1833
+ "special": false
1834
+ },
1835
+ "21352": {
1836
+ "content": "擝",
1837
+ "lstrip": false,
1838
+ "normalized": true,
1839
+ "rstrip": false,
1840
+ "single_word": false,
1841
+ "special": false
1842
+ },
1843
+ "21353": {
1844
+ "content": "媺",
1845
+ "lstrip": false,
1846
+ "normalized": true,
1847
+ "rstrip": false,
1848
+ "single_word": false,
1849
+ "special": false
1850
+ },
1851
+ "21354": {
1852
+ "content": "鬭",
1853
+ "lstrip": false,
1854
+ "normalized": true,
1855
+ "rstrip": false,
1856
+ "single_word": false,
1857
+ "special": false
1858
+ },
1859
+ "21355": {
1860
+ "content": "煬",
1861
+ "lstrip": false,
1862
+ "normalized": true,
1863
+ "rstrip": false,
1864
+ "single_word": false,
1865
+ "special": false
1866
+ },
1867
+ "21356": {
1868
+ "content": "銫",
1869
+ "lstrip": false,
1870
+ "normalized": true,
1871
+ "rstrip": false,
1872
+ "single_word": false,
1873
+ "special": false
1874
+ },
1875
+ "21357": {
1876
+ "content": "軛",
1877
+ "lstrip": false,
1878
+ "normalized": true,
1879
+ "rstrip": false,
1880
+ "single_word": false,
1881
+ "special": false
1882
+ },
1883
+ "21358": {
1884
+ "content": "崢",
1885
+ "lstrip": false,
1886
+ "normalized": true,
1887
+ "rstrip": false,
1888
+ "single_word": false,
1889
+ "special": false
1890
+ },
1891
+ "21359": {
1892
+ "content": "捵",
1893
+ "lstrip": false,
1894
+ "normalized": true,
1895
+ "rstrip": false,
1896
+ "single_word": false,
1897
+ "special": false
1898
+ },
1899
+ "21360": {
1900
+ "content": "曺",
1901
+ "lstrip": false,
1902
+ "normalized": true,
1903
+ "rstrip": false,
1904
+ "single_word": false,
1905
+ "special": false
1906
+ },
1907
+ "21361": {
1908
+ "content": "舘",
1909
+ "lstrip": false,
1910
+ "normalized": true,
1911
+ "rstrip": false,
1912
+ "single_word": false,
1913
+ "special": false
1914
+ },
1915
+ "21362": {
1916
+ "content": "鵞",
1917
+ "lstrip": false,
1918
+ "normalized": true,
1919
+ "rstrip": false,
1920
+ "single_word": false,
1921
+ "special": false
1922
+ },
1923
+ "21363": {
1924
+ "content": "儁",
1925
+ "lstrip": false,
1926
+ "normalized": true,
1927
+ "rstrip": false,
1928
+ "single_word": false,
1929
+ "special": false
1930
+ },
1931
+ "21364": {
1932
+ "content": "櫟",
1933
+ "lstrip": false,
1934
+ "normalized": true,
1935
+ "rstrip": false,
1936
+ "single_word": false,
1937
+ "special": false
1938
+ },
1939
+ "21365": {
1940
+ "content": "櫳",
1941
+ "lstrip": false,
1942
+ "normalized": true,
1943
+ "rstrip": false,
1944
+ "single_word": false,
1945
+ "special": false
1946
+ },
1947
+ "21366": {
1948
+ "content": "巉",
1949
+ "lstrip": false,
1950
+ "normalized": true,
1951
+ "rstrip": false,
1952
+ "single_word": false,
1953
+ "special": false
1954
+ },
1955
+ "21367": {
1956
+ "content": "昰",
1957
+ "lstrip": false,
1958
+ "normalized": true,
1959
+ "rstrip": false,
1960
+ "single_word": false,
1961
+ "special": false
1962
+ },
1963
+ "21368": {
1964
+ "content": "娸",
1965
+ "lstrip": false,
1966
+ "normalized": true,
1967
+ "rstrip": false,
1968
+ "single_word": false,
1969
+ "special": false
1970
+ },
1971
+ "21369": {
1972
+ "content": "鶇",
1973
+ "lstrip": false,
1974
+ "normalized": true,
1975
+ "rstrip": false,
1976
+ "single_word": false,
1977
+ "special": false
1978
+ },
1979
+ "21370": {
1980
+ "content": "謳",
1981
+ "lstrip": false,
1982
+ "normalized": true,
1983
+ "rstrip": false,
1984
+ "single_word": false,
1985
+ "special": false
1986
+ },
1987
+ "21371": {
1988
+ "content": "噍",
1989
+ "lstrip": false,
1990
+ "normalized": true,
1991
+ "rstrip": false,
1992
+ "single_word": false,
1993
+ "special": false
1994
+ },
1995
+ "21372": {
1996
+ "content": "龑",
1997
+ "lstrip": false,
1998
+ "normalized": true,
1999
+ "rstrip": false,
2000
+ "single_word": false,
2001
+ "special": false
2002
+ },
2003
+ "21373": {
2004
+ "content": "鶉",
2005
+ "lstrip": false,
2006
+ "normalized": true,
2007
+ "rstrip": false,
2008
+ "single_word": false,
2009
+ "special": false
2010
+ },
2011
+ "21374": {
2012
+ "content": "髧",
2013
+ "lstrip": false,
2014
+ "normalized": true,
2015
+ "rstrip": false,
2016
+ "single_word": false,
2017
+ "special": false
2018
+ },
2019
+ "21375": {
2020
+ "content": "焓",
2021
+ "lstrip": false,
2022
+ "normalized": true,
2023
+ "rstrip": false,
2024
+ "single_word": false,
2025
+ "special": false
2026
+ },
2027
+ "21376": {
2028
+ "content": "諤",
2029
+ "lstrip": false,
2030
+ "normalized": true,
2031
+ "rstrip": false,
2032
+ "single_word": false,
2033
+ "special": false
2034
+ },
2035
+ "21377": {
2036
+ "content": "擳",
2037
+ "lstrip": false,
2038
+ "normalized": true,
2039
+ "rstrip": false,
2040
+ "single_word": false,
2041
+ "special": false
2042
+ },
2043
+ "21378": {
2044
+ "content": "糴",
2045
+ "lstrip": false,
2046
+ "normalized": true,
2047
+ "rstrip": false,
2048
+ "single_word": false,
2049
+ "special": false
2050
+ },
2051
+ "21379": {
2052
+ "content": "嫽",
2053
+ "lstrip": false,
2054
+ "normalized": true,
2055
+ "rstrip": false,
2056
+ "single_word": false,
2057
+ "special": false
2058
+ },
2059
+ "21380": {
2060
+ "content": "閬",
2061
+ "lstrip": false,
2062
+ "normalized": true,
2063
+ "rstrip": false,
2064
+ "single_word": false,
2065
+ "special": false
2066
+ },
2067
+ "21381": {
2068
+ "content": "畧",
2069
+ "lstrip": false,
2070
+ "normalized": true,
2071
+ "rstrip": false,
2072
+ "single_word": false,
2073
+ "special": false
2074
+ },
2075
+ "21382": {
2076
+ "content": "兗",
2077
+ "lstrip": false,
2078
+ "normalized": true,
2079
+ "rstrip": false,
2080
+ "single_word": false,
2081
+ "special": false
2082
+ },
2083
+ "21383": {
2084
+ "content": "罨",
2085
+ "lstrip": false,
2086
+ "normalized": true,
2087
+ "rstrip": false,
2088
+ "single_word": false,
2089
+ "special": false
2090
+ },
2091
+ "21384": {
2092
+ "content": "頊",
2093
+ "lstrip": false,
2094
+ "normalized": true,
2095
+ "rstrip": false,
2096
+ "single_word": false,
2097
+ "special": false
2098
+ },
2099
+ "21385": {
2100
+ "content": "卌",
2101
+ "lstrip": false,
2102
+ "normalized": true,
2103
+ "rstrip": false,
2104
+ "single_word": false,
2105
+ "special": false
2106
+ },
2107
+ "21386": {
2108
+ "content": "剦",
2109
+ "lstrip": false,
2110
+ "normalized": true,
2111
+ "rstrip": false,
2112
+ "single_word": false,
2113
+ "special": false
2114
+ },
2115
+ "21387": {
2116
+ "content": "觚",
2117
+ "lstrip": false,
2118
+ "normalized": true,
2119
+ "rstrip": false,
2120
+ "single_word": false,
2121
+ "special": false
2122
+ },
2123
+ "21388": {
2124
+ "content": "捹",
2125
+ "lstrip": false,
2126
+ "normalized": true,
2127
+ "rstrip": false,
2128
+ "single_word": false,
2129
+ "special": false
2130
+ },
2131
+ "21389": {
2132
+ "content": "囘",
2133
+ "lstrip": false,
2134
+ "normalized": true,
2135
+ "rstrip": false,
2136
+ "single_word": false,
2137
+ "special": false
2138
+ },
2139
+ "21390": {
2140
+ "content": "鷂",
2141
+ "lstrip": false,
2142
+ "normalized": true,
2143
+ "rstrip": false,
2144
+ "single_word": false,
2145
+ "special": false
2146
+ },
2147
+ "21391": {
2148
+ "content": "醂",
2149
+ "lstrip": false,
2150
+ "normalized": true,
2151
+ "rstrip": false,
2152
+ "single_word": false,
2153
+ "special": false
2154
+ },
2155
+ "21392": {
2156
+ "content": "蚺",
2157
+ "lstrip": false,
2158
+ "normalized": true,
2159
+ "rstrip": false,
2160
+ "single_word": false,
2161
+ "special": false
2162
+ },
2163
+ "21393": {
2164
+ "content": "碲",
2165
+ "lstrip": false,
2166
+ "normalized": true,
2167
+ "rstrip": false,
2168
+ "single_word": false,
2169
+ "special": false
2170
+ },
2171
+ "21394": {
2172
+ "content": "銦",
2173
+ "lstrip": false,
2174
+ "normalized": true,
2175
+ "rstrip": false,
2176
+ "single_word": false,
2177
+ "special": false
2178
+ },
2179
+ "21395": {
2180
+ "content": "睼",
2181
+ "lstrip": false,
2182
+ "normalized": true,
2183
+ "rstrip": false,
2184
+ "single_word": false,
2185
+ "special": false
2186
+ },
2187
+ "21396": {
2188
+ "content": "鏌",
2189
+ "lstrip": false,
2190
+ "normalized": true,
2191
+ "rstrip": false,
2192
+ "single_word": false,
2193
+ "special": false
2194
+ },
2195
+ "21397": {
2196
+ "content": "廄",
2197
+ "lstrip": false,
2198
+ "normalized": true,
2199
+ "rstrip": false,
2200
+ "single_word": false,
2201
+ "special": false
2202
+ },
2203
+ "21398": {
2204
+ "content": "鋇",
2205
+ "lstrip": false,
2206
+ "normalized": true,
2207
+ "rstrip": false,
2208
+ "single_word": false,
2209
+ "special": false
2210
+ },
2211
+ "21399": {
2212
+ "content": "谿",
2213
+ "lstrip": false,
2214
+ "normalized": true,
2215
+ "rstrip": false,
2216
+ "single_word": false,
2217
+ "special": false
2218
+ },
2219
+ "21400": {
2220
+ "content": "罉",
2221
+ "lstrip": false,
2222
+ "normalized": true,
2223
+ "rstrip": false,
2224
+ "single_word": false,
2225
+ "special": false
2226
+ },
2227
+ "21401": {
2228
+ "content": "馱",
2229
+ "lstrip": false,
2230
+ "normalized": true,
2231
+ "rstrip": false,
2232
+ "single_word": false,
2233
+ "special": false
2234
+ },
2235
+ "21402": {
2236
+ "content": "藺",
2237
+ "lstrip": false,
2238
+ "normalized": true,
2239
+ "rstrip": false,
2240
+ "single_word": false,
2241
+ "special": false
2242
+ },
2243
+ "21403": {
2244
+ "content": "鐖",
2245
+ "lstrip": false,
2246
+ "normalized": true,
2247
+ "rstrip": false,
2248
+ "single_word": false,
2249
+ "special": false
2250
+ },
2251
+ "21404": {
2252
+ "content": "氚",
2253
+ "lstrip": false,
2254
+ "normalized": true,
2255
+ "rstrip": false,
2256
+ "single_word": false,
2257
+ "special": false
2258
+ },
2259
+ "21405": {
2260
+ "content": "犂",
2261
+ "lstrip": false,
2262
+ "normalized": true,
2263
+ "rstrip": false,
2264
+ "single_word": false,
2265
+ "special": false
2266
+ },
2267
+ "21406": {
2268
+ "content": "鯡",
2269
+ "lstrip": false,
2270
+ "normalized": true,
2271
+ "rstrip": false,
2272
+ "single_word": false,
2273
+ "special": false
2274
+ },
2275
+ "21407": {
2276
+ "content": "瓘",
2277
+ "lstrip": false,
2278
+ "normalized": true,
2279
+ "rstrip": false,
2280
+ "single_word": false,
2281
+ "special": false
2282
+ },
2283
+ "21408": {
2284
+ "content": "禤",
2285
+ "lstrip": false,
2286
+ "normalized": true,
2287
+ "rstrip": false,
2288
+ "single_word": false,
2289
+ "special": false
2290
+ },
2291
+ "21409": {
2292
+ "content": "㨘",
2293
+ "lstrip": false,
2294
+ "normalized": true,
2295
+ "rstrip": false,
2296
+ "single_word": false,
2297
+ "special": false
2298
+ },
2299
+ "21410": {
2300
+ "content": "礬",
2301
+ "lstrip": false,
2302
+ "normalized": true,
2303
+ "rstrip": false,
2304
+ "single_word": false,
2305
+ "special": false
2306
+ },
2307
+ "21411": {
2308
+ "content": "鎢",
2309
+ "lstrip": false,
2310
+ "normalized": true,
2311
+ "rstrip": false,
2312
+ "single_word": false,
2313
+ "special": false
2314
+ },
2315
+ "21412": {
2316
+ "content": "貍",
2317
+ "lstrip": false,
2318
+ "normalized": true,
2319
+ "rstrip": false,
2320
+ "single_word": false,
2321
+ "special": false
2322
+ },
2323
+ "21413": {
2324
+ "content": "噅",
2325
+ "lstrip": false,
2326
+ "normalized": true,
2327
+ "rstrip": false,
2328
+ "single_word": false,
2329
+ "special": false
2330
+ },
2331
+ "21414": {
2332
+ "content": "鴣",
2333
+ "lstrip": false,
2334
+ "normalized": true,
2335
+ "rstrip": false,
2336
+ "single_word": false,
2337
+ "special": false
2338
+ },
2339
+ "21415": {
2340
+ "content": "簋",
2341
+ "lstrip": false,
2342
+ "normalized": true,
2343
+ "rstrip": false,
2344
+ "single_word": false,
2345
+ "special": false
2346
+ },
2347
+ "21416": {
2348
+ "content": "鮋",
2349
+ "lstrip": false,
2350
+ "normalized": true,
2351
+ "rstrip": false,
2352
+ "single_word": false,
2353
+ "special": false
2354
+ },
2355
+ "21417": {
2356
+ "content": "䴉",
2357
+ "lstrip": false,
2358
+ "normalized": true,
2359
+ "rstrip": false,
2360
+ "single_word": false,
2361
+ "special": false
2362
+ },
2363
+ "21418": {
2364
+ "content": "扤",
2365
+ "lstrip": false,
2366
+ "normalized": true,
2367
+ "rstrip": false,
2368
+ "single_word": false,
2369
+ "special": false
2370
+ },
2371
+ "21419": {
2372
+ "content": "銣",
2373
+ "lstrip": false,
2374
+ "normalized": true,
2375
+ "rstrip": false,
2376
+ "single_word": false,
2377
+ "special": false
2378
+ },
2379
+ "21420": {
2380
+ "content": "穏",
2381
+ "lstrip": false,
2382
+ "normalized": true,
2383
+ "rstrip": false,
2384
+ "single_word": false,
2385
+ "special": false
2386
+ },
2387
+ "21421": {
2388
+ "content": "髙",
2389
+ "lstrip": false,
2390
+ "normalized": true,
2391
+ "rstrip": false,
2392
+ "single_word": false,
2393
+ "special": false
2394
+ },
2395
+ "21422": {
2396
+ "content": "嬋",
2397
+ "lstrip": false,
2398
+ "normalized": true,
2399
+ "rstrip": false,
2400
+ "single_word": false,
2401
+ "special": false
2402
+ },
2403
+ "21423": {
2404
+ "content": "奭",
2405
+ "lstrip": false,
2406
+ "normalized": true,
2407
+ "rstrip": false,
2408
+ "single_word": false,
2409
+ "special": false
2410
+ },
2411
+ "21424": {
2412
+ "content": "鱒",
2413
+ "lstrip": false,
2414
+ "normalized": true,
2415
+ "rstrip": false,
2416
+ "single_word": false,
2417
+ "special": false
2418
+ },
2419
+ "21425": {
2420
+ "content": "蜑",
2421
+ "lstrip": false,
2422
+ "normalized": true,
2423
+ "rstrip": false,
2424
+ "single_word": false,
2425
+ "special": false
2426
+ },
2427
+ "21426": {
2428
+ "content": "塽",
2429
+ "lstrip": false,
2430
+ "normalized": true,
2431
+ "rstrip": false,
2432
+ "single_word": false,
2433
+ "special": false
2434
+ },
2435
+ "21427": {
2436
+ "content": "嚤",
2437
+ "lstrip": false,
2438
+ "normalized": true,
2439
+ "rstrip": false,
2440
+ "single_word": false,
2441
+ "special": false
2442
+ },
2443
+ "21428": {
2444
+ "content": "㩧",
2445
+ "lstrip": false,
2446
+ "normalized": true,
2447
+ "rstrip": false,
2448
+ "single_word": false,
2449
+ "special": false
2450
+ },
2451
+ "21429": {
2452
+ "content": "硃",
2453
+ "lstrip": false,
2454
+ "normalized": true,
2455
+ "rstrip": false,
2456
+ "single_word": false,
2457
+ "special": false
2458
+ },
2459
+ "21430": {
2460
+ "content": "鉈",
2461
+ "lstrip": false,
2462
+ "normalized": true,
2463
+ "rstrip": false,
2464
+ "single_word": false,
2465
+ "special": false
2466
+ },
2467
+ "21431": {
2468
+ "content": "鴒",
2469
+ "lstrip": false,
2470
+ "normalized": true,
2471
+ "rstrip": false,
2472
+ "single_word": false,
2473
+ "special": false
2474
+ },
2475
+ "21432": {
2476
+ "content": "枴",
2477
+ "lstrip": false,
2478
+ "normalized": true,
2479
+ "rstrip": false,
2480
+ "single_word": false,
2481
+ "special": false
2482
+ },
2483
+ "21433": {
2484
+ "content": "鷓",
2485
+ "lstrip": false,
2486
+ "normalized": true,
2487
+ "rstrip": false,
2488
+ "single_word": false,
2489
+ "special": false
2490
+ },
2491
+ "21434": {
2492
+ "content": "魴",
2493
+ "lstrip": false,
2494
+ "normalized": true,
2495
+ "rstrip": false,
2496
+ "single_word": false,
2497
+ "special": false
2498
+ },
2499
+ "21435": {
2500
+ "content": "蠄",
2501
+ "lstrip": false,
2502
+ "normalized": true,
2503
+ "rstrip": false,
2504
+ "single_word": false,
2505
+ "special": false
2506
+ },
2507
+ "21436": {
2508
+ "content": "嶠",
2509
+ "lstrip": false,
2510
+ "normalized": true,
2511
+ "rstrip": false,
2512
+ "single_word": false,
2513
+ "special": false
2514
+ },
2515
+ "21437": {
2516
+ "content": "鋹",
2517
+ "lstrip": false,
2518
+ "normalized": true,
2519
+ "rstrip": false,
2520
+ "single_word": false,
2521
+ "special": false
2522
+ },
2523
+ "21438": {
2524
+ "content": "鶺",
2525
+ "lstrip": false,
2526
+ "normalized": true,
2527
+ "rstrip": false,
2528
+ "single_word": false,
2529
+ "special": false
2530
+ },
2531
+ "21439": {
2532
+ "content": "咇",
2533
+ "lstrip": false,
2534
+ "normalized": true,
2535
+ "rstrip": false,
2536
+ "single_word": false,
2537
+ "special": false
2538
+ },
2539
+ "21440": {
2540
+ "content": "璘",
2541
+ "lstrip": false,
2542
+ "normalized": true,
2543
+ "rstrip": false,
2544
+ "single_word": false,
2545
+ "special": false
2546
+ },
2547
+ "21441": {
2548
+ "content": "鯥",
2549
+ "lstrip": false,
2550
+ "normalized": true,
2551
+ "rstrip": false,
2552
+ "single_word": false,
2553
+ "special": false
2554
+ },
2555
+ "21442": {
2556
+ "content": "攴",
2557
+ "lstrip": false,
2558
+ "normalized": true,
2559
+ "rstrip": false,
2560
+ "single_word": false,
2561
+ "special": false
2562
+ },
2563
+ "21443": {
2564
+ "content": "鼇",
2565
+ "lstrip": false,
2566
+ "normalized": true,
2567
+ "rstrip": false,
2568
+ "single_word": false,
2569
+ "special": false
2570
+ },
2571
+ "21444": {
2572
+ "content": "哣",
2573
+ "lstrip": false,
2574
+ "normalized": true,
2575
+ "rstrip": false,
2576
+ "single_word": false,
2577
+ "special": false
2578
+ },
2579
+ "21445": {
2580
+ "content": "澂",
2581
+ "lstrip": false,
2582
+ "normalized": true,
2583
+ "rstrip": false,
2584
+ "single_word": false,
2585
+ "special": false
2586
+ },
2587
+ "21446": {
2588
+ "content": "摵",
2589
+ "lstrip": false,
2590
+ "normalized": true,
2591
+ "rstrip": false,
2592
+ "single_word": false,
2593
+ "special": false
2594
+ },
2595
+ "21447": {
2596
+ "content": "莨",
2597
+ "lstrip": false,
2598
+ "normalized": true,
2599
+ "rstrip": false,
2600
+ "single_word": false,
2601
+ "special": false
2602
+ },
2603
+ "21448": {
2604
+ "content": "淸",
2605
+ "lstrip": false,
2606
+ "normalized": true,
2607
+ "rstrip": false,
2608
+ "single_word": false,
2609
+ "special": false
2610
+ },
2611
+ "21449": {
2612
+ "content": "逳",
2613
+ "lstrip": false,
2614
+ "normalized": true,
2615
+ "rstrip": false,
2616
+ "single_word": false,
2617
+ "special": false
2618
+ },
2619
+ "21450": {
2620
+ "content": "鄴",
2621
+ "lstrip": false,
2622
+ "normalized": true,
2623
+ "rstrip": false,
2624
+ "single_word": false,
2625
+ "special": false
2626
+ },
2627
+ "21451": {
2628
+ "content": "鰨",
2629
+ "lstrip": false,
2630
+ "normalized": true,
2631
+ "rstrip": false,
2632
+ "single_word": false,
2633
+ "special": false
2634
+ },
2635
+ "21452": {
2636
+ "content": "菢",
2637
+ "lstrip": false,
2638
+ "normalized": true,
2639
+ "rstrip": false,
2640
+ "single_word": false,
2641
+ "special": false
2642
+ },
2643
+ "21453": {
2644
+ "content": "鴟",
2645
+ "lstrip": false,
2646
+ "normalized": true,
2647
+ "rstrip": false,
2648
+ "single_word": false,
2649
+ "special": false
2650
+ },
2651
+ "21454": {
2652
+ "content": "韙",
2653
+ "lstrip": false,
2654
+ "normalized": true,
2655
+ "rstrip": false,
2656
+ "single_word": false,
2657
+ "special": false
2658
+ },
2659
+ "21455": {
2660
+ "content": "鯇",
2661
+ "lstrip": false,
2662
+ "normalized": true,
2663
+ "rstrip": false,
2664
+ "single_word": false,
2665
+ "special": false
2666
+ },
2667
+ "21456": {
2668
+ "content": "駟",
2669
+ "lstrip": false,
2670
+ "normalized": true,
2671
+ "rstrip": false,
2672
+ "single_word": false,
2673
+ "special": false
2674
+ },
2675
+ "21457": {
2676
+ "content": "淰",
2677
+ "lstrip": false,
2678
+ "normalized": true,
2679
+ "rstrip": false,
2680
+ "single_word": false,
2681
+ "special": false
2682
+ },
2683
+ "21458": {
2684
+ "content": "騾",
2685
+ "lstrip": false,
2686
+ "normalized": true,
2687
+ "rstrip": false,
2688
+ "single_word": false,
2689
+ "special": false
2690
+ },
2691
+ "21459": {
2692
+ "content": "鶲",
2693
+ "lstrip": false,
2694
+ "normalized": true,
2695
+ "rstrip": false,
2696
+ "single_word": false,
2697
+ "special": false
2698
+ },
2699
+ "21460": {
2700
+ "content": "瀄",
2701
+ "lstrip": false,
2702
+ "normalized": true,
2703
+ "rstrip": false,
2704
+ "single_word": false,
2705
+ "special": false
2706
+ },
2707
+ "21461": {
2708
+ "content": "鍬",
2709
+ "lstrip": false,
2710
+ "normalized": true,
2711
+ "rstrip": false,
2712
+ "single_word": false,
2713
+ "special": false
2714
+ },
2715
+ "21462": {
2716
+ "content": "龠",
2717
+ "lstrip": false,
2718
+ "normalized": true,
2719
+ "rstrip": false,
2720
+ "single_word": false,
2721
+ "special": false
2722
+ },
2723
+ "21463": {
2724
+ "content": "苴",
2725
+ "lstrip": false,
2726
+ "normalized": true,
2727
+ "rstrip": false,
2728
+ "single_word": false,
2729
+ "special": false
2730
+ },
2731
+ "21464": {
2732
+ "content": "椴",
2733
+ "lstrip": false,
2734
+ "normalized": true,
2735
+ "rstrip": false,
2736
+ "single_word": false,
2737
+ "special": false
2738
+ },
2739
+ "21465": {
2740
+ "content": "蠏",
2741
+ "lstrip": false,
2742
+ "normalized": true,
2743
+ "rstrip": false,
2744
+ "single_word": false,
2745
+ "special": false
2746
+ },
2747
+ "21466": {
2748
+ "content": "闓",
2749
+ "lstrip": false,
2750
+ "normalized": true,
2751
+ "rstrip": false,
2752
+ "single_word": false,
2753
+ "special": false
2754
+ },
2755
+ "21467": {
2756
+ "content": "姖",
2757
+ "lstrip": false,
2758
+ "normalized": true,
2759
+ "rstrip": false,
2760
+ "single_word": false,
2761
+ "special": false
2762
+ },
2763
+ "21468": {
2764
+ "content": "騤",
2765
+ "lstrip": false,
2766
+ "normalized": true,
2767
+ "rstrip": false,
2768
+ "single_word": false,
2769
+ "special": false
2770
+ },
2771
+ "21469": {
2772
+ "content": "鎘",
2773
+ "lstrip": false,
2774
+ "normalized": true,
2775
+ "rstrip": false,
2776
+ "single_word": false,
2777
+ "special": false
2778
+ },
2779
+ "21470": {
2780
+ "content": "竈",
2781
+ "lstrip": false,
2782
+ "normalized": true,
2783
+ "rstrip": false,
2784
+ "single_word": false,
2785
+ "special": false
2786
+ },
2787
+ "21471": {
2788
+ "content": "鍔",
2789
+ "lstrip": false,
2790
+ "normalized": true,
2791
+ "rstrip": false,
2792
+ "single_word": false,
2793
+ "special": false
2794
+ },
2795
+ "21472": {
2796
+ "content": "澌",
2797
+ "lstrip": false,
2798
+ "normalized": true,
2799
+ "rstrip": false,
2800
+ "single_word": false,
2801
+ "special": false
2802
+ },
2803
+ "21473": {
2804
+ "content": "胵",
2805
+ "lstrip": false,
2806
+ "normalized": true,
2807
+ "rstrip": false,
2808
+ "single_word": false,
2809
+ "special": false
2810
+ },
2811
+ "21474": {
2812
+ "content": "癩",
2813
+ "lstrip": false,
2814
+ "normalized": true,
2815
+ "rstrip": false,
2816
+ "single_word": false,
2817
+ "special": false
2818
+ },
2819
+ "21475": {
2820
+ "content": "旼",
2821
+ "lstrip": false,
2822
+ "normalized": true,
2823
+ "rstrip": false,
2824
+ "single_word": false,
2825
+ "special": false
2826
+ },
2827
+ "21476": {
2828
+ "content": "鮫",
2829
+ "lstrip": false,
2830
+ "normalized": true,
2831
+ "rstrip": false,
2832
+ "single_word": false,
2833
+ "special": false
2834
+ },
2835
+ "21477": {
2836
+ "content": "棖",
2837
+ "lstrip": false,
2838
+ "normalized": true,
2839
+ "rstrip": false,
2840
+ "single_word": false,
2841
+ "special": false
2842
+ },
2843
+ "21478": {
2844
+ "content": "蘄",
2845
+ "lstrip": false,
2846
+ "normalized": true,
2847
+ "rstrip": false,
2848
+ "single_word": false,
2849
+ "special": false
2850
+ },
2851
+ "21479": {
2852
+ "content": "駢",
2853
+ "lstrip": false,
2854
+ "normalized": true,
2855
+ "rstrip": false,
2856
+ "single_word": false,
2857
+ "special": false
2858
+ },
2859
+ "21480": {
2860
+ "content": "閭",
2861
+ "lstrip": false,
2862
+ "normalized": true,
2863
+ "rstrip": false,
2864
+ "single_word": false,
2865
+ "special": false
2866
+ },
2867
+ "21481": {
2868
+ "content": "鏐",
2869
+ "lstrip": false,
2870
+ "normalized": true,
2871
+ "rstrip": false,
2872
+ "single_word": false,
2873
+ "special": false
2874
+ },
2875
+ "21482": {
2876
+ "content": "鈷",
2877
+ "lstrip": false,
2878
+ "normalized": true,
2879
+ "rstrip": false,
2880
+ "single_word": false,
2881
+ "special": false
2882
+ },
2883
+ "21483": {
2884
+ "content": "罘",
2885
+ "lstrip": false,
2886
+ "normalized": true,
2887
+ "rstrip": false,
2888
+ "single_word": false,
2889
+ "special": false
2890
+ },
2891
+ "21484": {
2892
+ "content": "厓",
2893
+ "lstrip": false,
2894
+ "normalized": true,
2895
+ "rstrip": false,
2896
+ "single_word": false,
2897
+ "special": false
2898
+ },
2899
+ "21485": {
2900
+ "content": "玗",
2901
+ "lstrip": false,
2902
+ "normalized": true,
2903
+ "rstrip": false,
2904
+ "single_word": false,
2905
+ "special": false
2906
+ },
2907
+ "21486": {
2908
+ "content": "鎵",
2909
+ "lstrip": false,
2910
+ "normalized": true,
2911
+ "rstrip": false,
2912
+ "single_word": false,
2913
+ "special": false
2914
+ },
2915
+ "21487": {
2916
+ "content": "鋯",
2917
+ "lstrip": false,
2918
+ "normalized": true,
2919
+ "rstrip": false,
2920
+ "single_word": false,
2921
+ "special": false
2922
+ },
2923
+ "21488": {
2924
+ "content": "擗",
2925
+ "lstrip": false,
2926
+ "normalized": true,
2927
+ "rstrip": false,
2928
+ "single_word": false,
2929
+ "special": false
2930
+ },
2931
+ "21489": {
2932
+ "content": "銲",
2933
+ "lstrip": false,
2934
+ "normalized": true,
2935
+ "rstrip": false,
2936
+ "single_word": false,
2937
+ "special": false
2938
+ },
2939
+ "21490": {
2940
+ "content": "禕",
2941
+ "lstrip": false,
2942
+ "normalized": true,
2943
+ "rstrip": false,
2944
+ "single_word": false,
2945
+ "special": false
2946
+ },
2947
+ "21491": {
2948
+ "content": "繑",
2949
+ "lstrip": false,
2950
+ "normalized": true,
2951
+ "rstrip": false,
2952
+ "single_word": false,
2953
+ "special": false
2954
+ },
2955
+ "21492": {
2956
+ "content": "䁪",
2957
+ "lstrip": false,
2958
+ "normalized": true,
2959
+ "rstrip": false,
2960
+ "single_word": false,
2961
+ "special": false
2962
+ },
2963
+ "21493": {
2964
+ "content": "婄",
2965
+ "lstrip": false,
2966
+ "normalized": true,
2967
+ "rstrip": false,
2968
+ "single_word": false,
2969
+ "special": false
2970
+ },
2971
+ "21494": {
2972
+ "content": "蟧",
2973
+ "lstrip": false,
2974
+ "normalized": true,
2975
+ "rstrip": false,
2976
+ "single_word": false,
2977
+ "special": false
2978
+ },
2979
+ "21495": {
2980
+ "content": "鈸",
2981
+ "lstrip": false,
2982
+ "normalized": true,
2983
+ "rstrip": false,
2984
+ "single_word": false,
2985
+ "special": false
2986
+ },
2987
+ "21496": {
2988
+ "content": "鉭",
2989
+ "lstrip": false,
2990
+ "normalized": true,
2991
+ "rstrip": false,
2992
+ "single_word": false,
2993
+ "special": false
2994
+ },
2995
+ "21497": {
2996
+ "content": "闞",
2997
+ "lstrip": false,
2998
+ "normalized": true,
2999
+ "rstrip": false,
3000
+ "single_word": false,
3001
+ "special": false
3002
+ },
3003
+ "21498": {
3004
+ "content": "鄕",
3005
+ "lstrip": false,
3006
+ "normalized": true,
3007
+ "rstrip": false,
3008
+ "single_word": false,
3009
+ "special": false
3010
+ },
3011
+ "21499": {
3012
+ "content": "怐",
3013
+ "lstrip": false,
3014
+ "normalized": true,
3015
+ "rstrip": false,
3016
+ "single_word": false,
3017
+ "special": false
3018
+ },
3019
+ "21500": {
3020
+ "content": "菫",
3021
+ "lstrip": false,
3022
+ "normalized": true,
3023
+ "rstrip": false,
3024
+ "single_word": false,
3025
+ "special": false
3026
+ },
3027
+ "21501": {
3028
+ "content": "韮",
3029
+ "lstrip": false,
3030
+ "normalized": true,
3031
+ "rstrip": false,
3032
+ "single_word": false,
3033
+ "special": false
3034
+ },
3035
+ "21502": {
3036
+ "content": "鱟",
3037
+ "lstrip": false,
3038
+ "normalized": true,
3039
+ "rstrip": false,
3040
+ "single_word": false,
3041
+ "special": false
3042
+ },
3043
+ "21503": {
3044
+ "content": "掅",
3045
+ "lstrip": false,
3046
+ "normalized": true,
3047
+ "rstrip": false,
3048
+ "single_word": false,
3049
+ "special": false
3050
+ },
3051
+ "21504": {
3052
+ "content": "朊",
3053
+ "lstrip": false,
3054
+ "normalized": true,
3055
+ "rstrip": false,
3056
+ "single_word": false,
3057
+ "special": false
3058
+ },
3059
+ "21505": {
3060
+ "content": "鵪",
3061
+ "lstrip": false,
3062
+ "normalized": true,
3063
+ "rstrip": false,
3064
+ "single_word": false,
3065
+ "special": false
3066
+ },
3067
+ "21506": {
3068
+ "content": "氘",
3069
+ "lstrip": false,
3070
+ "normalized": true,
3071
+ "rstrip": false,
3072
+ "single_word": false,
3073
+ "special": false
3074
+ },
3075
+ "21507": {
3076
+ "content": "蜞",
3077
+ "lstrip": false,
3078
+ "normalized": true,
3079
+ "rstrip": false,
3080
+ "single_word": false,
3081
+ "special": false
3082
+ },
3083
+ "21508": {
3084
+ "content": "篋",
3085
+ "lstrip": false,
3086
+ "normalized": true,
3087
+ "rstrip": false,
3088
+ "single_word": false,
3089
+ "special": false
3090
+ },
3091
+ "21509": {
3092
+ "content": "湉",
3093
+ "lstrip": false,
3094
+ "normalized": true,
3095
+ "rstrip": false,
3096
+ "single_word": false,
3097
+ "special": false
3098
+ },
3099
+ "21510": {
3100
+ "content": "鏇",
3101
+ "lstrip": false,
3102
+ "normalized": true,
3103
+ "rstrip": false,
3104
+ "single_word": false,
3105
+ "special": false
3106
+ },
3107
+ "21511": {
3108
+ "content": "鈁",
3109
+ "lstrip": false,
3110
+ "normalized": true,
3111
+ "rstrip": false,
3112
+ "single_word": false,
3113
+ "special": false
3114
+ },
3115
+ "21512": {
3116
+ "content": "淝",
3117
+ "lstrip": false,
3118
+ "normalized": true,
3119
+ "rstrip": false,
3120
+ "single_word": false,
3121
+ "special": false
3122
+ },
3123
+ "21513": {
3124
+ "content": "搾",
3125
+ "lstrip": false,
3126
+ "normalized": true,
3127
+ "rstrip": false,
3128
+ "single_word": false,
3129
+ "special": false
3130
+ },
3131
+ "21514": {
3132
+ "content": "壙",
3133
+ "lstrip": false,
3134
+ "normalized": true,
3135
+ "rstrip": false,
3136
+ "single_word": false,
3137
+ "special": false
3138
+ },
3139
+ "21515": {
3140
+ "content": "縉",
3141
+ "lstrip": false,
3142
+ "normalized": true,
3143
+ "rstrip": false,
3144
+ "single_word": false,
3145
+ "special": false
3146
+ },
3147
+ "21516": {
3148
+ "content": "璠",
3149
+ "lstrip": false,
3150
+ "normalized": true,
3151
+ "rstrip": false,
3152
+ "single_word": false,
3153
+ "special": false
3154
+ },
3155
+ "21517": {
3156
+ "content": "氂",
3157
+ "lstrip": false,
3158
+ "normalized": true,
3159
+ "rstrip": false,
3160
+ "single_word": false,
3161
+ "special": false
3162
+ },
3163
+ "21518": {
3164
+ "content": "犛",
3165
+ "lstrip": false,
3166
+ "normalized": true,
3167
+ "rstrip": false,
3168
+ "single_word": false,
3169
+ "special": false
3170
+ },
3171
+ "21519": {
3172
+ "content": "蒴",
3173
+ "lstrip": false,
3174
+ "normalized": true,
3175
+ "rstrip": false,
3176
+ "single_word": false,
3177
+ "special": false
3178
+ },
3179
+ "21520": {
3180
+ "content": "愨",
3181
+ "lstrip": false,
3182
+ "normalized": true,
3183
+ "rstrip": false,
3184
+ "single_word": false,
3185
+ "special": false
3186
+ },
3187
+ "21521": {
3188
+ "content": "豸",
3189
+ "lstrip": false,
3190
+ "normalized": true,
3191
+ "rstrip": false,
3192
+ "single_word": false,
3193
+ "special": false
3194
+ },
3195
+ "21522": {
3196
+ "content": "掯",
3197
+ "lstrip": false,
3198
+ "normalized": true,
3199
+ "rstrip": false,
3200
+ "single_word": false,
3201
+ "special": false
3202
+ },
3203
+ "21523": {
3204
+ "content": "扠",
3205
+ "lstrip": false,
3206
+ "normalized": true,
3207
+ "rstrip": false,
3208
+ "single_word": false,
3209
+ "special": false
3210
+ },
3211
+ "21524": {
3212
+ "content": "顓",
3213
+ "lstrip": false,
3214
+ "normalized": true,
3215
+ "rstrip": false,
3216
+ "single_word": false,
3217
+ "special": false
3218
+ },
3219
+ "21525": {
3220
+ "content": "啋",
3221
+ "lstrip": false,
3222
+ "normalized": true,
3223
+ "rstrip": false,
3224
+ "single_word": false,
3225
+ "special": false
3226
+ },
3227
+ "21526": {
3228
+ "content": "閆",
3229
+ "lstrip": false,
3230
+ "normalized": true,
3231
+ "rstrip": false,
3232
+ "single_word": false,
3233
+ "special": false
3234
+ },
3235
+ "21527": {
3236
+ "content": "扻",
3237
+ "lstrip": false,
3238
+ "normalized": true,
3239
+ "rstrip": false,
3240
+ "single_word": false,
3241
+ "special": false
3242
+ },
3243
+ "21528": {
3244
+ "content": "疋",
3245
+ "lstrip": false,
3246
+ "normalized": true,
3247
+ "rstrip": false,
3248
+ "single_word": false,
3249
+ "special": false
3250
+ },
3251
+ "21529": {
3252
+ "content": "釹",
3253
+ "lstrip": false,
3254
+ "normalized": true,
3255
+ "rstrip": false,
3256
+ "single_word": false,
3257
+ "special": false
3258
+ },
3259
+ "21530": {
3260
+ "content": "㓟",
3261
+ "lstrip": false,
3262
+ "normalized": true,
3263
+ "rstrip": false,
3264
+ "single_word": false,
3265
+ "special": false
3266
+ },
3267
+ "21531": {
3268
+ "content": "潯",
3269
+ "lstrip": false,
3270
+ "normalized": true,
3271
+ "rstrip": false,
3272
+ "single_word": false,
3273
+ "special": false
3274
+ },
3275
+ "21532": {
3276
+ "content": "鐙",
3277
+ "lstrip": false,
3278
+ "normalized": true,
3279
+ "rstrip": false,
3280
+ "single_word": false,
3281
+ "special": false
3282
+ },
3283
+ "21533": {
3284
+ "content": "㞘",
3285
+ "lstrip": false,
3286
+ "normalized": true,
3287
+ "rstrip": false,
3288
+ "single_word": false,
3289
+ "special": false
3290
+ },
3291
+ "21534": {
3292
+ "content": "鈰",
3293
+ "lstrip": false,
3294
+ "normalized": true,
3295
+ "rstrip": false,
3296
+ "single_word": false,
3297
+ "special": false
3298
+ },
3299
+ "21535": {
3300
+ "content": "瓚",
3301
+ "lstrip": false,
3302
+ "normalized": true,
3303
+ "rstrip": false,
3304
+ "single_word": false,
3305
+ "special": false
3306
+ },
3307
+ "21536": {
3308
+ "content": "嚜",
3309
+ "lstrip": false,
3310
+ "normalized": true,
3311
+ "rstrip": false,
3312
+ "single_word": false,
3313
+ "special": false
3314
+ },
3315
+ "21537": {
3316
+ "content": "埐",
3317
+ "lstrip": false,
3318
+ "normalized": true,
3319
+ "rstrip": false,
3320
+ "single_word": false,
3321
+ "special": false
3322
+ },
3323
+ "21538": {
3324
+ "content": "驤",
3325
+ "lstrip": false,
3326
+ "normalized": true,
3327
+ "rstrip": false,
3328
+ "single_word": false,
3329
+ "special": false
3330
+ },
3331
+ "21539": {
3332
+ "content": "牘",
3333
+ "lstrip": false,
3334
+ "normalized": true,
3335
+ "rstrip": false,
3336
+ "single_word": false,
3337
+ "special": false
3338
+ },
3339
+ "21540": {
3340
+ "content": "睚",
3341
+ "lstrip": false,
3342
+ "normalized": true,
3343
+ "rstrip": false,
3344
+ "single_word": false,
3345
+ "special": false
3346
+ },
3347
+ "21541": {
3348
+ "content": "繯",
3349
+ "lstrip": false,
3350
+ "normalized": true,
3351
+ "rstrip": false,
3352
+ "single_word": false,
3353
+ "special": false
3354
+ },
3355
+ "21542": {
3356
+ "content": "岜",
3357
+ "lstrip": false,
3358
+ "normalized": true,
3359
+ "rstrip": false,
3360
+ "single_word": false,
3361
+ "special": false
3362
+ },
3363
+ "21543": {
3364
+ "content": "蛉",
3365
+ "lstrip": false,
3366
+ "normalized": true,
3367
+ "rstrip": false,
3368
+ "single_word": false,
3369
+ "special": false
3370
+ },
3371
+ "21544": {
3372
+ "content": "桴",
3373
+ "lstrip": false,
3374
+ "normalized": true,
3375
+ "rstrip": false,
3376
+ "single_word": false,
3377
+ "special": false
3378
+ },
3379
+ "21545": {
3380
+ "content": "惲",
3381
+ "lstrip": false,
3382
+ "normalized": true,
3383
+ "rstrip": false,
3384
+ "single_word": false,
3385
+ "special": false
3386
+ },
3387
+ "21546": {
3388
+ "content": "橈",
3389
+ "lstrip": false,
3390
+ "normalized": true,
3391
+ "rstrip": false,
3392
+ "single_word": false,
3393
+ "special": false
3394
+ },
3395
+ "21547": {
3396
+ "content": "轤",
3397
+ "lstrip": false,
3398
+ "normalized": true,
3399
+ "rstrip": false,
3400
+ "single_word": false,
3401
+ "special": false
3402
+ },
3403
+ "21548": {
3404
+ "content": "鋨",
3405
+ "lstrip": false,
3406
+ "normalized": true,
3407
+ "rstrip": false,
3408
+ "single_word": false,
3409
+ "special": false
3410
+ },
3411
+ "21549": {
3412
+ "content": "糶",
3413
+ "lstrip": false,
3414
+ "normalized": true,
3415
+ "rstrip": false,
3416
+ "single_word": false,
3417
+ "special": false
3418
+ },
3419
+ "21550": {
3420
+ "content": "鍼",
3421
+ "lstrip": false,
3422
+ "normalized": true,
3423
+ "rstrip": false,
3424
+ "single_word": false,
3425
+ "special": false
3426
+ },
3427
+ "21551": {
3428
+ "content": "棯",
3429
+ "lstrip": false,
3430
+ "normalized": true,
3431
+ "rstrip": false,
3432
+ "single_word": false,
3433
+ "special": false
3434
+ },
3435
+ "21552": {
3436
+ "content": "琿",
3437
+ "lstrip": false,
3438
+ "normalized": true,
3439
+ "rstrip": false,
3440
+ "single_word": false,
3441
+ "special": false
3442
+ },
3443
+ "21553": {
3444
+ "content": "鯷",
3445
+ "lstrip": false,
3446
+ "normalized": true,
3447
+ "rstrip": false,
3448
+ "single_word": false,
3449
+ "special": false
3450
+ },
3451
+ "21554": {
3452
+ "content": "鈮",
3453
+ "lstrip": false,
3454
+ "normalized": true,
3455
+ "rstrip": false,
3456
+ "single_word": false,
3457
+ "special": false
3458
+ },
3459
+ "21555": {
3460
+ "content": "僞",
3461
+ "lstrip": false,
3462
+ "normalized": true,
3463
+ "rstrip": false,
3464
+ "single_word": false,
3465
+ "special": false
3466
+ },
3467
+ "21556": {
3468
+ "content": "贇",
3469
+ "lstrip": false,
3470
+ "normalized": true,
3471
+ "rstrip": false,
3472
+ "single_word": false,
3473
+ "special": false
3474
+ },
3475
+ "21557": {
3476
+ "content": "鈿",
3477
+ "lstrip": false,
3478
+ "normalized": true,
3479
+ "rstrip": false,
3480
+ "single_word": false,
3481
+ "special": false
3482
+ },
3483
+ "21558": {
3484
+ "content": "甑",
3485
+ "lstrip": false,
3486
+ "normalized": true,
3487
+ "rstrip": false,
3488
+ "single_word": false,
3489
+ "special": false
3490
+ },
3491
+ "21559": {
3492
+ "content": "夀",
3493
+ "lstrip": false,
3494
+ "normalized": true,
3495
+ "rstrip": false,
3496
+ "single_word": false,
3497
+ "special": false
3498
+ },
3499
+ "21560": {
3500
+ "content": "釤",
3501
+ "lstrip": false,
3502
+ "normalized": true,
3503
+ "rstrip": false,
3504
+ "single_word": false,
3505
+ "special": false
3506
+ },
3507
+ "21561": {
3508
+ "content": "摑",
3509
+ "lstrip": false,
3510
+ "normalized": true,
3511
+ "rstrip": false,
3512
+ "single_word": false,
3513
+ "special": false
3514
+ },
3515
+ "21562": {
3516
+ "content": "瑭",
3517
+ "lstrip": false,
3518
+ "normalized": true,
3519
+ "rstrip": false,
3520
+ "single_word": false,
3521
+ "special": false
3522
+ },
3523
+ "21563": {
3524
+ "content": "蘅",
3525
+ "lstrip": false,
3526
+ "normalized": true,
3527
+ "rstrip": false,
3528
+ "single_word": false,
3529
+ "special": false
3530
+ },
3531
+ "21564": {
3532
+ "content": "鵯",
3533
+ "lstrip": false,
3534
+ "normalized": true,
3535
+ "rstrip": false,
3536
+ "single_word": false,
3537
+ "special": false
3538
+ },
3539
+ "21565": {
3540
+ "content": "珓",
3541
+ "lstrip": false,
3542
+ "normalized": true,
3543
+ "rstrip": false,
3544
+ "single_word": false,
3545
+ "special": false
3546
+ },
3547
+ "21566": {
3548
+ "content": "琤",
3549
+ "lstrip": false,
3550
+ "normalized": true,
3551
+ "rstrip": false,
3552
+ "single_word": false,
3553
+ "special": false
3554
+ },
3555
+ "21567": {
3556
+ "content": "骱",
3557
+ "lstrip": false,
3558
+ "normalized": true,
3559
+ "rstrip": false,
3560
+ "single_word": false,
3561
+ "special": false
3562
+ },
3563
+ "21568": {
3564
+ "content": "鳧",
3565
+ "lstrip": false,
3566
+ "normalized": true,
3567
+ "rstrip": false,
3568
+ "single_word": false,
3569
+ "special": false
3570
+ },
3571
+ "21569": {
3572
+ "content": "炩",
3573
+ "lstrip": false,
3574
+ "normalized": true,
3575
+ "rstrip": false,
3576
+ "single_word": false,
3577
+ "special": false
3578
+ },
3579
+ "21570": {
3580
+ "content": "薾",
3581
+ "lstrip": false,
3582
+ "normalized": true,
3583
+ "rstrip": false,
3584
+ "single_word": false,
3585
+ "special": false
3586
+ },
3587
+ "21571": {
3588
+ "content": "㨃",
3589
+ "lstrip": false,
3590
+ "normalized": true,
3591
+ "rstrip": false,
3592
+ "single_word": false,
3593
+ "special": false
3594
+ },
3595
+ "21572": {
3596
+ "content": "錕",
3597
+ "lstrip": false,
3598
+ "normalized": true,
3599
+ "rstrip": false,
3600
+ "single_word": false,
3601
+ "special": false
3602
+ },
3603
+ "21573": {
3604
+ "content": "懽",
3605
+ "lstrip": false,
3606
+ "normalized": true,
3607
+ "rstrip": false,
3608
+ "single_word": false,
3609
+ "special": false
3610
+ },
3611
+ "21574": {
3612
+ "content": "鑪",
3613
+ "lstrip": false,
3614
+ "normalized": true,
3615
+ "rstrip": false,
3616
+ "single_word": false,
3617
+ "special": false
3618
+ },
3619
+ "21575": {
3620
+ "content": "颮",
3621
+ "lstrip": false,
3622
+ "normalized": true,
3623
+ "rstrip": false,
3624
+ "single_word": false,
3625
+ "special": false
3626
+ },
3627
+ "21576": {
3628
+ "content": "殻",
3629
+ "lstrip": false,
3630
+ "normalized": true,
3631
+ "rstrip": false,
3632
+ "single_word": false,
3633
+ "special": false
3634
+ },
3635
+ "21577": {
3636
+ "content": "睄",
3637
+ "lstrip": false,
3638
+ "normalized": true,
3639
+ "rstrip": false,
3640
+ "single_word": false,
3641
+ "special": false
3642
+ },
3643
+ "21578": {
3644
+ "content": "岋",
3645
+ "lstrip": false,
3646
+ "normalized": true,
3647
+ "rstrip": false,
3648
+ "single_word": false,
3649
+ "special": false
3650
+ },
3651
+ "21579": {
3652
+ "content": "漖",
3653
+ "lstrip": false,
3654
+ "normalized": true,
3655
+ "rstrip": false,
3656
+ "single_word": false,
3657
+ "special": false
3658
+ },
3659
+ "21580": {
3660
+ "content": "咃",
3661
+ "lstrip": false,
3662
+ "normalized": true,
3663
+ "rstrip": false,
3664
+ "single_word": false,
3665
+ "special": false
3666
+ },
3667
+ "21581": {
3668
+ "content": "嚫",
3669
+ "lstrip": false,
3670
+ "normalized": true,
3671
+ "rstrip": false,
3672
+ "single_word": false,
3673
+ "special": false
3674
+ },
3675
+ "21582": {
3676
+ "content": "亶",
3677
+ "lstrip": false,
3678
+ "normalized": true,
3679
+ "rstrip": false,
3680
+ "single_word": false,
3681
+ "special": false
3682
+ },
3683
+ "21583": {
3684
+ "content": "瀡",
3685
+ "lstrip": false,
3686
+ "normalized": true,
3687
+ "rstrip": false,
3688
+ "single_word": false,
3689
+ "special": false
3690
+ },
3691
+ "21584": {
3692
+ "content": "僊",
3693
+ "lstrip": false,
3694
+ "normalized": true,
3695
+ "rstrip": false,
3696
+ "single_word": false,
3697
+ "special": false
3698
+ },
3699
+ "21585": {
3700
+ "content": "睺",
3701
+ "lstrip": false,
3702
+ "normalized": true,
3703
+ "rstrip": false,
3704
+ "single_word": false,
3705
+ "special": false
3706
+ },
3707
+ "21586": {
3708
+ "content": "鈹",
3709
+ "lstrip": false,
3710
+ "normalized": true,
3711
+ "rstrip": false,
3712
+ "single_word": false,
3713
+ "special": false
3714
+ },
3715
+ "21587": {
3716
+ "content": "摼",
3717
+ "lstrip": false,
3718
+ "normalized": true,
3719
+ "rstrip": false,
3720
+ "single_word": false,
3721
+ "special": false
3722
+ },
3723
+ "21588": {
3724
+ "content": "釩",
3725
+ "lstrip": false,
3726
+ "normalized": true,
3727
+ "rstrip": false,
3728
+ "single_word": false,
3729
+ "special": false
3730
+ },
3731
+ "21589": {
3732
+ "content": "鑌",
3733
+ "lstrip": false,
3734
+ "normalized": true,
3735
+ "rstrip": false,
3736
+ "single_word": false,
3737
+ "special": false
3738
+ },
3739
+ "21590": {
3740
+ "content": "鈧",
3741
+ "lstrip": false,
3742
+ "normalized": true,
3743
+ "rstrip": false,
3744
+ "single_word": false,
3745
+ "special": false
3746
+ },
3747
+ "21591": {
3748
+ "content": "鉋",
3749
+ "lstrip": false,
3750
+ "normalized": true,
3751
+ "rstrip": false,
3752
+ "single_word": false,
3753
+ "special": false
3754
+ },
3755
+ "21592": {
3756
+ "content": "澠",
3757
+ "lstrip": false,
3758
+ "normalized": true,
3759
+ "rstrip": false,
3760
+ "single_word": false,
3761
+ "special": false
3762
+ },
3763
+ "21593": {
3764
+ "content": "昃",
3765
+ "lstrip": false,
3766
+ "normalized": true,
3767
+ "rstrip": false,
3768
+ "single_word": false,
3769
+ "special": false
3770
+ },
3771
+ "21594": {
3772
+ "content": "冑",
3773
+ "lstrip": false,
3774
+ "normalized": true,
3775
+ "rstrip": false,
3776
+ "single_word": false,
3777
+ "special": false
3778
+ },
3779
+ "21595": {
3780
+ "content": "趲",
3781
+ "lstrip": false,
3782
+ "normalized": true,
3783
+ "rstrip": false,
3784
+ "single_word": false,
3785
+ "special": false
3786
+ },
3787
+ "21596": {
3788
+ "content": "毬",
3789
+ "lstrip": false,
3790
+ "normalized": true,
3791
+ "rstrip": false,
3792
+ "single_word": false,
3793
+ "special": false
3794
+ },
3795
+ "21597": {
3796
+ "content": "鏵",
3797
+ "lstrip": false,
3798
+ "normalized": true,
3799
+ "rstrip": false,
3800
+ "single_word": false,
3801
+ "special": false
3802
+ },
3803
+ "21598": {
3804
+ "content": "鸏",
3805
+ "lstrip": false,
3806
+ "normalized": true,
3807
+ "rstrip": false,
3808
+ "single_word": false,
3809
+ "special": false
3810
+ },
3811
+ "21599": {
3812
+ "content": "坼",
3813
+ "lstrip": false,
3814
+ "normalized": true,
3815
+ "rstrip": false,
3816
+ "single_word": false,
3817
+ "special": false
3818
+ },
3819
+ "21600": {
3820
+ "content": "鞮",
3821
+ "lstrip": false,
3822
+ "normalized": true,
3823
+ "rstrip": false,
3824
+ "single_word": false,
3825
+ "special": false
3826
+ },
3827
+ "21601": {
3828
+ "content": "硏",
3829
+ "lstrip": false,
3830
+ "normalized": true,
3831
+ "rstrip": false,
3832
+ "single_word": false,
3833
+ "special": false
3834
+ },
3835
+ "21602": {
3836
+ "content": "輦",
3837
+ "lstrip": false,
3838
+ "normalized": true,
3839
+ "rstrip": false,
3840
+ "single_word": false,
3841
+ "special": false
3842
+ },
3843
+ "21603": {
3844
+ "content": "廩",
3845
+ "lstrip": false,
3846
+ "normalized": true,
3847
+ "rstrip": false,
3848
+ "single_word": false,
3849
+ "special": false
3850
+ },
3851
+ "21604": {
3852
+ "content": "昪",
3853
+ "lstrip": false,
3854
+ "normalized": true,
3855
+ "rstrip": false,
3856
+ "single_word": false,
3857
+ "special": false
3858
+ },
3859
+ "21605": {
3860
+ "content": "䌫",
3861
+ "lstrip": false,
3862
+ "normalized": true,
3863
+ "rstrip": false,
3864
+ "single_word": false,
3865
+ "special": false
3866
+ },
3867
+ "21606": {
3868
+ "content": "鉎",
3869
+ "lstrip": false,
3870
+ "normalized": true,
3871
+ "rstrip": false,
3872
+ "single_word": false,
3873
+ "special": false
3874
+ },
3875
+ "21607": {
3876
+ "content": "愔",
3877
+ "lstrip": false,
3878
+ "normalized": true,
3879
+ "rstrip": false,
3880
+ "single_word": false,
3881
+ "special": false
3882
+ },
3883
+ "21608": {
3884
+ "content": "隗",
3885
+ "lstrip": false,
3886
+ "normalized": true,
3887
+ "rstrip": false,
3888
+ "single_word": false,
3889
+ "special": false
3890
+ },
3891
+ "21609": {
3892
+ "content": "鸌",
3893
+ "lstrip": false,
3894
+ "normalized": true,
3895
+ "rstrip": false,
3896
+ "single_word": false,
3897
+ "special": false
3898
+ },
3899
+ "21610": {
3900
+ "content": "逑",
3901
+ "lstrip": false,
3902
+ "normalized": true,
3903
+ "rstrip": false,
3904
+ "single_word": false,
3905
+ "special": false
3906
+ },
3907
+ "21611": {
3908
+ "content": "蓀",
3909
+ "lstrip": false,
3910
+ "normalized": true,
3911
+ "rstrip": false,
3912
+ "single_word": false,
3913
+ "special": false
3914
+ },
3915
+ "21612": {
3916
+ "content": "殽",
3917
+ "lstrip": false,
3918
+ "normalized": true,
3919
+ "rstrip": false,
3920
+ "single_word": false,
3921
+ "special": false
3922
+ },
3923
+ "21613": {
3924
+ "content": "礐",
3925
+ "lstrip": false,
3926
+ "normalized": true,
3927
+ "rstrip": false,
3928
+ "single_word": false,
3929
+ "special": false
3930
+ },
3931
+ "21614": {
3932
+ "content": "髁",
3933
+ "lstrip": false,
3934
+ "normalized": true,
3935
+ "rstrip": false,
3936
+ "single_word": false,
3937
+ "special": false
3938
+ },
3939
+ "21615": {
3940
+ "content": "篸",
3941
+ "lstrip": false,
3942
+ "normalized": true,
3943
+ "rstrip": false,
3944
+ "single_word": false,
3945
+ "special": false
3946
+ },
3947
+ "21616": {
3948
+ "content": "旯",
3949
+ "lstrip": false,
3950
+ "normalized": true,
3951
+ "rstrip": false,
3952
+ "single_word": false,
3953
+ "special": false
3954
+ },
3955
+ "21617": {
3956
+ "content": "郟",
3957
+ "lstrip": false,
3958
+ "normalized": true,
3959
+ "rstrip": false,
3960
+ "single_word": false,
3961
+ "special": false
3962
+ },
3963
+ "21618": {
3964
+ "content": "岃",
3965
+ "lstrip": false,
3966
+ "normalized": true,
3967
+ "rstrip": false,
3968
+ "single_word": false,
3969
+ "special": false
3970
+ },
3971
+ "21619": {
3972
+ "content": "椥",
3973
+ "lstrip": false,
3974
+ "normalized": true,
3975
+ "rstrip": false,
3976
+ "single_word": false,
3977
+ "special": false
3978
+ },
3979
+ "21620": {
3980
+ "content": "疎",
3981
+ "lstrip": false,
3982
+ "normalized": true,
3983
+ "rstrip": false,
3984
+ "single_word": false,
3985
+ "special": false
3986
+ },
3987
+ "21621": {
3988
+ "content": "笊",
3989
+ "lstrip": false,
3990
+ "normalized": true,
3991
+ "rstrip": false,
3992
+ "single_word": false,
3993
+ "special": false
3994
+ },
3995
+ "21622": {
3996
+ "content": "丏",
3997
+ "lstrip": false,
3998
+ "normalized": true,
3999
+ "rstrip": false,
4000
+ "single_word": false,
4001
+ "special": false
4002
+ },
4003
+ "21623": {
4004
+ "content": "樘",
4005
+ "lstrip": false,
4006
+ "normalized": true,
4007
+ "rstrip": false,
4008
+ "single_word": false,
4009
+ "special": false
4010
+ },
4011
+ "21624": {
4012
+ "content": "榎",
4013
+ "lstrip": false,
4014
+ "normalized": true,
4015
+ "rstrip": false,
4016
+ "single_word": false,
4017
+ "special": false
4018
+ },
4019
+ "21625": {
4020
+ "content": "巹",
4021
+ "lstrip": false,
4022
+ "normalized": true,
4023
+ "rstrip": false,
4024
+ "single_word": false,
4025
+ "special": false
4026
+ },
4027
+ "21626": {
4028
+ "content": "鼩",
4029
+ "lstrip": false,
4030
+ "normalized": true,
4031
+ "rstrip": false,
4032
+ "single_word": false,
4033
+ "special": false
4034
+ },
4035
+ "21627": {
4036
+ "content": "揳",
4037
+ "lstrip": false,
4038
+ "normalized": true,
4039
+ "rstrip": false,
4040
+ "single_word": false,
4041
+ "special": false
4042
+ }
4043
+ },
4044
+ "additional_special_tokens": [],
4045
+ "clean_up_tokenization_spaces": true,
4046
+ "cls_token": "[CLS]",
4047
+ "do_lower_case": false,
4048
+ "mask_token": "[MASK]",
4049
+ "max_length": 512,
4050
+ "model_max_length": 512,
4051
+ "pad_to_multiple_of": null,
4052
+ "pad_token": "[PAD]",
4053
+ "pad_token_type_id": 0,
4054
+ "padding_side": "right",
4055
+ "sep_token": "[SEP]",
4056
+ "stride": 0,
4057
+ "strip_accents": null,
4058
+ "tokenize_chinese_chars": true,
4059
+ "tokenizer_class": "BertTokenizer",
4060
+ "truncation_side": "right",
4061
+ "truncation_strategy": "longest_first",
4062
+ "unk_token": "[UNK]"
4063
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff