Commit 3d15cf2 (verified), committed by yzimmermann
1 Parent(s): d10440d

Upload tokenizer

Files changed (4)
  1. README.md +199 -0
  2. special_tokens_map.json +9 -0
  3. tokenizer.json +1200 -0
  4. tokenizer_config.json +72 -0
README.md ADDED
@@ -0,0 +1,199 @@
+ ---
+ library_name: transformers
+ tags: []
+ ---
+
+ # Model Card for Model ID
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+
+
+ ## Model Details
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+ This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+
+ - **Developed by:** [More Information Needed]
+ - **Funded by [optional]:** [More Information Needed]
+ - **Shared by [optional]:** [More Information Needed]
+ - **Model type:** [More Information Needed]
+ - **Language(s) (NLP):** [More Information Needed]
+ - **License:** [More Information Needed]
+ - **Finetuned from model [optional]:** [More Information Needed]
+
+ ### Model Sources [optional]
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** [More Information Needed]
+ - **Paper [optional]:** [More Information Needed]
+ - **Demo [optional]:** [More Information Needed]
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ [More Information Needed]
+
+ ### Downstream Use [optional]
+
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+ [More Information Needed]
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ [More Information Needed]
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ [More Information Needed]
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ [More Information Needed]
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ [More Information Needed]
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+ #### Preprocessing [optional]
+
+ [More Information Needed]
+
+
+ #### Training Hyperparameters
+
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+ #### Speeds, Sizes, Times [optional]
+
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+
+ [More Information Needed]
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Testing Data, Factors & Metrics
+
+ #### Testing Data
+
+ <!-- This should link to a Dataset Card if possible. -->
+
+ [More Information Needed]
+
+ #### Factors
+
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+ [More Information Needed]
+
+ #### Metrics
+
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+ [More Information Needed]
+
+ ### Results
+
+ [More Information Needed]
+
+ #### Summary
+
+
+
+ ## Model Examination [optional]
+
+ <!-- Relevant interpretability work for the model goes here -->
+
+ [More Information Needed]
+
+ ## Environmental Impact
+
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+
+ - **Hardware Type:** [More Information Needed]
+ - **Hours used:** [More Information Needed]
+ - **Cloud Provider:** [More Information Needed]
+ - **Compute Region:** [More Information Needed]
+ - **Carbon Emitted:** [More Information Needed]
+
+ ## Technical Specifications [optional]
+
+ ### Model Architecture and Objective
+
+ [More Information Needed]
+
+ ### Compute Infrastructure
+
+ [More Information Needed]
+
+ #### Hardware
+
+ [More Information Needed]
+
+ #### Software
+
+ [More Information Needed]
+
+ ## Citation [optional]
+
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+ **BibTeX:**
+
+ [More Information Needed]
+
+ **APA:**
+
+ [More Information Needed]
+
+ ## Glossary [optional]
+
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+
+ [More Information Needed]
+
+ ## More Information [optional]
+
+ [More Information Needed]
+
+ ## Model Card Authors [optional]
+
+ [More Information Needed]
+
+ ## Model Card Contact
+
+ [More Information Needed]
special_tokens_map.json ADDED
@@ -0,0 +1,9 @@
+ {
+ "bos_token": "[BOS]",
+ "cls_token": "[CLS]",
+ "eos_token": "[EOS]",
+ "mask_token": "[MASK]",
+ "pad_token": "[PAD]",
+ "sep_token": "[SEP]",
+ "unk_token": "[UNK]"
+ }
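
As a rough illustration of how the tokenizer added in this commit would treat a SMILES string, the sketch below approximates its pipeline in plain Python. It assumes the `Split` regex from the `pre_tokenizer` entry of tokenizer.json (behavior `Isolated`), the `[BOS]`/`[EOS]` wrapping from the `TemplateProcessing` post-processor, and a small excerpt of the `WordLevel` vocabulary. The names `split_smiles`, `encode`, and `VOCAB_EXCERPT` are hypothetical and not part of the commit, and `re.split` only approximates what the tokenizers library does; truncation is not modelled.

```python
import re

# Regex copied from the "pre_tokenizer" entry of tokenizer.json (JSON escaping removed).
SMILES_PATTERN = re.compile(
    r"(\[[^\]]+]|Br?|Cl?|N|O|S|P|F|I|b|c|n|o|s|p|\(|\)|\.|=|-|\+|\\|\/|:|~|@|\?|>>?|\*|\$|\%[0-9]{2}|[0-9])"
)

# Small excerpt of the WordLevel vocab from tokenizer.json (assumption: illustration only).
VOCAB_EXCERPT = {
    "[BOS]": 0, "[EOS]": 1, "[PAD]": 2, "[UNK]": 3,
    "C": 7, "=": 8, "(": 9, ")": 10, "O": 11, "N": 12,
    "1": 13, "Cl": 18, "[C@@H]": 21, "Br": 26, "#": 28,
}


def split_smiles(smiles: str) -> list[str]:
    """Approximate the "Isolated" split: each regex match becomes its own piece,
    and any text between matches (for example "#") is kept as a piece too."""
    return [piece for piece in SMILES_PATTERN.split(smiles) if piece]


def encode(smiles: str) -> list[int]:
    """Wrap the pieces in [BOS] ... [EOS] and look them up, falling back to [UNK]."""
    tokens = ["[BOS]", *split_smiles(smiles), "[EOS]"]
    unk_id = VOCAB_EXCERPT["[UNK]"]
    return [VOCAB_EXCERPT.get(token, unk_id) for token in tokens]


if __name__ == "__main__":
    print(split_smiles("CC(=O)N"))
    print(encode("C#N"))
```

Because the pattern sits in a capturing group, `re.split` returns both the matches and the unmatched text between them, which is what keeps `#` as a token even though the regex itself never matches it.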
tokenizer.json ADDED
@@ -0,0 +1,1200 @@
+ {
+ "version": "1.0",
+ "truncation": {
+ "direction": "Left",
+ "max_length": 512,
+ "strategy": "LongestFirst",
+ "stride": 0
+ },
+ "padding": null,
+ "added_tokens": [
+ {
+ "id": 0,
+ "content": "[BOS]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ },
+ {
+ "id": 1,
+ "content": "[EOS]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ },
+ {
+ "id": 2,
+ "content": "[PAD]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ },
+ {
+ "id": 3,
+ "content": "[UNK]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ },
+ {
+ "id": 4,
+ "content": "[MASK]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ },
+ {
+ "id": 5,
+ "content": "[CLS]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ },
+ {
+ "id": 6,
+ "content": "[SEP]",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false,
+ "special": true
+ }
+ ],
+ "normalizer": null,
+ "pre_tokenizer": {
+ "type": "Split",
+ "pattern": {
+ "Regex": "(\\[[^\\]]+]|Br?|Cl?|N|O|S|P|F|I|b|c|n|o|s|p|\\(|\\)|\\.|=|-|\\+|\\\\|\\/|:|~|@|\\?|>>?|\\*|\\$|\\%[0-9]{2}|[0-9])"
+ },
+ "behavior": "Isolated",
+ "invert": false
+ },
+ "post_processor": {
+ "type": "TemplateProcessing",
+ "single": [
+ {
+ "SpecialToken": {
+ "id": "[BOS]",
+ "type_id": 0
+ }
+ },
+ {
+ "Sequence": {
+ "id": "A",
+ "type_id": 0
+ }
+ },
+ {
+ "SpecialToken": {
+ "id": "[EOS]",
+ "type_id": 0
+ }
+ }
+ ],
+ "pair": [
+ {
+ "Sequence": {
+ "id": "A",
+ "type_id": 0
+ }
+ },
+ {
+ "Sequence": {
+ "id": "B",
+ "type_id": 1
+ }
+ }
+ ],
+ "special_tokens": {
+ "[BOS]": {
+ "id": "[BOS]",
+ "ids": [
+ 0
+ ],
+ "tokens": [
+ "[BOS]"
+ ]
+ },
+ "[EOS]": {
+ "id": "[EOS]",
+ "ids": [
+ 1
+ ],
+ "tokens": [
+ "[EOS]"
+ ]
+ }
+ }
+ },
+ "decoder": null,
+ "model": {
+ "type": "WordLevel",
+ "vocab": {
+ "[BOS]": 0,
+ "[EOS]": 1,
+ "[PAD]": 2,
+ "[UNK]": 3,
+ "[MASK]": 4,
+ "[CLS]": 5,
+ "[SEP]": 6,
+ "C": 7,
+ "=": 8,
+ "(": 9,
+ ")": 10,
+ "O": 11,
+ "N": 12,
+ "1": 13,
+ "2": 14,
+ "3": 15,
+ "4": 16,
+ "S": 17,
+ "Cl": 18,
+ "F": 19,
+ "5": 20,
+ "[C@@H]": 21,
+ "[C@H]": 22,
+ "/": 23,
+ "[O-]": 24,
+ "[N+]": 25,
+ "Br": 26,
+ "6": 27,
+ "#": 28,
+ ".": 29,
+ "\\": 30,
+ "7": 31,
+ "[NH+]": 32,
+ "I": 33,
+ "[C@@]": 34,
+ "[C@]": 35,
+ "P": 36,
+ "8": 37,
+ "[Si]": 38,
+ "9": 39,
+ "[NH2+]": 40,
+ "[Cl-]": 41,
+ "[N-]": 42,
+ "[Na+]": 43,
+ "[C-]": 44,
+ "B": 45,
+ "[I-]": 46,
+ "[Br-]": 47,
+ "[NH3+]": 48,
+ "[O+]": 49,
+ "[S-]": 50,
+ "[NH-]": 51,
+ "[O-2]": 52,
+ "[Se]": 53,
+ "[P+]": 54,
+ "[OH+]": 55,
+ "[As]": 56,
+ "[Sn]": 57,
+ "[2H]": 58,
+ "[H+]": 59,
+ "[K+]": 60,
+ "[OH2+]": 61,
+ "[NH2-]": 62,
+ "[Hg]": 63,
+ "[CH-]": 64,
+ "[OH-]": 65,
+ "[S+]": 66,
+ "[Pt+2]": 67,
+ "[Cu+2]": 68,
+ "[CH]": 69,
+ "[NH4+]": 70,
+ "[SH+]": 71,
+ "[W]": 72,
+ "[Zn+2]": 73,
+ "[Ca+2]": 74,
+ "[Cu]": 75,
+ "[B-]": 76,
+ "[Co]": 77,
+ "[Mg+2]": 78,
+ "[Mo]": 79,
+ "[Fe]": 80,
+ "[CH2-]": 81,
+ "[Ni+2]": 82,
+ "[SiH]": 83,
+ "[Fe+2]": 84,
+ "[Pt]": 85,
+ "[Ni]": 86,
+ "[18F]": 87,
+ "[Co+2]": 88,
+ "[F-]": 89,
+ "[C]": 90,
+ "[Cr]": 91,
+ "[Li+]": 92,
+ "[Fe+3]": 93,
+ "[Co+3]": 94,
+ "[Al+3]": 95,
+ "[Te]": 96,
+ "[O]": 97,
+ "[Cr+3]": 98,
+ "[Ti]": 99,
+ "[Ge]": 100,
+ "[11CH3]": 101,
+ "[V]": 102,
+ "[Zn]": 103,
+ "[SH2+]": 104,
+ "[Mn+2]": 105,
+ "[Ba+2]": 106,
+ "[Pd]": 107,
+ "[Mn]": 108,
+ "[Rh]": 109,
+ "[Pt+4]": 110,
+ "[Ti+4]": 111,
+ "[Sb]": 112,
+ "[Pb]": 113,
+ "[Pd+2]": 114,
+ "[CH3-]": 115,
+ "[99Tc]": 116,
+ "[Ag+]": 117,
+ "[Ru]": 118,
+ "[Cu+]": 119,
+ "[H-]": 120,
+ "[S-2]": 121,
+ "[SiH2]": 122,
+ "[125I]": 123,
+ "[11C]": 124,
+ "[P-]": 125,
+ "[N@+]": 126,
+ "[Ac]": 127,
+ "[Hg+2]": 128,
+ "[N@@+]": 129,
+ "[Cd+2]": 130,
+ "[Au]": 131,
+ "[Sr+2]": 132,
+ "[Pb+2]": 133,
+ "[Bi+3]": 134,
+ "[Re]": 135,
+ "[I+]": 136,
+ "[3H]": 137,
+ "[13C]": 138,
+ "[Zn+]": 139,
+ "[Hg+]": 140,
+ "[Sn+4]": 141,
+ "[SiH3]": 142,
+ "[Rh+2]": 143,
+ "[Gd+3]": 144,
+ "[Ru+3]": 145,
+ "[Zr]": 146,
+ "[Ag]": 147,
+ "[Au+]": 148,
+ "[Be+2]": 149,
+ "[SH-]": 150,
+ "[Sb+3]": 151,
+ "[SH3+]": 152,
+ "[U]": 153,
+ "[No]": 154,
+ "[Cs+]": 155,
+ "[Al]": 156,
+ "[La+3]": 157,
+ "[Cd]": 158,
+ "[Zr+4]": 159,
+ "[Rh+3]": 160,
+ "[11CH2]": 161,
+ "[Ce+3]": 162,
+ "[Nb]": 163,
+ "[Ru+2]": 164,
+ "[CH+]": 165,
+ "[As+]": 166,
+ "[Se-2]": 167,
+ "[Nd+3]": 168,
+ "[SnH]": 169,
+ "[Hf]": 170,
+ "[Os]": 171,
+ "[Ir]": 172,
+ "[123I]": 173,
+ "[131I]": 174,
+ "[Eu]": 175,
+ "[SeH]": 176,
+ "[Ga+3]": 177,
+ "[AsH]": 178,
+ "[BH3-]": 179,
+ "[Sn+2]": 180,
+ "[Y+3]": 181,
+ "[1H]": 182,
+ "[C+]": 183,
+ "[Rb+]": 184,
+ "[AlH3]": 185,
+ "[CH2]": 186,
+ "[Si+4]": 187,
+ "[Ta]": 188,
+ "[Mn+3]": 189,
+ "[Mo+2]": 190,
+ "[OH3+]": 191,
+ "[Tl+]": 192,
+ "[Pr+3]": 193,
+ "[In+3]": 194,
+ "[Co+]": 195,
+ "[Au+3]": 196,
+ "[Bi]": 197,
+ "[Ir+3]": 198,
+ "[Y]": 199,
+ "[Eu+3]": 200,
+ "[Pt+]": 201,
+ "[Sm+3]": 202,
+ "[Zr+2]": 203,
+ "[13CH]": 204,
+ "[La]": 205,
+ "[14C]": 206,
+ "[Ce]": 207,
+ "[Yb]": 208,
+ "[Tl]": 209,
+ "[BH-]": 210,
+ "[BH2-]": 211,
+ "[Pr]": 212,
+ "[Tb]": 213,
+ "[U+2]": 214,
+ "[Sm]": 215,
+ "[Cr+2]": 216,
+ "[Er]": 217,
+ "[Nb+5]": 218,
+ "[Dy+3]": 219,
+ "[Ga]": 220,
+ "[Gd]": 221,
+ "[Lu]": 222,
+ "[Ho]": 223,
+ "[11CH]": 224,
+ "[Tb+3]": 225,
+ "[Th]": 226,
+ "[Dy]": 227,
+ "[Lu+3]": 228,
+ "[GeH2]": 229,
+ "[N]": 230,
+ "[Nd]": 231,
+ "[P-3]": 232,
+ "[Ru+4]": 233,
+ "[Er+3]": 234,
+ "[Ho+3]": 235,
+ "[Ru-]": 236,
+ "[Sc+3]": 237,
+ "[SnH2]": 238,
+ "[Tm+3]": 239,
+ "[Tm]": 240,
+ "[V+2]": 241,
+ "[Yb+3]": 242,
+ "[ClH+]": 243,
+ "[Ti+2]": 244,
+ "[PH+]": 245,
+ "[S@]": 246,
+ "[Sc]": 247,
+ "[Se+]": 248,
+ "[Ta+5]": 249,
+ "[99Tc+3]": 250,
+ "[P@]": 251,
+ "[Th+4]": 252,
+ "[Tl+3]": 253,
+ "[Cr+6]": 254,
+ "[GeH]": 255,
+ "[PbH]": 256,
+ "[Pd+]": 257,
+ "%10": 258,
+ "[99Tc+4]": 259,
+ "[AsH2]": 260,
+ "[Os+2]": 261,
+ "[ClH2+]": 262,
+ "[Ni+3]": 263,
+ "[BH4-]": 264,
+ "[Ru+]": 265,
+ "[13NH2]": 266,
+ "[AlH]": 267,
+ "[In]": 268,
+ "[Np]": 269,
+ "[Ru+5]": 270,
+ "[S]": 271,
+ "[Si-]": 272,
+ "[14CH2]": 273,
+ "[32P]": 274,
+ "[Cr+5]": 275,
+ "[Hf+4]": 276,
+ "[Pu]": 277,
+ "[Rh+]": 278,
+ "[GeH3]": 279,
+ "[Pt-2]": 280,
+ "[Sb+]": 281,
+ "[SbH]": 282,
+ "[B+]": 283,
+ "[P@@]": 284,
+ "[S@@]": 285,
+ "[PbH2]": 286,
+ "[Se-]": 287,
+ "[As-]": 288,
+ "[Ca]": 289,
+ "[HH]": 290,
+ "[Li]": 291,
+ "[Mo+4]": 292,
+ "[Ru+8]": 293,
+ "[Sn+]": 294,
+ "[14CH3]": 295,
+ "[17O]": 296,
+ "[Ce+4]": 297,
+ "[Cr+4]": 298,
+ "[Mg]": 299,
+ "[Na]": 300,
+ "[Sb-]": 301,
+ "[Si-2]": 302,
+ "[Ti+]": 303,
+ "[TlH]": 304,
+ "[14CH]": 305,
+ "[Fe-]": 306,
+ "[Os+4]": 307,
+ "[PH2+]": 308,
+ "[Pd-2]": 309,
+ "[Ru+6]": 310,
+ "[Sn-]": 311,
+ "[W+2]": 312,
+ "[Zn-2]": 313,
+ "[Zn-]": 314,
+ "[111In+3]": 315,
+ "[Al-]": 316,
+ "[AlH2]": 317,
+ "[As+5]": 318,
+ "[Au-]": 319,
+ "[Fe+]": 320,
+ "[Lr]": 321,
+ "[Mn+]": 322,
+ "[Mo+3]": 323,
+ "[Tc]": 324,
+ "[Ti+3]": 325,
+ "%11": 326,
+ "[99Tc+5]": 327,
+ "[As+3]": 328,
+ "[B]": 329,
+ "[Be]": 330,
+ "[CH2+]": 331,
+ "[Cm]": 332,
+ "[Es]": 333,
+ "[Eu+2]": 334,
+ "[Hg-2]": 335,
+ "[IH2+]": 336,
+ "[K]": 337,
+ "[NH]": 338,
+ "[PH2]": 339,
+ "[SbH2]": 340,
+ "[SiH4]": 341,
+ "[SnH3]": 342,
+ "[Xe]": 343,
+ "[51Cr]": 344,
+ "[68Ga+3]": 345,
+ "[75Se]": 346,
+ "[76Br]": 347,
+ "[BiH2]": 348,
+ "[Hg-]": 349,
+ "[Rh-]": 350,
+ "[Ru-2]": 351,
+ "[Te+]": 352,
+ "[TeH]": 353,
+ "%12": 354,
+ "[203Hg+2]": 355,
+ "[239Pu]": 356,
+ "[249Cf]": 357,
+ "[35S]": 358,
+ "[37Cl]": 359,
+ "[74Se]": 360,
+ "[Am]": 361,
+ "[Ba]": 362,
+ "[BiH]": 363,
+ "[Bk]": 364,
+ "[Cf]": 365,
+ "[Cs]": 366,
+ "[GaH]": 367,
+ "[He]": 368,
+ "[InH]": 369,
+ "[Pm]": 370,
+ "[Sn+3]": 371,
+ "[Zr+3]": 372,
+ "[14C@H]": 373,
+ "[15N]": 374,
+ "[203Hg]": 375,
+ "[211At]": 376,
+ "[237Np]": 377,
+ "[241Am]": 378,
+ "[Al-2]": 379,
+ "[AlH4-]": 380,
+ "[AsH3]": 381,
+ "[Cu-2]": 382,
+ "[Fe+6]": 383,
+ "[FeH]": 384,
+ "[GaH3]": 385,
+ "[H]": 386,
+ "[Ni+]": 387,
+ "[Ni-]": 388,
+ "[Os+6]": 389,
+ "[Os+7]": 390,
+ "[Os+8]": 391,
+ "[Os+]": 392,
+ "[PH-]": 393,
+ "[PH2-]": 394,
+ "[PbH2+2]": 395,
+ "[PbH4]": 396,
+ "[Re-]": 397,
+ "[Si@]": 398,
+ "[Sr]": 399,
+ "[Tc+7]": 400,
+ "[Te+4]": 401,
+ "[TlH2]": 402,
+ "[V+4]": 403,
+ "[W+]": 404,
+ "%13": 405,
+ "[106Ru]": 406,
+ "[11C@@H]": 407,
+ "[11C@H]": 408,
+ "[11CH3-]": 409,
+ "[122I]": 410,
+ "[123IH]": 411,
+ "[125IH]": 412,
+ "[131I-]": 413,
+ "[137Cs+]": 414,
+ "[13CH2]": 415,
+ "[153Sm+3]": 416,
+ "[159Dy]": 417,
+ "[15NH2]": 418,
+ "[15NH]": 419,
+ "[165Dy]": 420,
+ "[186Re]": 421,
+ "[231Pa]": 422,
+ "[238Pu]": 423,
+ "[239Np]": 424,
+ "[249Bk]": 425,
+ "[253Cf]": 426,
+ "[253Es]": 427,
+ "[33P]": 428,
+ "[45Ca+2]": 429,
+ "[55Fe+3]": 430,
+ "[59Fe+3]": 431,
+ "[59Fe]": 432,
+ "[60Co]": 433,
+ "[65Zn]": 434,
+ "[67Cu]": 435,
+ "[77Br]": 436,
+ "[85Sr+2]": 437,
+ "[89Sr+2]": 438,
+ "[90Sr+2]": 439,
+ "[91Y]": 440,
+ "[AlH2+]": 441,
+ "[BiH3]": 442,
+ "[Cr+]": 443,
+ "[Fe-3]": 444,
+ "[FeH2]": 445,
+ "[GaH2]": 446,
+ "[Ge-2]": 447,
+ "[Hf+2]": 448,
+ "[IH+]": 449,
+ "[InH2]": 450,
+ "[Ir+]": 451,
+ "[Ir-2]": 452,
+ "[Kr]": 453,
+ "[Li-]": 454,
+ "[PH3+]": 455,
+ "[P]": 456,
+ "[Pa]": 457,
+ "[PtH]": 458,
+ "[Rb]": 459,
+ "[Rh-2]": 460,
+ "[Ru-3]": 461,
+ "[RuH]": 462,
+ "[SbH3]": 463,
+ "[Se+4]": 464,
+ "[SeH2+]": 465,
+ "[SeH2]": 466,
+ "[Si@@]": 467,
+ "[Ta-2]": 468,
+ "[Tc+3]": 469,
+ "[Tc+4]": 470,
+ "[Th+2]": 471,
+ "[Zr-]": 472,
+ "[ZrH]": 473,
+ "[100Pd]": 474,
+ "[100Rh]": 475,
+ "[101Mo]": 476,
+ "[101Pd]": 477,
+ "[101Rh]": 478,
+ "[101Tc]": 479,
+ "[102Ag]": 480,
+ "[102Rh]": 481,
+ "[102Ru]": 482,
+ "[103Ag]": 483,
+ "[103Ru+2]": 484,
+ "[103Ru]": 485,
+ "[104Ag]": 486,
+ "[104Cd]": 487,
+ "[104Tc]": 488,
+ "[105Ag]": 489,
+ "[105Pd]": 490,
+ "[105Rh]": 491,
+ "[105Ru]": 492,
+ "[106Ag]": 493,
+ "[106Rh]": 494,
+ "[107Ag]": 495,
+ "[107Cd]": 496,
+ "[107Pd]": 497,
+ "[107Rh]": 498,
+ "[108Ag]": 499,
+ "[108Pd]": 500,
+ "[109Ag]": 501,
+ "[109Cd]": 502,
+ "[109Pd+2]": 503,
+ "[109Pd]": 504,
+ "[10BH3]": 505,
+ "[110Ag+]": 506,
+ "[110Ag]": 507,
+ "[110Ru]": 508,
+ "[111Ag]": 509,
+ "[111InH3]": 510,
+ "[112Ag]": 511,
+ "[113Cd]": 512,
+ "[115Ag]": 513,
+ "[115Cd]": 514,
+ "[117Cd]": 515,
+ "[117SnH2]": 516,
+ "[11C-]": 517,
+ "[11CH4]": 518,
+ "[120IH]": 519,
+ "[121IH]": 520,
+ "[121SnH2]": 521,
+ "[123I-]": 522,
+ "[123Xe]": 523,
+ "[124I-]": 524,
+ "[124IH]": 525,
+ "[124I]": 526,
+ "[125I-]": 527,
+ "[126IH]": 528,
+ "[127Te]": 529,
+ "[127Xe]": 530,
+ "[128IH]": 531,
+ "[129I]": 532,
+ "[130IH]": 533,
+ "[131IH]": 534,
+ "[131La]": 535,
+ "[132IH]": 536,
+ "[132La]": 537,
+ "[133Ba+2]": 538,
+ "[133IH]": 539,
+ "[133Xe]": 540,
+ "[134Ce]": 541,
+ "[134IH]": 542,
+ "[135Ce]": 543,
+ "[135IH]": 544,
+ "[135La]": 545,
+ "[135Xe]": 546,
+ "[136Nd]": 547,
+ "[136Pr]": 548,
+ "[137Ce]": 549,
+ "[137La]": 550,
+ "[137Pr]": 551,
+ "[138La]": 552,
+ "[138Nd]": 553,
+ "[138Pr]": 554,
+ "[139Ce]": 555,
+ "[139Nd]": 556,
+ "[139Pr]": 557,
+ "[13CH4]": 558,
+ "[13NH2-]": 559,
+ "[13NH3]": 560,
+ "[140La]": 561,
+ "[141Ce+3]": 562,
+ "[141Ce]": 563,
+ "[141La]": 564,
+ "[141Nd]": 565,
+ "[141Pm]": 566,
+ "[141Sm]": 567,
+ "[142La]": 568,
+ "[142Pr]": 569,
+ "[142Sm]": 570,
+ "[143Ce]": 571,
+ "[143La]": 572,
+ "[143Pm]": 573,
+ "[143Pr]": 574,
+ "[144Ce]": 575,
+ "[144Pm]": 576,
+ "[144Pr]": 577,
+ "[145Eu]": 578,
+ "[145Gd]": 579,
+ "[145Pm]": 580,
+ "[145Pr]": 581,
+ "[145Sm]": 582,
+ "[146Eu]": 583,
+ "[146Gd]": 584,
+ "[146Nd]": 585,
+ "[146Pm]": 586,
+ "[147Eu]": 587,
+ "[147Gd]": 588,
+ "[147Nd]": 589,
+ "[147Pm]": 590,
+ "[147Pr]": 591,
+ "[147Tb]": 592,
+ "[148Eu]": 593,
+ "[148Pm]": 594,
+ "[149Eu]": 595,
+ "[149Gd]": 596,
+ "[149Nd]": 597,
+ "[149Pm]": 598,
+ "[149Tb]": 599,
+ "[14CH4]": 600,
+ "[150Eu]": 601,
+ "[150Pm]": 602,
+ "[150Sm]": 603,
+ "[150Tb]": 604,
+ "[151Gd]": 605,
+ "[151Nd]": 606,
+ "[151Pm]": 607,
+ "[151Sm]": 608,
+ "[151Tb]": 609,
+ "[152Eu]": 610,
+ "[153Gd]": 611,
+ "[153Sm]": 612,
+ "[153Tb]": 613,
+ "[154Eu]": 614,
+ "[154Sm]": 615,
+ "[154Tb]": 616,
+ "[155Dy]": 617,
+ "[155Eu]": 618,
+ "[155Ho]": 619,
+ "[155Sm]": 620,
+ "[155Tb]": 621,
+ "[156Eu]": 622,
+ "[156Sm]": 623,
+ "[156Tb]": 624,
+ "[157Eu]": 625,
+ "[157Ho]": 626,
+ "[157Tb]": 627,
+ "[158Eu]": 628,
+ "[158Gd]": 629,
+ "[158Tb]": 630,
+ "[159Gd]": 631,
+ "[159Ho]": 632,
+ "[15CH4]": 633,
+ "[15NH3]": 634,
+ "[15OH]": 635,
+ "[160Gd]": 636,
+ "[160Tb]": 637,
+ "[161Er]": 638,
+ "[161Gd]": 639,
+ "[161Ho]": 640,
+ "[161Tb]": 641,
+ "[162Dy]": 642,
+ "[162Ho]": 643,
+ "[162Tm]": 644,
+ "[162Yb]": 645,
+ "[164Ho]": 646,
+ "[165Er]": 647,
+ "[166Dy]": 648,
+ "[166Er]": 649,
+ "[166Ho]": 650,
+ "[166Tm]": 651,
+ "[166Yb]": 652,
+ "[167Ho]": 653,
+ "[167Tm]": 654,
+ "[167Yb]": 655,
+ "[168Er]": 656,
+ "[169Er]": 657,
+ "[169Lu]": 658,
+ "[169Yb]": 659,
+ "[170Er]": 660,
+ "[170Hf]": 661,
+ "[170Lu]": 662,
+ "[170Tm]": 663,
+ "[171Er]": 664,
+ "[171Lu]": 665,
+ "[171Tm]": 666,
+ "[172Er]": 667,
+ "[172Hf]": 668,
+ "[172Lu]": 669,
+ "[172Ta]": 670,
+ "[172Tm]": 671,
+ "[173Hf]": 672,
+ "[173Lu]": 673,
+ "[173Ta]": 674,
+ "[173Tm]": 675,
+ "[174Lu]": 676,
+ "[174Ta]": 677,
+ "[174Yb]": 678,
+ "[175Hf]": 679,
+ "[175Ta]": 680,
+ "[175Tm]": 681,
+ "[175Yb]": 682,
+ "[176Lu]": 683,
+ "[176Ta]": 684,
+ "[176W]": 685,
+ "[176Yb]": 686,
+ "[177Hf]": 687,
+ "[177Lu]": 688,
+ "[177Re]": 689,
+ "[177Ta]": 690,
+ "[177W]": 691,
+ "[177Yb]": 692,
+ "[178Hf]": 693,
+ "[178Lu]": 694,
+ "[178Re]": 695,
+ "[178Ta]": 696,
+ "[178W]": 697,
+ "[178Yb]": 698,
+ "[179Hf]": 699,
+ "[179Lu]": 700,
+ "[179Ta]": 701,
+ "[179W]": 702,
+ "[17NH3]": 703,
+ "[180Hf]": 704,
+ "[180Os]": 705,
+ "[180Ta]": 706,
+ "[181Hf]": 707,
+ "[181Os]": 708,
+ "[181Re]": 709,
+ "[181W]": 710,
+ "[182Hf]": 711,
+ "[182Ir]": 712,
+ "[182Os]": 713,
+ "[182Re]": 714,
+ "[182Ta]": 715,
+ "[183Hf]": 716,
+ "[184Hf]": 717,
+ "[184Ir]": 718,
+ "[184Re]": 719,
+ "[184Ta]": 720,
+ "[185Ir]": 721,
+ "[185Os]": 722,
+ "[185Ta]": 723,
+ "[185W]": 724,
+ "[186Ir]": 725,
+ "[186Pt]": 726,
+ "[186Ta]": 727,
+ "[187Ir]": 728,
+ "[187Re]": 729,
+ "[187W]": 730,
+ "[188Ir]": 731,
+ "[188Pt]": 732,
+ "[188Re]": 733,
+ "[188W]": 734,
+ "[189Ir]": 735,
+ "[189Os]": 736,
+ "[189Pt]": 737,
+ "[189Re]": 738,
+ "[18FH]": 739,
+ "[18OH2]": 740,
+ "[18O]": 741,
+ "[190Ir]": 742,
+ "[191Os]": 743,
+ "[191Pt]": 744,
+ "[192Ir]": 745,
+ "[193Au]": 746,
+ "[193Hg]": 747,
+ "[193Os]": 748,
+ "[193Pt]": 749,
+ "[194Au]": 750,
+ "[194Hg]": 751,
+ "[194Ir]": 752,
+ "[194Os]": 753,
+ "[195Au]": 754,
+ "[195Hg]": 755,
+ "[195Ir]": 756,
+ "[195Pt]": 757,
+ "[197Hg]": 758,
+ "[197Pt]": 759,
+ "[198Au]": 760,
+ "[199Au]": 761,
+ "[199Hg]": 762,
+ "[199Pt]": 763,
+ "[1H-]": 764,
+ "[200Au]": 765,
+ "[200Pt]": 766,
+ "[201Au]": 767,
+ "[201Tl+]": 768,
+ "[210BiH3]": 769,
+ "[210PoH2]": 770,
+ "[212PbH2]": 771,
+ "[218Rn]": 772,
+ "[219Rn]": 773,
+ "[220Rn]": 774,
+ "[222Rn]": 775,
+ "[224Ac]": 776,
+ "[224Rn]": 777,
+ "[225Ac]": 778,
+ "[226Ac]": 779,
+ "[226Rn]": 780,
+ "[226Th]": 781,
+ "[227Ac]": 782,
+ "[227Pa]": 783,
+ "[227Th]": 784,
+ "[228Ac]": 785,
+ "[228Pa]": 786,
+ "[228Rn]": 787,
+ "[228Th]": 788,
+ "[229Th]": 789,
+ "[230Pa]": 790,
+ "[230Pu]": 791,
+ "[230Th]": 792,
+ "[230U]": 793,
+ "[231Th]": 794,
+ "[231U]": 795,
+ "[232Np]": 796,
+ "[232Pa]": 797,
+ "[232U]": 798,
+ "[233Np]": 799,
+ "[233Pa]": 800,
+ "[233U]": 801,
+ "[234Np]": 802,
+ "[234Pa]": 803,
+ "[234Pu]": 804,
+ "[234Th]": 805,
+ "[234U]": 806,
+ "[235Np]": 807,
+ "[235Pu]": 808,
+ "[235U]": 809,
+ "[236Np]": 810,
+ "[236Pu]": 811,
+ "[236U]": 812,
+ "[237Am]": 813,
+ "[237Pu]": 814,
+ "[237U]": 815,
+ "[238Am]": 816,
+ "[238Cm]": 817,
+ "[238Np]": 818,
+ "[238Th]": 819,
+ "[239Am]": 820,
+ "[239U]": 821,
+ "[240Am]": 822,
+ "[240Cm]": 823,
+ "[240Np]": 824,
+ "[240Pu]": 825,
+ "[240U]": 826,
+ "[241Cm]": 827,
+ "[241Pu]": 828,
+ "[242Am]": 829,
+ "[242Cm]": 830,
+ "[242Pu]": 831,
+ "[243Am]": 832,
+ "[243Cm]": 833,
+ "[243Pu]": 834,
+ "[244Am]": 835,
+ "[244Cf]": 836,
+ "[244Cm]": 837,
+ "[244Pu]": 838,
+ "[245Am]": 839,
+ "[245Bk]": 840,
+ "[245Cm]": 841,
+ "[245Pu]": 842,
+ "[246Am]": 843,
+ "[246Bk]": 844,
+ "[246Cf]": 845,
+ "[246Cm]": 846,
+ "[246Pu]": 847,
+ "[247Bk]": 848,
+ "[247Cm]": 849,
+ "[248Am]": 850,
+ "[248Cf]": 851,
+ "[248Cm]": 852,
+ "[249Cm]": 853,
+ "[250Bk]": 854,
+ "[250Cf]": 855,
+ "[250Cm]": 856,
+ "[250Es]": 857,
+ "[251Cf]": 858,
+ "[251Es]": 859,
+ "[252Cf]": 860,
+ "[252Fm]": 861,
+ "[253Fm]": 862,
+ "[254Cf]": 863,
+ "[254Es]": 864,
+ "[254Fm]": 865,
+ "[255Fm]": 866,
+ "[257Fm]": 867,
+ "[257Md]": 868,
+ "[258Md]": 869,
+ "[2HH]": 870,
+ "[32PH3]": 871,
+ "[32SH2]": 872,
+ "[33PH3]": 873,
+ "[34SH2]": 874,
+ "[35S-2]": 875,
+ "[35SH2]": 876,
+ "[36ClH]": 877,
+ "[36SH2]": 878,
+ "[38ClH]": 879,
+ "[39ClH]": 880,
+ "[3HH]": 881,
+ "[40PH3]": 882,
+ "[41Ar]": 883,
+ "[43Sc]": 884,
+ "[44Sc]": 885,
+ "[44Ti]": 886,
+ "[45Ti]": 887,
+ "[46Sc]": 888,
+ "[47Sc]": 889,
+ "[47V]": 890,
+ "[48Cr]": 891,
+ "[48Sc]": 892,
+ "[48V]": 893,
+ "[49Cr]": 894,
+ "[49Sc]": 895,
+ "[49V]": 896,
+ "[51Cr+3]": 897,
+ "[51Cr+6]": 898,
+ "[51Mn]": 899,
+ "[52Mn]": 900,
+ "[53Mn]": 901,
+ "[54Mn]": 902,
+ "[55Co]": 903,
+ "[55Fe]": 904,
+ "[56Co]": 905,
+ "[56Mn]": 906,
+ "[56Ni]": 907,
+ "[57Co+2]": 908,
+ "[57Co]": 909,
+ "[57Fe]": 910,
+ "[57Ni]": 911,
+ "[58Co]": 912,
+ "[58Ni]": 913,
+ "[59Fe+2]": 914,
+ "[59Ni]": 915,
+ "[60Co+2]": 916,
+ "[60Co+3]": 917,
+ "[60Cu]": 918,
+ "[60Fe]": 919,
+ "[61Co]": 920,
+ "[61Cu]": 921,
+ "[61Ni]": 922,
+ "[62Co]": 923,
+ "[62Zn]": 924,
+ "[63Ni]": 925,
+ "[63Zn]": 926,
+ "[64Cu]": 927,
+ "[65Ni]": 928,
+ "[65Zn+2]": 929,
+ "[66Ni]": 930,
+ "[67Ga+3]": 931,
+ "[67GaH3]": 932,
+ "[68GaH3]": 933,
+ "[69Zn]": 934,
+ "[71Zn]": 935,
+ "[72Br-]": 936,
+ "[72BrH]": 937,
+ "[72Zn]": 938,
+ "[74BrH]": 939,
+ "[75BrH]": 940,
+ "[75Br]": 941,
+ "[76BrH]": 942,
+ "[77BrH]": 943,
+ "[79Kr]": 944,
+ "[80BrH]": 945,
+ "[80Br]": 946,
+ "[81Kr]": 947,
+ "[82BrH]": 948,
+ "[82Rb+]": 949,
+ "[83BrH]": 950,
+ "[84BrH]": 951,
+ "[85Kr]": 952,
+ "[86Y]": 953,
+ "[86Zr]": 954,
+ "[87Y]": 955,
+ "[88Nb]": 956,
+ "[88Y]": 957,
+ "[88Zr]": 958,
+ "[89Kr]": 959,
+ "[89Nb]": 960,
+ "[89Zr]": 961,
+ "[90Mo]": 962,
+ "[90Nb]": 963,
+ "[90Y+3]": 964,
+ "[90Y]": 965,
+ "[92Y]": 966,
+ "[93Mo]": 967,
+ "[93Tc]": 968,
+ "[93Y]": 969,
+ "[93Zr]": 970,
+ "[94Nb]": 971,
+ "[94Ru]": 972,
+ "[94Tc]": 973,
+ "[94Y]": 974,
+ "[95Nb]": 975,
+ "[95Y]": 976,
+ "[95Zr]": 977,
+ "[96Nb]": 978,
+ "[96Tc]": 979,
+ "[97Nb]": 980,
+ "[97Ru]": 981,
+ "[97Tc]": 982,
+ "[97Zr]": 983,
+ "[98Nb]": 984,
+ "[98Tc]": 985,
+ "[99Mo]": 986,
+ "[99Rh]": 987,
+ "[99Ru]": 988,
+ "[99Tc+]": 989,
+ "[Ag-]": 990,
+ "[AgH]": 991,
+ "[Al-3]": 992,
+ "[AlH+2]": 993,
+ "[Ar]": 994,
+ "[AtH]": 995,
+ "[At]": 996,
+ "[B@-]": 997,
+ "[BH2]": 998,
+ "[Bi+2]": 999,
+ "[Br]": 1000,
+ "[CH3+]": 1001,
+ "[CH3]": 1002,
+ "[Cd-2]": 1003,
+ "[Cl+3]": 1004,
+ "[CoH2]": 1005,
+ "[Cu-]": 1006,
+ "[CuH]": 1007,
+ "[Fm]": 1008,
+ "[Ga-]": 1009,
+ "[Gd+2]": 1010,
+ "[GeH4]": 1011,
+ "[I]": 1012,
+ "[InH3]": 1013,
+ "[Ir+2]": 1014,
+ "[IrH+2]": 1015,
+ "[Md]": 1016,
+ "[Mn-2]": 1017,
+ "[MoH2]": 1018,
+ "[NH2]": 1019,
+ "[Nb+2]": 1020,
+ "[Ne]": 1021,
+ "[Ni-2]": 1022,
+ "[NiH]": 1023,
+ "[OH]": 1024,
+ "[Os+5]": 1025,
+ "[OsH]": 1026,
+ "[PH4+]": 1027,
+ "[PoH2]": 1028,
+ "[Re-2]": 1029,
+ "[ReH]": 1030,
+ "[Rh-3]": 1031,
+ "[RhH+2]": 1032,
+ "[RhH]": 1033,
+ "[Rn]": 1034,
+ "[Se+6]": 1035,
+ "[Se@@H]": 1036,
+ "[Si+]": 1037,
1183
+ "[Si@@H]": 1038,
1184
+ "[SiH-]": 1039,
1185
+ "[SiH3-]": 1040,
1186
+ "[SnH4]": 1041,
1187
+ "[SrH2]": 1042,
1188
+ "[Ta+2]": 1043,
1189
+ "[TaH2]": 1044,
1190
+ "[Tc+6]": 1045,
1191
+ "[Te-]": 1046,
1192
+ "[TeH2]": 1047,
1193
+ "[TlH3]": 1048,
1194
+ "[WH2]": 1049,
1195
+ "[Yb+2]": 1050,
1196
+ "[ZrH2]": 1051
1197
+ },
1198
+ "unk_token": "[UNK]"
1199
+ }
1200
+ }
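The vocabulary above ends with an `unk_token` entry, which suggests a plain token-to-id lookup with a fallback for unseen tokens. A minimal sketch of that lookup follows, under stated assumptions: the function name `tokens_to_ids` is hypothetical, the four bracket-atom ids are copied from the entries above, and `[UNK]` is assumed to sit at id 3 as listed in the `added_tokens_decoder` of `tokenizer_config.json`. How raw strings are split into tokens is not shown in this part of the diff, so the sketch starts from already-split tokens.

```python
# Illustrative subset of the vocabulary; the bracket-atom ids are copied from the diff.
# Assumption: "[UNK]" has id 3, as listed in tokenizer_config.json.
VOCAB = {
    "[UNK]": 3,
    "[Br]": 1000,
    "[CH3]": 1002,
    "[OH]": 1024,
    "[ZrH2]": 1051,
}
UNK_TOKEN = "[UNK]"


def tokens_to_ids(tokens, vocab=VOCAB, unk_token=UNK_TOKEN):
    """Map each token to its id, falling back to the unknown-token id."""
    unk_id = vocab[unk_token]
    return [vocab.get(token, unk_id) for token in tokens]


# "[Xx]" is not in the vocabulary, so it maps to the [UNK] id.
ids = tokens_to_ids(["[CH3]", "[OH]", "[Xx]"])
print(ids)  # [1002, 1024, 3]
```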
tokenizer_config.json ADDED
@@ -0,0 +1,72 @@
+ {
+   "added_tokens_decoder": {
+     "0": {
+       "content": "[BOS]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "[EOS]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "[PAD]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "3": {
+       "content": "[UNK]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "4": {
+       "content": "[MASK]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "5": {
+       "content": "[CLS]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "6": {
+       "content": "[SEP]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "bos_token": "[BOS]",
+   "clean_up_tokenization_spaces": true,
+   "cls_token": "[CLS]",
+   "eos_token": "[EOS]",
+   "mask_token": "[MASK]",
+   "model_max_length": 512,
+   "pad_token": "[PAD]",
+   "padding_side": "right",
+   "sep_token": "[SEP]",
+   "tokenizer_class": "PreTrainedTokenizerFast",
+   "truncation_side": "left",
+   "unk_token": "[UNK]"
+ }
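The config above names each special token twice: once by id in `added_tokens_decoder` and once by role (`bos_token`, `pad_token`, and so on). A minimal sketch of a consistency check between the two is shown below, using only the standard library. The helper name `special_token_ids` is hypothetical, and the embedded JSON is an abbreviated copy of the config that keeps only the `content` field of each entry.

```python
import json

# Abbreviated copy of the config above; per-token flags other than "content" are omitted.
CONFIG_JSON = """
{
  "added_tokens_decoder": {
    "0": {"content": "[BOS]"},
    "1": {"content": "[EOS]"},
    "2": {"content": "[PAD]"},
    "3": {"content": "[UNK]"},
    "4": {"content": "[MASK]"},
    "5": {"content": "[CLS]"},
    "6": {"content": "[SEP]"}
  },
  "bos_token": "[BOS]",
  "cls_token": "[CLS]",
  "eos_token": "[EOS]",
  "mask_token": "[MASK]",
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "unk_token": "[UNK]"
}
"""


def special_token_ids(config):
    """Return {role: id} for every "*_token" field, checking it exists in added_tokens_decoder."""
    content_to_id = {
        entry["content"]: int(token_id)
        for token_id, entry in config["added_tokens_decoder"].items()
    }
    roles = {}
    for key, value in config.items():
        if key.endswith("_token") and isinstance(value, str):
            if value not in content_to_id:
                raise ValueError(f"{key}={value!r} is missing from added_tokens_decoder")
            roles[key] = content_to_id[value]
    return roles


role_ids = special_token_ids(json.loads(CONFIG_JSON))
print(role_ids)
```

A check like this catches a role that points at a token string with no reserved id, which would otherwise only surface later as an unexpected `[UNK]` or a padding mismatch.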