philocifer commited on
Commit
09afbd0
·
verified ·
1 Parent(s): cf0d9fd

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 1024,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,692 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:300
8
+ - loss:MatryoshkaLoss
9
+ - loss:MultipleNegativesRankingLoss
10
+ base_model: Snowflake/snowflake-arctic-embed-l
11
+ widget:
12
+ - source_sentence: How many full-time staff members are employed at 787 Market & Cafe?
13
+ sentences:
14
+ - 'STORE ANALYSIS: 787 Market & Cafe (7858724)
15
+
16
+ Location: 6105 Memphis Ave, Cleveland
17
+
18
+
19
+ Store 7858724 - 787 Market & Cafe operates as a Open Store Supermarket-Conventional
20
+ establishment at 6105 Memphis Ave, Cleveland, OH 441442252 (FIPS 39-35). Geographically
21
+ precise at coordinates 41.4399,-81.7293 (Geocoded to specific address), this location
22
+ generates $2,028,000 in annual sales ($2,000,001 to $4,000,000) from its 2000.0
23
+ square foot space. The operation employs 23 full-time staff across 3 checkout
24
+ lanes, yielding a sales density of $1,014.00/sqft. Owned by Independent (Family
25
+ ID: 99999) as part of a 1 Store-location network, the store sources inventory
26
+ through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its
27
+ position in the Grocery sector''s Supermarket-Conventional segment.'
28
+ - 'STORE ANALYSIS: 128 Teresa Grocery (729684)
29
+
30
+ Location: 128 Audubon Ave, New York
31
+
32
+
33
+ Store 729684 - 128 Teresa Grocery operates as a Open Store Superette establishment
34
+ at 128 Audubon Ave, New York, NY 100322109 (FIPS 36-61). Geographically precise
35
+ at coordinates 40.8427,-73.9369 (Geocoded to specific address), this location
36
+ generates $1,560,000 in annual sales ($1,500,001 to $2,000,000) from its 2000.0
37
+ square foot space. The operation employs 7 full-time staff across 1 checkout lanes,
38
+ yielding a sales density of $780.00/sqft. Owned by Independent (Family ID: 99999)
39
+ as part of a 1 Store-location network, the store sources inventory through Small
40
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
41
+ Grocery sector''s Superette segment.'
42
+ - 'STORE ANALYSIS: 2 Star Grocery (1818510)
43
+
44
+ Location: 1123 Dellwood Ave, Memphis
45
+
46
+
47
+ Store 1818510 - 2 Star Grocery operates as a Open Store Superette establishment
48
+ at 1123 Dellwood Ave, Memphis, TN 381277761 (FIPS 47-157). Geographically precise
49
+ at coordinates 35.2105,-90.0267 (Geocoded to specific address), this location
50
+ generates $1,664,000 in annual sales ($1,500,001 to $2,000,000) from its 1000.0
51
+ square foot space. The operation employs 7 full-time staff across 2 checkout lanes,
52
+ yielding a sales density of $1,664.00/sqft. Owned by Independent (Family ID: 99999)
53
+ as part of a 1 Store-location network, the store sources inventory through Small
54
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
55
+ Grocery sector''s Superette segment.'
56
+ - source_sentence: What is the annual sales figure for the 2031 Webster Food Court
57
+ supermarket?
58
+ sentences:
59
+ - 'STORE ANALYSIS: 2031 Webster Food Court (2192571)
60
+
61
+ Location: 2031 Webster Ave, Bronx
62
+
63
+
64
+ Store 2192571 - 2031 Webster Food Court operates as a Open Store Supermarket-Conventional
65
+ establishment at 2031 Webster Ave, Bronx, NY 104572411 (FIPS 36-5). Geographically
66
+ precise at coordinates 40.8509,-73.8992 (Geocoded to specific address), this location
67
+ generates $2,028,000 in annual sales ($2,000,001 to $4,000,000) from its 1000.0
68
+ square foot space. The operation employs 14 full-time staff across 1 checkout
69
+ lanes, yielding a sales density of $2,028.00/sqft. Owned by Independent (Family
70
+ ID: 99999) as part of a 1 Store-location network, the store sources inventory
71
+ through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its
72
+ position in the Grocery sector''s Supermarket-Conventional segment.'
73
+ - 'STORE ANALYSIS: 4M Foods (1483149)
74
+
75
+ Location: 6349 Macarthur Blvd, Oakland
76
+
77
+
78
+ Store 1483149 - 4M Foods operates as a Open Store Superette establishment at 6349
79
+ Macarthur Blvd, Oakland, CA 946051635 (FIPS 6-1). Geographically precise at coordinates
80
+ 37.7741,-122.1801 (Geocoded to specific address), this location generates $1,560,000
81
+ in annual sales ($1,500,001 to $2,000,000) from its 2000.0 square foot space.
82
+ The operation employs 7 full-time staff across 2 checkout lanes, yielding a sales
83
+ density of $780.00/sqft. Owned by Independent (Family ID: 99999) as part of a
84
+ 1 Store-location network, the store sources inventory through Small Supplier (Supplier
85
+ ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector''s
86
+ Superette segment.'
87
+ - 'STORE ANALYSIS: 16th Food Max (701011)
88
+
89
+ Location: 9901 N 16th St, Tampa
90
+
91
+
92
+ Store 701011 - 16th Food Max operates as a Open Store Superette establishment
93
+ at 9901 N 16th St, Tampa, FL 336128233 (FIPS 12-57). Geographically precise at
94
+ coordinates 28.0393,-82.4415 (Geocoded to specific address), this location generates
95
+ $1,196,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0 square foot
96
+ space. The operation employs 6 full-time staff across 2 checkout lanes, yielding
97
+ a sales density of $1,196.00/sqft. Owned by Independent (Family ID: 99999) as
98
+ part of a 1 Store-location network, the store sources inventory through Small
99
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
100
+ Grocery sector''s Superette segment.'
101
+ - source_sentence: What is the annual sales figure for 103 Deli & Grocery located
102
+ at 148 E 103rd St, New York?
103
+ sentences:
104
+ - 'STORE ANALYSIS: 103 Deli & Grocery (970062)
105
+
106
+ Location: 148 E 103rd St, New York
107
+
108
+
109
+ Store 970062 - 103 Deli & Grocery operates as a Open Store Superette establishment
110
+ at 148 E 103rd St, New York, NY 100295334 (FIPS 36-61). Geographically precise
111
+ at coordinates 40.7902,-73.9476 (Geocoded to specific address), this location
112
+ generates $1,352,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0
113
+ square foot space. The operation employs 6 full-time staff across 1 checkout lanes,
114
+ yielding a sales density of $1,352.00/sqft. Owned by Independent (Family ID: 99999)
115
+ as part of a 1 Store-location network, the store sources inventory through Small
116
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
117
+ Grocery sector''s Superette segment.'
118
+ - 'STORE ANALYSIS: 1158 Grocery & Deli (1666158)
119
+
120
+ Location: 1158 Saint Lawrence Ave, Bronx
121
+
122
+
123
+ Store 1666158 - 1158 Grocery & Deli operates as a Open Store Superette establishment
124
+ at 1158 Saint Lawrence Ave, Bronx, NY 104724612 (FIPS 36-5). Geographically precise
125
+ at coordinates 40.8295,-73.8666 (Geocoded to specific address), this location
126
+ generates $1,352,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0
127
+ square foot space. The operation employs 6 full-time staff across 2 checkout lanes,
128
+ yielding a sales density of $1,352.00/sqft. Owned by Independent (Family ID: 99999)
129
+ as part of a 1 Store-location network, the store sources inventory through Small
130
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
131
+ Grocery sector''s Superette segment.'
132
+ - 'STORE ANALYSIS: A & A Grocery (7345282)
133
+
134
+ Location: 6776 Biggers Reyno Rd, Reyno
135
+
136
+
137
+ Store 7345282 - A & A Grocery operates as a Open Store Supermarket-Conventional
138
+ establishment at 6776 Biggers Reyno Rd, Reyno, AR 72462 (FIPS 5-121). Geographically
139
+ precise at coordinates 36.3629,-90.7534 (Geocoded to specific address), this location
140
+ generates $4,160,000 in annual sales ($4,000,001 to $6,000,000) from its 2000.0
141
+ square foot space. The operation employs 16 full-time staff across 2 checkout
142
+ lanes, yielding a sales density of $2,080.00/sqft. Owned by Independent (Family
143
+ ID: 99999) as part of a 1 Store-location network, the store sources inventory
144
+ through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its
145
+ position in the Grocery sector''s Supermarket-Conventional segment.'
146
+ - source_sentence: What is the annual sales figure for 12th St Mike Gourmet Deli?
147
+ sentences:
148
+ - 'STORE ANALYSIS: 8 Brothers Grocery Store (1639326)
149
+
150
+ Location: 2120 N 29th St, Philadelphia
151
+
152
+
153
+ Store 1639326 - 8 Brothers Grocery Store operates as a Open Store Superette establishment
154
+ at 2120 N 29th St, Philadelphia, PA 191211234 (FIPS 42-101). Geographically precise
155
+ at coordinates 39.9886,-75.1805 (Geocoded to specific address), this location
156
+ generates $1,820,000 in annual sales ($1,500,001 to $2,000,000) from its 3000.0
157
+ square foot space. The operation employs 9 full-time staff across 1 checkout lanes,
158
+ yielding a sales density of $606.67/sqft. Owned by Independent (Family ID: 99999)
159
+ as part of a 1 Store-location network, the store sources inventory through Small
160
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
161
+ Grocery sector''s Superette segment.'
162
+ - 'STORE ANALYSIS: 12th St Mike Gourmet Deli (879406)
163
+
164
+ Location: 1203 40th Ave, Long Island City
165
+
166
+
167
+ Store 879406 - 12th St Mike Gourmet Deli operates as a Open Store Superette establishment
168
+ at 1203 40th Ave, Long Island City, NY 111016107 (FIPS 36-81). Geographically
169
+ precise at coordinates 40.7561,-73.9425 (Geocoded to specific address), this location
170
+ generates $1,508,000 in annual sales ($1,500,001 to $2,000,000) from its 1000.0
171
+ square foot space. The operation employs 6 full-time staff across 1 checkout lanes,
172
+ yielding a sales density of $1,508.00/sqft. Owned by Independent (Family ID: 99999)
173
+ as part of a 1 Store-location network, the store sources inventory through Harold
174
+ Levinson Associates (Supplier ID: 195994, Family ID: 2213) to maintain its position
175
+ in the Grocery sector''s Superette segment.'
176
+ - 'STORE ANALYSIS: A & B Naturals (5551436)
177
+
178
+ Location: 101 Cottage St, Bar Harbor
179
+
180
+
181
+ Store 5551436 - A & B Naturals operates as a Open Store Supermarket-Natural/Gourmet
182
+ Foods establishment at 101 Cottage St, Bar Harbor, ME 46091442 (FIPS 23-9). Geographically
183
+ precise at coordinates 44.389,-68.2121 (Geocoded to specific address), this location
184
+ generates $3,640,000 in annual sales ($2,000,001 to $4,000,000) from its 3000.0
185
+ square foot space. The operation employs 15 full-time staff across 2 checkout
186
+ lanes, yielding a sales density of $1,213.33/sqft. Owned by Independent (Family
187
+ ID: 99999) as part of a 1 Store-location network, the store sources inventory
188
+ through United Natural Foods/Dist Ctr (Supplier ID: 74258, Family ID: 10540) to
189
+ maintain its position in the Grocery sector''s Supermarket-Natural/Gourmet Foods
190
+ segment.'
191
+ - source_sentence: How many full-time staff members are employed at the 54 Royal Market?
192
+ sentences:
193
+ - 'STORE ANALYSIS: 114th Gourmet Deli & Grill (805951)
194
+
195
+ Location: 11321 Jamaica Ave, Richmond Hill
196
+
197
+
198
+ Store 805951 - 114th Gourmet Deli & Grill operates as a Open Store Superette establishment
199
+ at 11321 Jamaica Ave, Richmond Hill, NY 114182441 (FIPS 36-81). Geographically
200
+ precise at coordinates 40.6983,-73.835 (Geocoded to specific address), this location
201
+ generates $1,820,000 in annual sales ($1,500,001 to $2,000,000) from its 1000.0
202
+ square foot space. The operation employs 11 full-time staff across 1 checkout
203
+ lanes, yielding a sales density of $1,820.00/sqft. Owned by Independent (Family
204
+ ID: 99999) as part of a 1 Store-location network, the store sources inventory
205
+ through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its
206
+ position in the Grocery sector''s Superette segment.'
207
+ - 'STORE ANALYSIS: 54 Royal Market (728885)
208
+
209
+ Location: 817 9th Ave, New York
210
+
211
+
212
+ Store 728885 - 54 Royal Market operates as a Open Store Superette establishment
213
+ at 817 9th Ave, New York, NY 100194401 (FIPS 36-61). Geographically precise at
214
+ coordinates 40.7662,-73.9872 (Geocoded to specific address), this location generates
215
+ $1,768,000 in annual sales ($1,500,001 to $2,000,000) from its 2000.0 square foot
216
+ space. The operation employs 11 full-time staff across 3 checkout lanes, yielding
217
+ a sales density of $884.00/sqft. Owned by Independent (Family ID: 99999) as part
218
+ of a 1 Store-location network, the store sources inventory through Small Supplier
219
+ (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery
220
+ sector''s Superette segment.'
221
+ - 'STORE ANALYSIS: 1683 Jimmy Deli Grocery (1816263)
222
+
223
+ Location: 1683 Woodbine St, Ridgewood
224
+
225
+
226
+ Store 1816263 - 1683 Jimmy Deli Grocery operates as a Open Store Superette establishment
227
+ at 1683 Woodbine St, Ridgewood, NY 113853546 (FIPS 36-81). Geographically precise
228
+ at coordinates 40.7012,-73.9083 (Geocoded to specific address), this location
229
+ generates $1,456,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0
230
+ square foot space. The operation employs 8 full-time staff across 2 checkout lanes,
231
+ yielding a sales density of $1,456.00/sqft. Owned by Independent (Family ID: 99999)
232
+ as part of a 1 Store-location network, the store sources inventory through Small
233
+ Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the
234
+ Grocery sector''s Superette segment.'
235
+ pipeline_tag: sentence-similarity
236
+ library_name: sentence-transformers
237
+ metrics:
238
+ - cosine_accuracy@1
239
+ - cosine_accuracy@3
240
+ - cosine_accuracy@5
241
+ - cosine_accuracy@10
242
+ - cosine_precision@1
243
+ - cosine_precision@3
244
+ - cosine_precision@5
245
+ - cosine_precision@10
246
+ - cosine_recall@1
247
+ - cosine_recall@3
248
+ - cosine_recall@5
249
+ - cosine_recall@10
250
+ - cosine_ndcg@10
251
+ - cosine_mrr@10
252
+ - cosine_map@100
253
+ model-index:
254
+ - name: SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
255
+ results:
256
+ - task:
257
+ type: information-retrieval
258
+ name: Information Retrieval
259
+ dataset:
260
+ name: Unknown
261
+ type: unknown
262
+ metrics:
263
+ - type: cosine_accuracy@1
264
+ value: 0.85
265
+ name: Cosine Accuracy@1
266
+ - type: cosine_accuracy@3
267
+ value: 0.93
268
+ name: Cosine Accuracy@3
269
+ - type: cosine_accuracy@5
270
+ value: 0.96
271
+ name: Cosine Accuracy@5
272
+ - type: cosine_accuracy@10
273
+ value: 0.98
274
+ name: Cosine Accuracy@10
275
+ - type: cosine_precision@1
276
+ value: 0.85
277
+ name: Cosine Precision@1
278
+ - type: cosine_precision@3
279
+ value: 0.30999999999999994
280
+ name: Cosine Precision@3
281
+ - type: cosine_precision@5
282
+ value: 0.19199999999999995
283
+ name: Cosine Precision@5
284
+ - type: cosine_precision@10
285
+ value: 0.09799999999999998
286
+ name: Cosine Precision@10
287
+ - type: cosine_recall@1
288
+ value: 0.85
289
+ name: Cosine Recall@1
290
+ - type: cosine_recall@3
291
+ value: 0.93
292
+ name: Cosine Recall@3
293
+ - type: cosine_recall@5
294
+ value: 0.96
295
+ name: Cosine Recall@5
296
+ - type: cosine_recall@10
297
+ value: 0.98
298
+ name: Cosine Recall@10
299
+ - type: cosine_ndcg@10
300
+ value: 0.9172332496525142
301
+ name: Cosine Ndcg@10
302
+ - type: cosine_mrr@10
303
+ value: 0.8967619047619046
304
+ name: Cosine Mrr@10
305
+ - type: cosine_map@100
306
+ value: 0.8976413373860181
307
+ name: Cosine Map@100
308
+ ---
309
+
310
+ # SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
311
+
312
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Snowflake/snowflake-arctic-embed-l](https://huggingface.co/Snowflake/snowflake-arctic-embed-l). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
313
+
314
+ ## Model Details
315
+
316
+ ### Model Description
317
+ - **Model Type:** Sentence Transformer
318
+ - **Base model:** [Snowflake/snowflake-arctic-embed-l](https://huggingface.co/Snowflake/snowflake-arctic-embed-l) <!-- at revision d8fb21ca8d905d2832ee8b96c894d3298964346b -->
319
+ - **Maximum Sequence Length:** 512 tokens
320
+ - **Output Dimensionality:** 1024 dimensions
321
+ - **Similarity Function:** Cosine Similarity
322
+ <!-- - **Training Dataset:** Unknown -->
323
+ <!-- - **Language:** Unknown -->
324
+ <!-- - **License:** Unknown -->
325
+
326
+ ### Model Sources
327
+
328
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
329
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
330
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
331
+
332
+ ### Full Model Architecture
333
+
334
+ ```
335
+ SentenceTransformer(
336
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
337
+ (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
338
+ (2): Normalize()
339
+ )
340
+ ```
341
+
342
+ ## Usage
343
+
344
+ ### Direct Usage (Sentence Transformers)
345
+
346
+ First install the Sentence Transformers library:
347
+
348
+ ```bash
349
+ pip install -U sentence-transformers
350
+ ```
351
+
352
+ Then you can load this model and run inference.
353
+ ```python
354
+ from sentence_transformers import SentenceTransformer
355
+
356
+ # Download from the 🤗 Hub
357
+ model = SentenceTransformer("philocifer/banner-flip-arctic-embed-l")
358
+ # Run inference
359
+ sentences = [
360
+ 'How many full-time staff members are employed at the 54 Royal Market?',
361
+ "STORE ANALYSIS: 54 Royal Market (728885)\nLocation: 817 9th Ave, New York\n\nStore 728885 - 54 Royal Market operates as a Open Store Superette establishment at 817 9th Ave, New York, NY 100194401 (FIPS 36-61). Geographically precise at coordinates 40.7662,-73.9872 (Geocoded to specific address), this location generates $1,768,000 in annual sales ($1,500,001 to $2,000,000) from its 2000.0 square foot space. The operation employs 11 full-time staff across 3 checkout lanes, yielding a sales density of $884.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector's Superette segment.",
362
+ "STORE ANALYSIS: 1683 Jimmy Deli Grocery (1816263)\nLocation: 1683 Woodbine St, Ridgewood\n\nStore 1816263 - 1683 Jimmy Deli Grocery operates as a Open Store Superette establishment at 1683 Woodbine St, Ridgewood, NY 113853546 (FIPS 36-81). Geographically precise at coordinates 40.7012,-73.9083 (Geocoded to specific address), this location generates $1,456,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The operation employs 8 full-time staff across 2 checkout lanes, yielding a sales density of $1,456.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector's Superette segment.",
363
+ ]
364
+ embeddings = model.encode(sentences)
365
+ print(embeddings.shape)
366
+ # [3, 1024]
367
+
368
+ # Get the similarity scores for the embeddings
369
+ similarities = model.similarity(embeddings, embeddings)
370
+ print(similarities.shape)
371
+ # [3, 3]
372
+ ```
373
+
374
+ <!--
375
+ ### Direct Usage (Transformers)
376
+
377
+ <details><summary>Click to see the direct usage in Transformers</summary>
378
+
379
+ </details>
380
+ -->
381
+
382
+ <!--
383
+ ### Downstream Usage (Sentence Transformers)
384
+
385
+ You can finetune this model on your own dataset.
386
+
387
+ <details><summary>Click to expand</summary>
388
+
389
+ </details>
390
+ -->
391
+
392
+ <!--
393
+ ### Out-of-Scope Use
394
+
395
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
396
+ -->
397
+
398
+ ## Evaluation
399
+
400
+ ### Metrics
401
+
402
+ #### Information Retrieval
403
+
404
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
405
+
406
+ | Metric | Value |
407
+ |:--------------------|:-----------|
408
+ | cosine_accuracy@1 | 0.85 |
409
+ | cosine_accuracy@3 | 0.93 |
410
+ | cosine_accuracy@5 | 0.96 |
411
+ | cosine_accuracy@10 | 0.98 |
412
+ | cosine_precision@1 | 0.85 |
413
+ | cosine_precision@3 | 0.31 |
414
+ | cosine_precision@5 | 0.192 |
415
+ | cosine_precision@10 | 0.098 |
416
+ | cosine_recall@1 | 0.85 |
417
+ | cosine_recall@3 | 0.93 |
418
+ | cosine_recall@5 | 0.96 |
419
+ | cosine_recall@10 | 0.98 |
420
+ | **cosine_ndcg@10** | **0.9172** |
421
+ | cosine_mrr@10 | 0.8968 |
422
+ | cosine_map@100 | 0.8976 |
423
+
424
+ <!--
425
+ ## Bias, Risks and Limitations
426
+
427
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
428
+ -->
429
+
430
+ <!--
431
+ ### Recommendations
432
+
433
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
434
+ -->
435
+
436
+ ## Training Details
437
+
438
+ ### Training Dataset
439
+
440
+ #### Unnamed Dataset
441
+
442
+ * Size: 300 training samples
443
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
444
+ * Approximate statistics based on the first 300 samples:
445
+ | | sentence_0 | sentence_1 |
446
+ |:--------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
447
+ | type | string | string |
448
+ | details | <ul><li>min: 13 tokens</li><li>mean: 19.23 tokens</li><li>max: 29 tokens</li></ul> | <ul><li>min: 200 tokens</li><li>mean: 215.01 tokens</li><li>max: 232 tokens</li></ul> |
449
+ * Samples:
450
+ | sentence_0 | sentence_1 |
451
+ |:-------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
452
+ | <code>How many full-time staff members are employed at 3 Rivers Grocery Market?</code> | <code>STORE ANALYSIS: 3 Rivers Grocery Market (432489)<br>Location: 9400 US Highway 60 W, Kevil<br><br>Store 432489 - 3 Rivers Grocery Market operates as a Open Store Supermarket-Conventional establishment at 9400 US Highway 60 W, Kevil, KY 420539678 (FIPS 21-145). Geographically precise at coordinates 37.0624,-88.8028 (Geocoded to specific address), this location generates $4,160,000 in annual sales ($4,000,001 to $6,000,000) from its 13000.0 square foot space. The operation employs 18 full-time staff across 4 checkout lanes, yielding a sales density of $320.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Assoc Wholesale/Nashville Div (Supplier ID: 12115, Family ID: 4110) to maintain its position in the Grocery sector's Supermarket-Conventional segment.</code> |
453
+ | <code>How many full-time staff members are employed at the 28th Street Supermarket?</code> | <code>STORE ANALYSIS: 28th Street Supermarket (737932)<br>Location: 2747 Cedar Ave, Cleveland<br><br>Store 737932 - 28th Street Supermarket operates as a Open Store Superette establishment at 2747 Cedar Ave, Cleveland, OH 441152908 (FIPS 39-35). Geographically precise at coordinates 41.4988,-81.6687 (Geocoded to specific address), this location generates $1,404,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The operation employs 4 full-time staff across 3 checkout lanes, yielding a sales density of $1,404.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through H T Hackney Co/Dist Ctr (Supplier ID: 36166, Family ID: 41880) to maintain its position in the Grocery sector's Superette segment.</code> |
454
+ | <code>How many full-time staff members are employed at the 4th Street Market?</code> | <code>STORE ANALYSIS: 4th Street Market (772013)<br>Location: 301 4th St, Richmond<br><br>Store 772013 - 4th Street Market operates as a Open Store Superette establishment at 301 4th St, Richmond, CA 948013001 (FIPS 6-13). Geographically precise at coordinates 37.9362,-122.3657 (Geocoded to specific address), this location generates $1,560,000 in annual sales ($1,500,001 to $2,000,000) from its 1000.0 square foot space. The operation employs 7 full-time staff across 2 checkout lanes, yielding a sales density of $1,560.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector's Superette segment.</code> |
455
+ * Loss: [<code>MatryoshkaLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#matryoshkaloss) with these parameters:
456
+ ```json
457
+ {
458
+ "loss": "MultipleNegativesRankingLoss",
459
+ "matryoshka_dims": [
460
+ 768,
461
+ 512,
462
+ 256,
463
+ 128,
464
+ 64
465
+ ],
466
+ "matryoshka_weights": [
467
+ 1,
468
+ 1,
469
+ 1,
470
+ 1,
471
+ 1
472
+ ],
473
+ "n_dims_per_step": -1
474
+ }
475
+ ```
476
+
477
+ ### Training Hyperparameters
478
+ #### Non-Default Hyperparameters
479
+
480
+ - `eval_strategy`: steps
481
+ - `per_device_train_batch_size`: 10
482
+ - `per_device_eval_batch_size`: 10
483
+ - `num_train_epochs`: 10
484
+ - `multi_dataset_batch_sampler`: round_robin
485
+
486
+ #### All Hyperparameters
487
+ <details><summary>Click to expand</summary>
488
+
489
+ - `overwrite_output_dir`: False
490
+ - `do_predict`: False
491
+ - `eval_strategy`: steps
492
+ - `prediction_loss_only`: True
493
+ - `per_device_train_batch_size`: 10
494
+ - `per_device_eval_batch_size`: 10
495
+ - `per_gpu_train_batch_size`: None
496
+ - `per_gpu_eval_batch_size`: None
497
+ - `gradient_accumulation_steps`: 1
498
+ - `eval_accumulation_steps`: None
499
+ - `torch_empty_cache_steps`: None
500
+ - `learning_rate`: 5e-05
501
+ - `weight_decay`: 0.0
502
+ - `adam_beta1`: 0.9
503
+ - `adam_beta2`: 0.999
504
+ - `adam_epsilon`: 1e-08
505
+ - `max_grad_norm`: 1
506
+ - `num_train_epochs`: 10
507
+ - `max_steps`: -1
508
+ - `lr_scheduler_type`: linear
509
+ - `lr_scheduler_kwargs`: {}
510
+ - `warmup_ratio`: 0.0
511
+ - `warmup_steps`: 0
512
+ - `log_level`: passive
513
+ - `log_level_replica`: warning
514
+ - `log_on_each_node`: True
515
+ - `logging_nan_inf_filter`: True
516
+ - `save_safetensors`: True
517
+ - `save_on_each_node`: False
518
+ - `save_only_model`: False
519
+ - `restore_callback_states_from_checkpoint`: False
520
+ - `no_cuda`: False
521
+ - `use_cpu`: False
522
+ - `use_mps_device`: False
523
+ - `seed`: 42
524
+ - `data_seed`: None
525
+ - `jit_mode_eval`: False
526
+ - `use_ipex`: False
527
+ - `bf16`: False
528
+ - `fp16`: False
529
+ - `fp16_opt_level`: O1
530
+ - `half_precision_backend`: auto
531
+ - `bf16_full_eval`: False
532
+ - `fp16_full_eval`: False
533
+ - `tf32`: None
534
+ - `local_rank`: 0
535
+ - `ddp_backend`: None
536
+ - `tpu_num_cores`: None
537
+ - `tpu_metrics_debug`: False
538
+ - `debug`: []
539
+ - `dataloader_drop_last`: False
540
+ - `dataloader_num_workers`: 0
541
+ - `dataloader_prefetch_factor`: None
542
+ - `past_index`: -1
543
+ - `disable_tqdm`: False
544
+ - `remove_unused_columns`: True
545
+ - `label_names`: None
546
+ - `load_best_model_at_end`: False
547
+ - `ignore_data_skip`: False
548
+ - `fsdp`: []
549
+ - `fsdp_min_num_params`: 0
550
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
551
+ - `fsdp_transformer_layer_cls_to_wrap`: None
552
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
553
+ - `deepspeed`: None
554
+ - `label_smoothing_factor`: 0.0
555
+ - `optim`: adamw_torch
556
+ - `optim_args`: None
557
+ - `adafactor`: False
558
+ - `group_by_length`: False
559
+ - `length_column_name`: length
560
+ - `ddp_find_unused_parameters`: None
561
+ - `ddp_bucket_cap_mb`: None
562
+ - `ddp_broadcast_buffers`: False
563
+ - `dataloader_pin_memory`: True
564
+ - `dataloader_persistent_workers`: False
565
+ - `skip_memory_metrics`: True
566
+ - `use_legacy_prediction_loop`: False
567
+ - `push_to_hub`: False
568
+ - `resume_from_checkpoint`: None
569
+ - `hub_model_id`: None
570
+ - `hub_strategy`: every_save
571
+ - `hub_private_repo`: None
572
+ - `hub_always_push`: False
573
+ - `gradient_checkpointing`: False
574
+ - `gradient_checkpointing_kwargs`: None
575
+ - `include_inputs_for_metrics`: False
576
+ - `include_for_metrics`: []
577
+ - `eval_do_concat_batches`: True
578
+ - `fp16_backend`: auto
579
+ - `push_to_hub_model_id`: None
580
+ - `push_to_hub_organization`: None
581
+ - `mp_parameters`:
582
+ - `auto_find_batch_size`: False
583
+ - `full_determinism`: False
584
+ - `torchdynamo`: None
585
+ - `ray_scope`: last
586
+ - `ddp_timeout`: 1800
587
+ - `torch_compile`: False
588
+ - `torch_compile_backend`: None
589
+ - `torch_compile_mode`: None
590
+ - `dispatch_batches`: None
591
+ - `split_batches`: None
592
+ - `include_tokens_per_second`: False
593
+ - `include_num_input_tokens_seen`: False
594
+ - `neftune_noise_alpha`: None
595
+ - `optim_target_modules`: None
596
+ - `batch_eval_metrics`: False
597
+ - `eval_on_start`: False
598
+ - `use_liger_kernel`: False
599
+ - `eval_use_gather_object`: False
600
+ - `average_tokens_across_devices`: False
601
+ - `prompts`: None
602
+ - `batch_sampler`: batch_sampler
603
+ - `multi_dataset_batch_sampler`: round_robin
604
+
605
+ </details>
606
+
607
+ ### Training Logs
608
+ | Epoch | Step | cosine_ndcg@10 |
609
+ |:------:|:----:|:--------------:|
610
+ | 1.0 | 30 | 0.9111 |
611
+ | 1.6667 | 50 | 0.9106 |
612
+ | 2.0 | 60 | 0.9058 |
613
+ | 3.0 | 90 | 0.9149 |
614
+ | 3.3333 | 100 | 0.9199 |
615
+ | 4.0 | 120 | 0.9185 |
616
+ | 5.0 | 150 | 0.9208 |
617
+ | 6.0 | 180 | 0.9172 |
618
+ | 6.6667 | 200 | 0.9172 |
619
+ | 7.0 | 210 | 0.9172 |
620
+ | 8.0 | 240 | 0.9172 |
621
+ | 8.3333 | 250 | 0.9172 |
622
+ | 9.0 | 270 | 0.9172 |
623
+ | 10.0 | 300 | 0.9172 |
624
+
625
+
626
+ ### Framework Versions
627
+ - Python: 3.11.11
628
+ - Sentence Transformers: 3.4.1
629
+ - Transformers: 4.48.3
630
+ - PyTorch: 2.5.1+cu124
631
+ - Accelerate: 1.3.0
632
+ - Datasets: 3.3.2
633
+ - Tokenizers: 0.21.0
634
+
635
+ ## Citation
636
+
637
+ ### BibTeX
638
+
639
+ #### Sentence Transformers
640
+ ```bibtex
641
+ @inproceedings{reimers-2019-sentence-bert,
642
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
643
+ author = "Reimers, Nils and Gurevych, Iryna",
644
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
645
+ month = "11",
646
+ year = "2019",
647
+ publisher = "Association for Computational Linguistics",
648
+ url = "https://arxiv.org/abs/1908.10084",
649
+ }
650
+ ```
651
+
652
+ #### MatryoshkaLoss
653
+ ```bibtex
654
+ @misc{kusupati2024matryoshka,
655
+ title={Matryoshka Representation Learning},
656
+ author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
657
+ year={2024},
658
+ eprint={2205.13147},
659
+ archivePrefix={arXiv},
660
+ primaryClass={cs.LG}
661
+ }
662
+ ```
663
+
664
+ #### MultipleNegativesRankingLoss
665
+ ```bibtex
666
+ @misc{henderson2017efficient,
667
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
668
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
669
+ year={2017},
670
+ eprint={1705.00652},
671
+ archivePrefix={arXiv},
672
+ primaryClass={cs.CL}
673
+ }
674
+ ```
675
+
676
+ <!--
677
+ ## Glossary
678
+
679
+ *Clearly define terms in order to be accessible across audiences.*
680
+ -->
681
+
682
+ <!--
683
+ ## Model Card Authors
684
+
685
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
686
+ -->
687
+
688
+ <!--
689
+ ## Model Card Contact
690
+
691
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
692
+ -->
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "Snowflake/snowflake-arctic-embed-l",
3
+ "architectures": [
4
+ "BertModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "classifier_dropout": null,
8
+ "hidden_act": "gelu",
9
+ "hidden_dropout_prob": 0.1,
10
+ "hidden_size": 1024,
11
+ "initializer_range": 0.02,
12
+ "intermediate_size": 4096,
13
+ "layer_norm_eps": 1e-12,
14
+ "max_position_embeddings": 512,
15
+ "model_type": "bert",
16
+ "num_attention_heads": 16,
17
+ "num_hidden_layers": 24,
18
+ "pad_token_id": 0,
19
+ "position_embedding_type": "absolute",
20
+ "torch_dtype": "float32",
21
+ "transformers_version": "4.48.3",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 30522
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.4.1",
4
+ "transformers": "4.48.3",
5
+ "pytorch": "2.5.1+cu124"
6
+ },
7
+ "prompts": {
8
+ "query": "Represent this sentence for searching relevant passages: "
9
+ },
10
+ "default_prompt_name": null,
11
+ "similarity_fn_name": "cosine"
12
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa5b181bf06b9084809506bc81610b3ceabe292b778ebffa7652530b14346510
3
+ size 1336413848
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_lower_case": true,
47
+ "extra_special_tokens": {},
48
+ "mask_token": "[MASK]",
49
+ "max_length": 512,
50
+ "model_max_length": 512,
51
+ "pad_to_multiple_of": null,
52
+ "pad_token": "[PAD]",
53
+ "pad_token_type_id": 0,
54
+ "padding_side": "right",
55
+ "sep_token": "[SEP]",
56
+ "stride": 0,
57
+ "strip_accents": null,
58
+ "tokenize_chinese_chars": true,
59
+ "tokenizer_class": "BertTokenizer",
60
+ "truncation_side": "right",
61
+ "truncation_strategy": "longest_first",
62
+ "unk_token": "[UNK]"
63
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff