kenhktsui commited on
Commit
f2f4e3a
1 Parent(s): d348411

Add SetFit model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,311 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: sentence-transformers/paraphrase-mpnet-base-v2
3
+ datasets:
4
+ - stanfordnlp/imdb
5
+ library_name: setfit
6
+ metrics:
7
+ - accuracy
8
+ pipeline_tag: text-classification
9
+ tags:
10
+ - setfit
11
+ - sentence-transformers
12
+ - text-classification
13
+ - generated_from_setfit_trainer
14
+ widget:
15
+ - text: What does the " Executive producer " do in a movie . If I remember correctly
16
+ it's the person who raised the financial backing to make the movie . You might
17
+ notice in a great number of movies starring Sean Connery that he is also the executive
18
+ producer which meant Connery himself raised the money since he is a major player
19
+ . Unfortunately it should also be pointed out that a great number of movies "
20
+ starring Sean Connery were solely made because he managed to raise the money since
21
+ he's a major Hollywood player , it's usually an indication that when the credits
22
+ read that the executive producer and the star of the movie are one and the same
23
+ the movie itself is nothing more than a star vehicle with the story/screenplay
24
+ not being up to scratch <br /><br />PROTOCOL follows the saga of one Sunny Davis
25
+ a kooky bimboesque cocktail waitress who saves a visiting dignitary and as a reward
26
+ gets made a top diplomat . Likely ? As things progress Ms Davis ( Who has problems
27
+ being able to string two sentences together ) finds herself in more outlandish
28
+ and less likely situations . When I say that PROTOCOL stars Goldie Hawn who is
29
+ also the film's executive producer do you understand what I'm saying about the
30
+ story/screenplay not being up to scratch ? Exactly
31
+ - text: I've seen all four of the movies in this series. Each one strays further and
32
+ further from the books. This is the worst one yet. My problem is that it does
33
+ not follow the book it is titled after in any way! The directors and producers
34
+ should have named it any thing other than "Love's Abiding Joy." The only thing
35
+ about this movie that remotely resembles the book are the names of some of the
36
+ characters (Willie, Missie, Henry, Clark, Scottie and Cookie). The names/ages/genders
37
+ of the children are wrong. The entire story line is no where in the book.<br /><br
38
+ />I find it a great disservice to Janette Oke, her books and her fans to produce
39
+ a movie under her title that is not correct in any way. The music is too loud.
40
+ The actors are not convincing - they lack emotions.<br /><br />If you want a good
41
+ family movie, this might do. It is clean. Don't watch it, though, if you are hoping
42
+ for a condensed version of the book. I hope that this will be the last movie from
43
+ this series, but I doubt it. If there are more movies made, I wish Michael Landon,
44
+ Jr and others would stick closer to the original plot and story lines. The books
45
+ are excellent and, if closely followed, would make excellent movies!
46
+ - text: 'THE ZOMBIE CHRONICLES <br /><br />Aspect ratio: 1.33:1 (Nu-View 3-D)<br /><br
47
+ />Sound format: Mono<br /><br />Whilst searching for a (literal) ghost town in
48
+ the middle of nowhere, a young reporter (Emmy Smith) picks up a grizzled hitchhiker
49
+ (Joseph Haggerty) who tells her two stories involving flesh-eating zombies reputed
50
+ to haunt the area.<br /><br />An ABSOLUTE waste of time, hobbled from the outset
51
+ by Haggerty''s painfully amateurish performance in a key role. Worse still, the
52
+ two stories which make up the bulk of the running time are utterly routine, made
53
+ worse by indifferent performances and lackluster direction by Brad Sykes, previously
54
+ responsible for the likes of CAMP BLOOD (1999). This isn''t a ''fun'' movie in
55
+ the sense that Ed Wood''s movies are ''fun'' (he, at least, believed in what he
56
+ was doing and was sincere in his efforts, despite a lack of talent); Sykes'' home-made
57
+ movies are, in fact, aggravating, boring and almost completely devoid of any redeeming
58
+ virtue, and most viewers will feel justifiably angry and cheated by such unimaginative,
59
+ badly-conceived junk. The 3-D format is utterly wasted here.'
60
+ - text: If only to avoid making this type of film in the future. This film is interesting
61
+ as an experiment but tells no cogent story.<br /><br />One might feel virtuous
62
+ for sitting thru it because it touches on so many IMPORTANT issues but it does
63
+ so without any discernable motive. The viewer comes away with no new perspectives
64
+ (unless one comes up with one while one's mind wanders, as it will invariably
65
+ do during this pointless film).<br /><br />One might better spend one's time staring
66
+ out a window at a tree growing.<br /><br />
67
+ - text: Sexo Cannibal, or Devil Hunter as it's more commonly known amongst English
68
+ speaking audiences, starts with actress & model Laura Crawford (Ursula Buchfellner
69
+ as Ursula Fellner) checking out locations for her new film along with her assistant
70
+ Jane (Gisela Hahn). After a long days work Laura is relaxing in the bath of her
71
+ room when two very dubious character's named Chris (Werner Pochath) & Thomas (Antonio
72
+ Mayans) burst in & kidnap her having been helped by the treacherous Jane. Laura's
73
+ agent gets on the blower to rent-a-hero Peter Weston (Al Cliver) who is informed
74
+ of the situation, the kidnappers have Laura on an isolated island & are demanding
75
+ a 6 million ransom. Peter is told that he will be paid 200,000 to get her back
76
+ safely & a further 10% of the 6 million if he brings that back as well, faster
77
+ than a rat up a drain pipe Peter & his Vietnam Vet buddy helicopter pilot Jack
78
+ are on the island & deciding on how to save Laura. So, the kidnappers have Laura
79
+ & Peter has the 6 million but neither want to hand them over that much. Just to
80
+ complicate things further this particular isolated island is home to a primitive
81
+ tribe (hell, in all the generations they've lived there they've only managed to
82
+ build one straw hut, now that's primitive) who worship some cannibal monster dude
83
+ (Burt Altman) with bulging eyes as a God with human sacrifices & this cannibal
84
+ has a liking for young, white female flesh & intestines...<br /><br />This Spanish,
85
+ French & German co-production was co-written & directed by the prolific Jesus
86
+ Franco who also gets the credit for the music as well. Sexo Cannibal has gained
87
+ a certain amount of notoriety here in the UK as it was placed on the 'Video Nasties'
88
+ list in the early 80's under it's alternate Devil Hunter title & therefore officially
89
+ classed as obscene & banned, having said that I have no idea why as it is one
90
+ bad film & even Franco, who isn't afraid to be associated with a turkey, decides
91
+ he wants to hide under the pseudonym of Clifford Brown. I'd imagine even the most
92
+ die-hard Franco fan would have a hard time defending this thing. The script by
93
+ Franco, erm sorry I mean Clifford Brown & Julian Esteban as Julius Valery who
94
+ was obviously another one less than impressed with the finished product & wanted
95
+ his named removed, is awful. It's as simple & straight forward as that. For a
96
+ start the film is so boring it's untrue, the kidnap plot is one of the dullest
97
+ I've ever seen without the slightest bit of tension or excitement involved & the
98
+ horror side of things don't improve as we get a big black guy with stupid looking
99
+ over-sized bloodshot eyes plus two tame cannibal scenes. As a horror film Sexo
100
+ Cannibal fails & as an action adventure it has no more success, this is one to
101
+ avoid.<br /><br />Director Franco shows his usual incompetence throughout, a decapitated
102
+ head is achieved by an actor lying on the ground with large leaves placed around
103
+ the bottom of his neck to try & give the impression it's not attached to anything!
104
+ The cannibal scenes are poor, the action is lame & it has endless scenes of people
105
+ randomly walking around the jungle getting from 'A' to 'B' & not really doing
106
+ anything when they get there either. It becomes incredibly dull & tedious to watch
107
+ after about 10 minutes & don't forget this thing goes on for 94 minutes in it's
108
+ uncut state. I also must mention the hilarious scene when Al Cliver is supposed
109
+ to be climbing a cliff, this is achieved by Franco turning his camera on it's
110
+ side & having Cliver crawl along the floor! Just look at the way his coat hangs
111
+ & the way he never grabs onto to anything as he just pulls himself along! The
112
+ gore isn't that great & as far as Euro cannibal films go this is very tame, there
113
+ are some gross close ups of the cannibals mouth as it chews bits of meat, a man
114
+ is impaled on spikes, there's some blood & a handful of intestines. There's a
115
+ fair bit of nudity in Sexo Cannibal & an unpleasant rape scene.<br /><br />Sexo
116
+ Cannibal must have had a low budget & I mean low. This is a shoddy poorly made
117
+ film with awful special effects & rock bottom production values. The only decent
118
+ thing about it is the jungle setting which at least looks authentic. The music
119
+ sucks & sound effects become annoying as there is lots of heavy breathing whenever
120
+ the cannibal is on screen. The acting sucks, the whole thing was obviously dubbed
121
+ anyway but no one in this thing can act.<br /><br />Sexo Cannibal is a terrible
122
+ film that commits the fatal mistake of being as boring as hell. The only good
123
+ things I can say is that it has a certain sleazy atmosphere to it & those close
124
+ ups of the cannibal chewing meat are pretty gross. Anyone looking for a decent
125
+ cinematic experience should give Sexo Cannibal as wide a berth as possible, one
126
+ to avoid.
127
+ inference: true
128
+ model-index:
129
+ - name: SetFit with sentence-transformers/paraphrase-mpnet-base-v2
130
+ results:
131
+ - task:
132
+ type: text-classification
133
+ name: Text Classification
134
+ dataset:
135
+ name: stanfordnlp/imdb
136
+ type: stanfordnlp/imdb
137
+ split: test
138
+ metrics:
139
+ - type: accuracy
140
+ value: 0.8242
141
+ name: Accuracy
142
+ ---
143
+
144
+ # SetFit with sentence-transformers/paraphrase-mpnet-base-v2
145
+
146
+ This is a [SetFit](https://github.com/huggingface/setfit) model trained on the [stanfordnlp/imdb](https://huggingface.co/datasets/stanfordnlp/imdb) dataset that can be used for Text Classification. This SetFit model uses [sentence-transformers/paraphrase-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-mpnet-base-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
147
+
148
+ The model has been trained using an efficient few-shot learning technique that involves:
149
+
150
+ 1. Fine-tuning a [Sentence Transformer](https://www.sbert.net) with contrastive learning.
151
+ 2. Training a classification head with features from the fine-tuned Sentence Transformer.
152
+
153
+ ## Model Details
154
+
155
+ ### Model Description
156
+ - **Model Type:** SetFit
157
+ - **Sentence Transformer body:** [sentence-transformers/paraphrase-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-mpnet-base-v2)
158
+ - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
159
+ - **Maximum Sequence Length:** 512 tokens
160
+ - **Number of Classes:** 2 classes
161
+ - **Training Dataset:** [stanfordnlp/imdb](https://huggingface.co/datasets/stanfordnlp/imdb)
162
+ <!-- - **Language:** Unknown -->
163
+ <!-- - **License:** Unknown -->
164
+
165
+ ### Model Sources
166
+
167
+ - **Repository:** [SetFit on GitHub](https://github.com/huggingface/setfit)
168
+ - **Paper:** [Efficient Few-Shot Learning Without Prompts](https://arxiv.org/abs/2209.11055)
169
+ - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
170
+
171
+ ### Model Labels
172
+ | Label | Examples |
173
+ |:------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
174
+ | 0 | <ul><li>"I can't believe that those praising this movie herein aren't thinking of some other film. I was prepared for the possibility that this would be awful, but the script (or lack thereof) makes for a film that's also pointless. On the plus side, the general level of craft on the part of the actors and technical crew is quite competent, but when you've got a sow's ear to work with you can't make a silk purse. Ben G fans should stick with just about any other movie he's been in. Dorothy S fans should stick to Galaxina. Peter B fans should stick to Last Picture Show and Target. Fans of cheap laughs at the expense of those who seem to be asking for it should stick to Peter B's amazingly awful book, Killing of the Unicorn."</li><li>'This has to be the worst piece of garbage I\'ve seen in a while.<br /><br />Heath Ledger is a heartthrob? He looked deformed. I wish I\'d known that he and Naomi Watts are an item in real life because I spent 2 of the longest hours of my life wondering what she saw in him. <br /><br />Orlando Bloom is a heartthrob? With the scraggly beard and deer-in-the-headlights look about him, I can\'t say I agree.<br /><br />Rachel Griffiths was her usual fabulous self, but Geoffrey Rush looked as if he couldn\'t wait to get off the set. <br /><br />I\'m supposed to feel sorry for bankrobbers and murderers? This is a far cry from Butch Cassidy, which actually WAS an entertaining film. This was trite, cliche-ridden and boring. We only stayed because we were convinced it would get better. It didn\'t.<br /><br />The last 10-15 minutes or so were unintentionally hilarious. Heath and his gang are holed up in a frontier hotel, and women and children are dying because of their presence. That\'s not funny. But it was funny when they walked out of the hotel with the armor on, because all we could think of was the Black Knight from Monty Python and the Holy Grail. I kept waiting for them to say "I\'ll bite yer leg off!" We were howling with laughter, as were several other warped members of the audience. When we left, pretty much everyone was talking about what a waste of time this film was.<br /><br />I may not have paid cash to see this disaster (sneak preview), but it certainly wasn\'t free. It cost me 2 hours of my life that I will never get back.'</li><li>"This movie was awful. The ending was absolutely horrible. There was no plot to the movie whatsoever. The only thing that was decent about the movie was the acting done by Robert DuVall and James Earl Jones. Their performances were excellent! The only problem was that the movie did not do their acting performances any justice. If the script would have come close to capturing a halfway decent story, it would be worth watching. Instead, Robert DuVall's and James Earl Jones' performances are completely wasted on a god awful storyline...or lack thereof. Not only was I left waiting throughout the movie for something to happen to make the movie....well an actual movie...not just utterless dialog between characters for what ended up being absolutely no reason. It was nothing more than common dialog that would have taken place back in that period of time. There was nothing special about any of the characters. The only thing special was how Robert DuVall portrayed a rambling, senile, drunk, old man. Nothing worthy happens during the entire movie including the end. When the movie ended, I sat amazed...amazed that I sat through the entire movie waiting for something of interest to happen to make watching the movie worth while. It never happened! The cast of characters suddenly started rolling making it apparent that the movie really was over and I realized that I had just wasted 2 hours of my life watching a movie with absolutely no plot and no meaning. It wasn't even a story. The entire movie takes place in a day's worth of time. That's it. It was one day in the life (and death) of some Southerners on a plantation. How much of a story can take place in a single day (other than the movie Training Day)? The acting performances by the entire cast were excellent, but they were grossly wasted on such a disappointment of a movie...if you can even call it a movie."</li></ul> |
175
+ | 1 | <ul><li>"OK its not the best film I've ever seen but at the same time I've been able to sit and watch it TWICE!!! story line was pretty awful and during the first part of the first short story i wondered what the hell i was watching but at the same time it was so awful i loved it cheap laughs all the way.<br /><br />And Jebidia deserves an Oscar for his role in this movie the only thing that let him down was half way through he stopped his silly name calling.<br /><br />overall the film was pretty perfetic but if your after cheap laughs and you see it in pound land go by it."</li><li>"I very much looked forward to this movie. Its a good family movie; however, if Michael Landon Jr.'s editing team did a better job of editing, the movie would be much better. Too many scenes out of context. I do hope there is another movie from the series, they're all very good. But, if another one is made, I beg them to take better care at editing. This story was all over the place and didn't seem to have a center. Which is unfortunate because the other movies of the series were great. I enjoy the story of Willie and Missy; they're both great role models. Plus, the romantic side of the viewers always enjoy a good love story."</li><li>"or anyone who was praying for the sight of Al Cliver wrestling a naked, 7ft tall black guy into a full nelson, your film has arrived! Film starlet Laura Crawford (Ursula Buchfellner) is kidnapped by a group who demand the ransom of $6 million to be delivered to their island hideaway. What they don't count on is rugged Vietnam vet Peter Weston (Cliver) being hired by a film producer to save the girl. And what they really didn't count on was a local tribe that likes to offer up young women to their monster cannibal god with bloodshot bug eyes.<br /><br />Pretty much the same filming set up as CANNIBALS, this one fares a bit better when it comes to entertainment value, thanks mostly a hilarious dub track and the impossibly goofy monster with the bulging eyes (Franco confirms they were split ping pong balls on the disc's interview). Franco gets a strong EuroCult supporting cast including Gisela Hahn (CONTAMINATION) and Werner Pochath (whose death is one of the most head-scratching things I ever seen as a guy who is totally not him is shown - in close up - trying to be him). The film features tons of nudity and the gore (Tempra paint variety) is there. The highlight for me was the world's slowly fistfight between Cliver and Antonio de Cabo in the splashing waves. Sadly, ol' Jess pads this one out to an astonishing (and, at times, agonizing) 1 hour and 40 minutes when it should have run 80 minutes tops. <br /><br />For the most part, the Severin DVD looks pretty nice but there are some odd ghosting images going on during some of the darker scenes. Also, one long section of dialog is in Spanish with no subs (they are an option, but only when you listen to the French track). Franco gives a nice 16- minute interview about the film and has much more pleasant things to say about Buchfellner than his CANNIBALS star Sabrina Siani."</li></ul> |
176
+
177
+ ## Evaluation
178
+
179
+ ### Metrics
180
+ | Label | Accuracy |
181
+ |:--------|:---------|
182
+ | **all** | 0.8242 |
183
+
184
+ ## Uses
185
+
186
+ ### Direct Use for Inference
187
+
188
+ First install the SetFit library:
189
+
190
+ ```bash
191
+ pip install setfit
192
+ ```
193
+
194
+ Then you can load this model and run inference.
195
+
196
+ ```python
197
+ from setfit import SetFitModel
198
+
199
+ # Download from the 🤗 Hub
200
+ model = SetFitModel.from_pretrained("setfit_model_id")
201
+ # Run inference
202
+ preds = model("If only to avoid making this type of film in the future. This film is interesting as an experiment but tells no cogent story.<br /><br />One might feel virtuous for sitting thru it because it touches on so many IMPORTANT issues but it does so without any discernable motive. The viewer comes away with no new perspectives (unless one comes up with one while one's mind wanders, as it will invariably do during this pointless film).<br /><br />One might better spend one's time staring out a window at a tree growing.<br /><br />")
203
+ ```
204
+
205
+ <!--
206
+ ### Downstream Use
207
+
208
+ *List how someone could finetune this model on their own dataset.*
209
+ -->
210
+
211
+ <!--
212
+ ### Out-of-Scope Use
213
+
214
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
215
+ -->
216
+
217
+ <!--
218
+ ## Bias, Risks and Limitations
219
+
220
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
221
+ -->
222
+
223
+ <!--
224
+ ### Recommendations
225
+
226
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
227
+ -->
228
+
229
+ ## Training Details
230
+
231
+ ### Training Set Metrics
232
+ | Training set | Min | Median | Max |
233
+ |:-------------|:----|:---------|:----|
234
+ | Word count | 48 | 244.4571 | 888 |
235
+
236
+ | Label | Training Sample Count |
237
+ |:------|:----------------------|
238
+ | 1 | 7 |
239
+ | 0 | 63 |
240
+
241
+ ### Training Hyperparameters
242
+ - batch_size: (16, 16)
243
+ - num_epochs: (1, 1)
244
+ - max_steps: -1
245
+ - sampling_strategy: oversampling
246
+ - body_learning_rate: (2e-05, 1e-05)
247
+ - head_learning_rate: 0.01
248
+ - loss: CosineSimilarityLoss
249
+ - distance_metric: cosine_distance
250
+ - margin: 0.25
251
+ - end_to_end: False
252
+ - use_amp: False
253
+ - warmup_proportion: 0.1
254
+ - seed: 42
255
+ - eval_max_steps: -1
256
+ - load_best_model_at_end: True
257
+
258
+ ### Training Results
259
+ | Epoch | Step | Training Loss | Validation Loss |
260
+ |:-------:|:-------:|:-------------:|:---------------:|
261
+ | 0.0039 | 1 | 0.2493 | - |
262
+ | 0.1953 | 50 | 0.0016 | - |
263
+ | 0.3906 | 100 | 0.0003 | - |
264
+ | 0.5859 | 150 | 0.003 | - |
265
+ | 0.7812 | 200 | 0.0014 | - |
266
+ | 0.9766 | 250 | 0.0002 | - |
267
+ | **1.0** | **256** | **-** | **0.4699** |
268
+
269
+ * The bold row denotes the saved checkpoint.
270
+ ### Framework Versions
271
+ - Python: 3.8.19
272
+ - SetFit: 1.0.3
273
+ - Sentence Transformers: 3.0.1
274
+ - Transformers: 4.39.0
275
+ - PyTorch: 2.4.0
276
+ - Datasets: 2.20.0
277
+ - Tokenizers: 0.15.2
278
+
279
+ ## Citation
280
+
281
+ ### BibTeX
282
+ ```bibtex
283
+ @article{https://doi.org/10.48550/arxiv.2209.11055,
284
+ doi = {10.48550/ARXIV.2209.11055},
285
+ url = {https://arxiv.org/abs/2209.11055},
286
+ author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
287
+ keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
288
+ title = {Efficient Few-Shot Learning Without Prompts},
289
+ publisher = {arXiv},
290
+ year = {2022},
291
+ copyright = {Creative Commons Attribution 4.0 International}
292
+ }
293
+ ```
294
+
295
+ <!--
296
+ ## Glossary
297
+
298
+ *Clearly define terms in order to be accessible across audiences.*
299
+ -->
300
+
301
+ <!--
302
+ ## Model Card Authors
303
+
304
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
305
+ -->
306
+
307
+ <!--
308
+ ## Model Card Contact
309
+
310
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
311
+ -->
config.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "setfit/step_256",
3
+ "architectures": [
4
+ "MPNetModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "bos_token_id": 0,
8
+ "eos_token_id": 2,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 3072,
14
+ "layer_norm_eps": 1e-05,
15
+ "max_position_embeddings": 514,
16
+ "model_type": "mpnet",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 12,
19
+ "pad_token_id": 1,
20
+ "relative_attention_num_buckets": 32,
21
+ "torch_dtype": "float32",
22
+ "transformers_version": "4.39.0",
23
+ "vocab_size": 30527
24
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.0.1",
4
+ "transformers": "4.39.0",
5
+ "pytorch": "2.4.0"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": null
10
+ }
config_setfit.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "labels": [
3
+ "1",
4
+ "0"
5
+ ],
6
+ "normalize_embeddings": false
7
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a246a9bff2a49daa7c720648bb2fe7731371fef61173c5fe2b39d3e834768cf1
3
+ size 437967672
model_head.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:acf495d66eebbc7f9587d2db39b675ca2f27db50f59f36583889482de74df204
3
+ size 6991
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "[UNK]",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "104": {
28
+ "content": "[UNK]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "30526": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "cls_token": "<s>",
47
+ "do_basic_tokenize": true,
48
+ "do_lower_case": true,
49
+ "eos_token": "</s>",
50
+ "mask_token": "<mask>",
51
+ "max_length": 512,
52
+ "model_max_length": 512,
53
+ "never_split": null,
54
+ "pad_to_multiple_of": null,
55
+ "pad_token": "<pad>",
56
+ "pad_token_type_id": 0,
57
+ "padding_side": "right",
58
+ "sep_token": "</s>",
59
+ "stride": 0,
60
+ "strip_accents": null,
61
+ "tokenize_chinese_chars": true,
62
+ "tokenizer_class": "MPNetTokenizer",
63
+ "truncation_side": "right",
64
+ "truncation_strategy": "longest_first",
65
+ "unk_token": "[UNK]"
66
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff