am-azadi commited on
Commit
9a0c83b
·
verified ·
1 Parent(s): 0b9b4a8

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. README.md +82 -134
  2. config.json +1 -1
  3. model.safetensors +1 -1
README.md CHANGED
@@ -6,97 +6,70 @@ tags:
6
  - generated_from_trainer
7
  - dataset_size:25743
8
  - loss:MultipleNegativesRankingLoss
9
- base_model: Lajavaness/bilingual-embedding-large
10
  widget:
11
- - source_sentence: 'Greta Thunberg leads People With Money''s annual list of the "100
12
- highest-paid activists" released on Monday. It''s been a rough year for the activist,
13
- but at least she has her millions of dollars to ease the pain. Greta Thunberg,
14
- 16, has been ranked No. 1 on People With Money''s 10 highest-paid activists for
15
- 2019 with an estimated $46 million in combined earnings. UPDATE 09/24/2019: This
16
- story appears to be false. (Read more) Greta Thunberg tops annual list of highest-paid
17
- activists In 2017 it seemed that the activist''s spectacular career was coming
18
- to an end. Suddenly she was back on top. People With Money reports on Monday (September
19
- 23) that Thunberg is the highest-paid activist in the world, with an astonishing
20
- $46 million between August 2018 and August 2019, a nearly $20 million lead over
21
- her closest competition. . Factors of people with money In compiling this annual
22
- list, the magazine considers factors such as down payment, profit sharing, residuals,
23
- endorsements and advertising work. The Swedish activist has an estimated net worth
24
- of $145 million. She owes her fortune to savvy stock investments, substantial
25
- property holdings, lucrative endorsement deals with CoverGirl cosmetics. He also
26
- owns several restaurants (the "Fat Thunberg Burger" chain) in Stockholm, a football
27
- team (the "Stockholm Angels"), has launched his own brand of vodka (Pure Wonderthunberg
28
- - Sweden), and is tackling the youth market with best-selling perfume (With Love
29
- from Greta) and a fashion line called "Greta Thunberg Seduction." The ranking
30
- is significant for many Greta fans, who have been waiting for her triumphant return
31
- to the glory days for what seems like a lifetime.'
32
  sentences:
33
- - Donald Trump next to a stack of declassified files
34
- - Greta Thunberg leads People With Money's annual list of the "100 highest-paid
35
- activists"
36
- - Women should not to use shampoo during their periods “because the pores of the
37
- head are open during menstruation and it can cause headache”.
38
- - source_sentence: Dr. Bonnie Henry, British Columbia's Provincial Health Officer
39
- made a rare Saturday appearance in the media to issue a warning to people who
40
- plan on using old pieces of gypsum boards to make their own glory holes, saying
41
- that there is a risk of exposure to asbestos and recommends airing on the side
42
- of caution and using newer sheets of drywall only. "We realize that many British
43
- Columbians have been waiting for a long time for the opportunity to get intimate
44
- and a few of those who will choose to use glory holes need to be aware that older
45
- gypsum sheet that may be sitting in the basement or backyard will often contain
46
- toxic levels of asbestos." Henry told reporters. "The safest method yet to ensure
47
- zero risk of exposure to Covid 19 is getting off alone, using your own imagination,
48
- toys and electronic devices." The health officer added. "I know this is a topic
49
- that makes some people uncomfortable but I need to add that I've come across a
50
- few hospital reports of young men getting injured getting their genitals stuck
51
- in a small hole they made in a wall with a kitchen knife." Dr, Henry said with
52
- a softer tone. "Just to be clear, glory holes are small openings in a wall where
53
- a person will offer their intimate parts to a partner who is on the opposite side
54
- who will be pleasuring the exposed parts, ideally, until full satisfaction. This
55
- is all done in the spirit of doing your share to flatten the curve while having
56
- a bit of fun this summer."
57
  sentences:
58
- - Petai can treat all types of cancer and muscle pain
59
- - Terrible scene of dead is prepared to generate fear by covid-19 in the population
60
- - BC Minister of Health warned against asbestos poisoning when using glory holes
61
- - source_sentence: THIS YOUNG CHADIAN WHO HAS A GUN POINTED ON THE CHEST BY A FRENCH
62
- SOLDIER IN HIS COUNTRY BECAUSE HE DARED TO DEMONSTRATE AGAINST THE DESPOTIC REGIME
63
- IN PAYMENT TO FRANCE IS THE NEW SYMBOL OF RESISTANCE TO FRENCH NEO-COLONIALISM.
64
- Qatar Foundati
65
  sentences:
66
- - This young Chadian who has a gun pointed at his chest by a French soldier in his
67
- country because he dared to demonstrate against the despotic regime in the pay
68
- of France
69
- - CNN uses footage of an explosion in Ukraine from 2015 to illustrate the current
70
- conflict.
71
- - Woman gives birth despite having no womb
72
- - source_sentence: The Prime Minister of Israel is infected with the Corona virus,
73
- pray for him, may God have mercy on us from him
74
  sentences:
75
- - Photo of Netanyahu in hospital with coronavirus
76
- - Documents about Covid-19 written by Dr. Thanin Kongsuk?
77
- - Police warn burglars are going "door to door" with contaminated face masks in
78
- a new scam
79
- - source_sentence: Jose Mujica Rafael Correa The United Nations appoints them like
80
- the best presidents in the world AND THEY ARE LEFT
 
 
 
81
  sentences:
82
- - José Mujica and Rafael Correa were chosen by the United Nations as the best presidents
83
- in the world
84
- - Vaccination made the number of deaths from Covid-19 this year surpass that of
85
- 2020
86
- - China defeated the new coronavirus
87
  pipeline_tag: sentence-similarity
88
  library_name: sentence-transformers
89
  ---
90
 
91
- # SentenceTransformer based on Lajavaness/bilingual-embedding-large
92
 
93
- This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Lajavaness/bilingual-embedding-large](https://huggingface.co/Lajavaness/bilingual-embedding-large). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
94
 
95
  ## Model Details
96
 
97
  ### Model Description
98
  - **Model Type:** Sentence Transformer
99
- - **Base model:** [Lajavaness/bilingual-embedding-large](https://huggingface.co/Lajavaness/bilingual-embedding-large) <!-- at revision e83179d7a66e8aed1b3015e98bb5ae234ed89598 -->
100
  - **Maximum Sequence Length:** 512 tokens
101
  - **Output Dimensionality:** 1024 dimensions
102
  - **Similarity Function:** Cosine Similarity
@@ -138,9 +111,9 @@ from sentence_transformers import SentenceTransformer
138
  model = SentenceTransformer("sentence_transformers_model_id")
139
  # Run inference
140
  sentences = [
141
- 'Jose Mujica Rafael Correa The United Nations appoints them like the best presidents in the world AND THEY ARE LEFT',
142
- 'José Mujica and Rafael Correa were chosen by the United Nations as the best presidents in the world',
143
- 'Vaccination made the number of deaths from Covid-19 this year surpass that of 2020',
144
  ]
145
  embeddings = model.encode(sentences)
146
  print(embeddings.shape)
@@ -200,13 +173,13 @@ You can finetune this model on your own dataset.
200
  | | sentence_0 | sentence_1 | label |
201
  |:--------|:------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------|
202
  | type | string | string | float |
203
- | details | <ul><li>min: 2 tokens</li><li>mean: 110.17 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 19.45 tokens</li><li>max: 190 tokens</li></ul> | <ul><li>min: 1.0</li><li>mean: 1.0</li><li>max: 1.0</li></ul> |
204
  * Samples:
205
- | sentence_0 | sentence_1 | label |
206
- |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------|:-----------------|
207
- | <code>best music k.m KOSE CELLIE HINS GUINOT SKIN CARE KWhat people fear most is not being physically disabled, but giving up on themselves. There are still many beautiful things in life to aspire to! This stunning performance, known as the American spirit, brought tears to the eyes of 10,000 spectators. Male dancer Babo has been blind since childhood due to a fire in his home. In order to protect him, his mother held him tightly in her arms and jumped from the 7th floor. The mother died as a result, and the little baby became blind due to bleeding from the fundus. His mother was an ice skater before he died, and Babo also had a soft spot for ice skating. Although he couldn't see anything, he still pursued dance enthusiastically. He danced the famous tango "La Cumparsita" with his partner at the World Figure Skating Championships in Helsinki! 1. His ears are like bats that can measure the sound and distance around him. 2. The female dancer is very amazing. She danced with him and led him for...</code> | <code>Performance by a blind American ice dancer</code> | <code>1.0</code> |
208
- | <code>Photo from 2016. "Good" times when health was "fine" and the press did not report anything about. Bunch of Hypocrites...Let's go fight my people... . left right not army above all</code> | <code>Photo of a hospital in 2016. Good times when health was "good" and the press didn't report anything about it</code> | <code>1.0</code> |
209
- | <code>Haifa Oh Tel Aviv-Yafo Oh N WEST BANK Jerusalem is GAZA STRIPE Be'er Sheva Israel 65 65 35 35 15 M5 10 40Google and Apple maps have officially removed Palestine from the World Maps. Today Palestine was erased from the maps tomorrow Palestine will be erased from the world. PUT PALESTINE BACK ON THE MAP. Please unite now Pakistanio. Enemy is very strong if we are divided. Think just about Pakistan. Support each other, support Pakistan and support your leadership.</code> | <code>Google and Apple removed Palestine from its maps</code> | <code>1.0</code> |
210
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
211
  ```json
212
  {
@@ -348,56 +321,31 @@ You can finetune this model on your own dataset.
348
  ### Training Logs
349
  | Epoch | Step | Training Loss |
350
  |:------:|:-----:|:-------------:|
351
- | 0.0388 | 500 | 0.0334 |
352
- | 0.0777 | 1000 | 0.0595 |
353
- | 0.1165 | 1500 | 0.0597 |
354
- | 0.1554 | 2000 | 0.046 |
355
- | 0.1942 | 2500 | 0.0238 |
356
- | 0.2331 | 3000 | 0.0667 |
357
- | 0.2719 | 3500 | 0.0283 |
358
- | 0.3108 | 4000 | 0.0429 |
359
- | 0.3496 | 4500 | 0.0414 |
360
- | 0.3884 | 5000 | 0.0295 |
361
- | 0.4273 | 5500 | 0.0323 |
362
- | 0.4661 | 6000 | 0.0288 |
363
- | 0.5050 | 6500 | 0.0389 |
364
- | 0.5438 | 7000 | 0.0399 |
365
- | 0.5827 | 7500 | 0.0245 |
366
- | 0.6215 | 8000 | 0.0334 |
367
- | 0.6603 | 8500 | 0.0212 |
368
- | 0.6992 | 9000 | 0.0207 |
369
- | 0.7380 | 9500 | 0.0206 |
370
- | 0.7769 | 10000 | 0.0163 |
371
- | 0.8157 | 10500 | 0.0318 |
372
- | 0.8546 | 11000 | 0.0256 |
373
- | 0.8934 | 11500 | 0.0277 |
374
- | 0.9323 | 12000 | 0.027 |
375
- | 0.9711 | 12500 | 0.0179 |
376
- | 0.0388 | 500 | 0.0222 |
377
- | 0.0777 | 1000 | 0.0254 |
378
- | 0.1165 | 1500 | 0.0206 |
379
- | 0.1554 | 2000 | 0.0109 |
380
- | 0.1942 | 2500 | 0.0236 |
381
- | 0.2331 | 3000 | 0.0167 |
382
- | 0.2719 | 3500 | 0.022 |
383
- | 0.3108 | 4000 | 0.0148 |
384
- | 0.3496 | 4500 | 0.0342 |
385
- | 0.3884 | 5000 | 0.0084 |
386
- | 0.4273 | 5500 | 0.0237 |
387
- | 0.4661 | 6000 | 0.0156 |
388
- | 0.5050 | 6500 | 0.022 |
389
- | 0.5438 | 7000 | 0.0257 |
390
- | 0.5827 | 7500 | 0.031 |
391
- | 0.6215 | 8000 | 0.0123 |
392
- | 0.6603 | 8500 | 0.0174 |
393
- | 0.6992 | 9000 | 0.0107 |
394
- | 0.7380 | 9500 | 0.0166 |
395
- | 0.7769 | 10000 | 0.0185 |
396
- | 0.8157 | 10500 | 0.0087 |
397
- | 0.8546 | 11000 | 0.0128 |
398
- | 0.8934 | 11500 | 0.007 |
399
- | 0.9323 | 12000 | 0.0048 |
400
- | 0.9711 | 12500 | 0.0267 |
401
 
402
 
403
  ### Framework Versions
 
6
  - generated_from_trainer
7
  - dataset_size:25743
8
  - loss:MultipleNegativesRankingLoss
9
+ base_model: am-azadi/bilingual-embedding-large_Fine_Tuned
10
  widget:
11
+ - source_sentence: Anabel Hernandez This morning I received this in an envelope to
12
+ my name that said "We're coming for you." Both me and my family we feel safe.
13
+ This happens when the government Instead of defending the press is the first that
14
+ we attack. 00| After exhibiting negotiations between the Morenoite government
15
+ and organized crime, drug trafficking expert Anabel Hernández denounces being
16
+ harassed and receiving death threats. We demand to activate the protection protocol
17
+ for journalists
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  sentences:
19
+ - Covid-19 vaccines contain tracking devices
20
+ - The Mexican journalist Anabel Hernández denounced on Twitter a bullet threat received
21
+ in an envelope
22
+ - Video shows Indian PM Modi giving a speech in January 2022 in Goa
23
+ - source_sentence: Richard German 19 min. According to a publication of portal Plain
24
+ Words, the mother of the Chapo Guzman, Maria Consuelo Loera Pérez contributed
25
+ to Morena a total of 900 millions. brunette pr FOUNDER BONG $900,000,000 fometre.
26
+ Marma of mine ... September 12 48149 To Yo K R 89 art Rib Parts Hodon yesWITHOUT
27
+ WORDS
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
28
  sentences:
29
+ - Members of the Vox political party in Spain hold a banner against a sex toy
30
+ - Image shows Islamic missionaries killed in Pakistan
31
+ - The mother of "Chapo" Guzmán donated 900 million pesos to Morena
32
+ - source_sentence: EXA playground for children and children Stay from 7am to 8pm permitted.
33
+ head of state Hanover subject area Un and city green Languatastro17 2016 ver Telepo,
34
+ 160-439 11It really is time to stop giving these idiots any more leeway!
 
35
  sentences:
36
+ - The city of Hanover changed the word "children" to "children" on a playground
37
+ sign
38
+ - Russia Did Target Africa With Missiles For Talking About Its War With Ukraine
39
+ - Periodic table assembled to say “F**k Boris” in front of the UK PM while he attends
40
+ a press conference
41
+ - source_sentence: on the encounter Why is Faith sad?When Telangana Police killed
42
+ the victims of Hyderabad rape and murder case in an encounter, Kavi Kumar Vishwas
43
+ gave a big statement! , , ,
44
  sentences:
45
+ - Photo of suspects killed by police in Hyderabad rape-murder case
46
+ - A video in which the water washes away several buildings, supposedly in Turkey,
47
+ has been shared more than 220,000 times on Facebook. In reality, it is an altered
48
+ version of a video recorded during the tsunami that hit Japan in 2011. In 9 minutes
49
+ an entire neighborhood in Turkey disappears
50
+ - “vaccinated individuals carry 251 times the load of COVID-19 viruses in their
51
+ nostrils compared to the unvaccinated”
52
+ - source_sentence: Well it is Australian Dr's point of view that Queen Elizabeth may
53
+ benefit from Ivermectin, that's right common old horse paste.
54
  sentences:
55
+ - Australian doctor told news programme that Queen Elizabeth II should use ivermectin
56
+ to treat Covid-19
57
+ - These 12 rights are no longer in the proposed Chilean Constitution
58
+ - “If you want to know who controls you, just look at whom you cannot criticize,”
59
+ Voltaire said.
60
  pipeline_tag: sentence-similarity
61
  library_name: sentence-transformers
62
  ---
63
 
64
+ # SentenceTransformer based on am-azadi/bilingual-embedding-large_Fine_Tuned
65
 
66
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [am-azadi/bilingual-embedding-large_Fine_Tuned](https://huggingface.co/am-azadi/bilingual-embedding-large_Fine_Tuned). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
67
 
68
  ## Model Details
69
 
70
  ### Model Description
71
  - **Model Type:** Sentence Transformer
72
+ - **Base model:** [am-azadi/bilingual-embedding-large_Fine_Tuned](https://huggingface.co/am-azadi/bilingual-embedding-large_Fine_Tuned) <!-- at revision 0b9b4a8c5e6212588461d6df7c9428f571b1c823 -->
73
  - **Maximum Sequence Length:** 512 tokens
74
  - **Output Dimensionality:** 1024 dimensions
75
  - **Similarity Function:** Cosine Similarity
 
111
  model = SentenceTransformer("sentence_transformers_model_id")
112
  # Run inference
113
  sentences = [
114
+ "Well it is Australian Dr's point of view that Queen Elizabeth may benefit from Ivermectin, that's right common old horse paste.",
115
+ 'Australian doctor told news programme that Queen Elizabeth II should use ivermectin to treat Covid-19',
116
+ 'These 12 rights are no longer in the proposed Chilean Constitution',
117
  ]
118
  embeddings = model.encode(sentences)
119
  print(embeddings.shape)
 
173
  | | sentence_0 | sentence_1 | label |
174
  |:--------|:------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------|
175
  | type | string | string | float |
176
+ | details | <ul><li>min: 2 tokens</li><li>mean: 116.09 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 19.96 tokens</li><li>max: 130 tokens</li></ul> | <ul><li>min: 1.0</li><li>mean: 1.0</li><li>max: 1.0</li></ul> |
177
  * Samples:
178
+ | sentence_0 | sentence_1 | label |
179
+ |:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
180
+ | <code>10 66% 8:06 PM M<R RIVALS Extinction Rebellion Member charged with starting Green Wattle Creek bushfi n Hussey AAP/7NEWS Sunday, 5 January 2020 12:44 pm FILE IMAGE: A 26-year-old Man from Sydney has been charged with intentionally lighting four fire Credit: KELLY BARNES/AAP A 26-year-old Man from Sydney has been charged with intentionally lighting four grass and scrub fires in the state's southeast in recent days Police will allege the man lit fires on December 15 and January 1, and th two on Saturday, all around the Anderson Fire Trail, Wentworth Falls In the video above: Bushfires continue to rage across Australia It's all about the dessert.This taken from another page.. I don't know if it's true or not but this is what it said.. "This photo was taken of a story CH7 did and posted to their FB page. Now, that post has been removed and no trace of it can be found on google. Share it far and wide "</code> | <code>Extinction Rebellion member charged with bushfire arson</code> | <code>1.0</code> |
181
+ | <code>Even with the coronavirus, no one wants to eat Vegan</code> | <code>A photo shows that, even with the coronavirus, no one wants to eat vegan</code> | <code>1.0</code> |
182
+ | <code>CONFUSION IN BENI: For the Population of this Corner, the Mask has just Fallen from the MONUSCO-ADF Marriage. Attached, the Adf surprised in the Monusco facilities with FARDC and PNC outfits. Let's figure it out.</code> | <code>Members of the armed group of Ugandan origin of the Allied Democratic Forces (ADF), surprised in the UN installations with the uniforms of the Congolese army</code> | <code>1.0</code> |
183
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
184
  ```json
185
  {
 
321
  ### Training Logs
322
  | Epoch | Step | Training Loss |
323
  |:------:|:-----:|:-------------:|
324
+ | 0.0388 | 500 | 0.0125 |
325
+ | 0.0777 | 1000 | 0.0072 |
326
+ | 0.1165 | 1500 | 0.0209 |
327
+ | 0.1554 | 2000 | 0.0112 |
328
+ | 0.1942 | 2500 | 0.0268 |
329
+ | 0.2331 | 3000 | 0.017 |
330
+ | 0.2719 | 3500 | 0.0211 |
331
+ | 0.3108 | 4000 | 0.039 |
332
+ | 0.3496 | 4500 | 0.013 |
333
+ | 0.3884 | 5000 | 0.0225 |
334
+ | 0.4273 | 5500 | 0.0182 |
335
+ | 0.4661 | 6000 | 0.0208 |
336
+ | 0.5050 | 6500 | 0.0071 |
337
+ | 0.5438 | 7000 | 0.0071 |
338
+ | 0.5827 | 7500 | 0.0132 |
339
+ | 0.6215 | 8000 | 0.0101 |
340
+ | 0.6603 | 8500 | 0.015 |
341
+ | 0.6992 | 9000 | 0.0062 |
342
+ | 0.7380 | 9500 | 0.0037 |
343
+ | 0.7769 | 10000 | 0.0061 |
344
+ | 0.8157 | 10500 | 0.0056 |
345
+ | 0.8546 | 11000 | 0.0084 |
346
+ | 0.8934 | 11500 | 0.0208 |
347
+ | 0.9323 | 12000 | 0.0052 |
348
+ | 0.9711 | 12500 | 0.0081 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
349
 
350
 
351
  ### Framework Versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "Lajavaness/bilingual-embedding-large",
3
  "architectures": [
4
  "BilingualModel"
5
  ],
 
1
  {
2
+ "_name_or_path": "am-azadi/bilingual-embedding-large_Fine_Tuned",
3
  "architectures": [
4
  "BilingualModel"
5
  ],
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:da3123bc09bad768fd067b3a67bd2752cfae598ee4fd8ac0ae8468cb1ecfe5ab
3
  size 2239607176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d5027ccc9b2e2301ff74039dbd9841c268da1b7877d167ab214a86fa5392173f
3
  size 2239607176