Upload folder using huggingface_hub
- README.md +82 -134
- config.json +1 -1
- model.safetensors +1 -1
README.md
CHANGED
@@ -6,97 +6,70 @@ tags:
|
|
6 |
- generated_from_trainer
|
7 |
- dataset_size:25743
|
8 |
- loss:MultipleNegativesRankingLoss
|
9 |
-
base_model:
|
10 |
widget:
|
11 |
-
- source_sentence:
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
|
18 |
-
to an end. Suddenly she was back on top. People With Money reports on Monday (September
|
19 |
-
23) that Thunberg is the highest-paid activist in the world, with an astonishing
|
20 |
-
$46 million between August 2018 and August 2019, a nearly $20 million lead over
|
21 |
-
her closest competition. . Factors of people with money In compiling this annual
|
22 |
-
list, the magazine considers factors such as down payment, profit sharing, residuals,
|
23 |
-
endorsements and advertising work. The Swedish activist has an estimated net worth
|
24 |
-
of $145 million. She owes her fortune to savvy stock investments, substantial
|
25 |
-
property holdings, lucrative endorsement deals with CoverGirl cosmetics. He also
|
26 |
-
owns several restaurants (the "Fat Thunberg Burger" chain) in Stockholm, a football
|
27 |
-
team (the "Stockholm Angels"), has launched his own brand of vodka (Pure Wonderthunberg
|
28 |
-
- Sweden), and is tackling the youth market with best-selling perfume (With Love
|
29 |
-
from Greta) and a fashion line called "Greta Thunberg Seduction." The ranking
|
30 |
-
is significant for many Greta fans, who have been waiting for her triumphant return
|
31 |
-
to the glory days for what seems like a lifetime.'
|
32 |
sentences:
|
33 |
-
-
|
34 |
-
-
|
35 |
-
|
36 |
-
-
|
37 |
-
|
38 |
-
|
39 |
-
|
40 |
-
|
41 |
-
|
42 |
-
of caution and using newer sheets of drywall only. "We realize that many British
|
43 |
-
Columbians have been waiting for a long time for the opportunity to get intimate
|
44 |
-
and a few of those who will choose to use glory holes need to be aware that older
|
45 |
-
gypsum sheet that may be sitting in the basement or backyard will often contain
|
46 |
-
toxic levels of asbestos." Henry told reporters. "The safest method yet to ensure
|
47 |
-
zero risk of exposure to Covid 19 is getting off alone, using your own imagination,
|
48 |
-
toys and electronic devices." The health officer added. "I know this is a topic
|
49 |
-
that makes some people uncomfortable but I need to add that I've come across a
|
50 |
-
few hospital reports of young men getting injured getting their genitals stuck
|
51 |
-
in a small hole they made in a wall with a kitchen knife." Dr, Henry said with
|
52 |
-
a softer tone. "Just to be clear, glory holes are small openings in a wall where
|
53 |
-
a person will offer their intimate parts to a partner who is on the opposite side
|
54 |
-
who will be pleasuring the exposed parts, ideally, until full satisfaction. This
|
55 |
-
is all done in the spirit of doing your share to flatten the curve while having
|
56 |
-
a bit of fun this summer."
|
57 |
sentences:
|
58 |
-
-
|
59 |
-
-
|
60 |
-
-
|
61 |
-
- source_sentence:
|
62 |
-
|
63 |
-
|
64 |
-
Qatar Foundati
|
65 |
sentences:
|
66 |
-
-
|
67 |
-
|
68 |
-
|
69 |
-
-
|
70 |
-
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
sentences:
|
75 |
-
- Photo of
|
76 |
-
-
|
77 |
-
|
78 |
-
a
|
79 |
-
|
80 |
-
|
81 |
sentences:
|
82 |
-
-
|
83 |
-
|
84 |
-
-
|
85 |
-
|
86 |
-
|
87 |
pipeline_tag: sentence-similarity
|
88 |
library_name: sentence-transformers
|
89 |
---
|
90 |
|
91 |
-
# SentenceTransformer based on
|
92 |
|
93 |
-
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [
|
94 |
|
95 |
## Model Details
|
96 |
|
97 |
### Model Description
|
98 |
- **Model Type:** Sentence Transformer
|
99 |
-
- **Base model:** [
|
100 |
- **Maximum Sequence Length:** 512 tokens
|
101 |
- **Output Dimensionality:** 1024 dimensions
|
102 |
- **Similarity Function:** Cosine Similarity
|
@@ -138,9 +111,9 @@ from sentence_transformers import SentenceTransformer
|
|
138 |
model = SentenceTransformer("sentence_transformers_model_id")
|
139 |
# Run inference
|
140 |
sentences = [
|
141 |
-
|
142 |
-
'
|
143 |
-
'
|
144 |
]
|
145 |
embeddings = model.encode(sentences)
|
146 |
print(embeddings.shape)
|
@@ -200,13 +173,13 @@ You can finetune this model on your own dataset.
|
|
200 |
| | sentence_0 | sentence_1 | label |
|
201 |
|:--------|:------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------|
|
202 |
| type | string | string | float |
|
203 |
-
| details | <ul><li>min: 2 tokens</li><li>mean:
|
204 |
* Samples:
|
205 |
-
| sentence_0
|
206 |
-
|
207 |
-
| <code>
|
208 |
-
| <code>
|
209 |
-
| <code>
|
210 |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
211 |
```json
|
212 |
{
|
@@ -348,56 +321,31 @@ You can finetune this model on your own dataset.
|
|
348 |
### Training Logs
|
349 |
| Epoch | Step | Training Loss |
|
350 |
|:------:|:-----:|:-------------:|
|
351 |
-
| 0.0388 | 500 | 0.
|
352 |
-
| 0.0777 | 1000 | 0.
|
353 |
-
| 0.1165 | 1500 | 0.
|
354 |
-
| 0.1554 | 2000 | 0.
|
355 |
-
| 0.1942 | 2500 | 0.
|
356 |
-
| 0.2331 | 3000 | 0.
|
357 |
-
| 0.2719 | 3500 | 0.
|
358 |
-
| 0.3108 | 4000 | 0.
|
359 |
-
| 0.3496 | 4500 | 0.
|
360 |
-
| 0.3884 | 5000 | 0.
|
361 |
-
| 0.4273 | 5500 | 0.
|
362 |
-
| 0.4661 | 6000 | 0.
|
363 |
-
| 0.5050 | 6500 | 0.
|
364 |
-
| 0.5438 | 7000 | 0.
|
365 |
-
| 0.5827 | 7500 | 0.
|
366 |
-
| 0.6215 | 8000 | 0.
|
367 |
-
| 0.6603 | 8500 | 0.
|
368 |
-
| 0.6992 | 9000 | 0.
|
369 |
-
| 0.7380 | 9500 | 0.
|
370 |
-
| 0.7769 | 10000 | 0.
|
371 |
-
| 0.8157 | 10500 | 0.
|
372 |
-
| 0.8546 | 11000 | 0.
|
373 |
-
| 0.8934 | 11500 | 0.
|
374 |
-
| 0.9323 | 12000 | 0.
|
375 |
-
| 0.9711 | 12500 | 0.
|
376 |
-
| 0.0388 | 500 | 0.0222 |
|
377 |
-
| 0.0777 | 1000 | 0.0254 |
|
378 |
-
| 0.1165 | 1500 | 0.0206 |
|
379 |
-
| 0.1554 | 2000 | 0.0109 |
|
380 |
-
| 0.1942 | 2500 | 0.0236 |
|
381 |
-
| 0.2331 | 3000 | 0.0167 |
|
382 |
-
| 0.2719 | 3500 | 0.022 |
|
383 |
-
| 0.3108 | 4000 | 0.0148 |
|
384 |
-
| 0.3496 | 4500 | 0.0342 |
|
385 |
-
| 0.3884 | 5000 | 0.0084 |
|
386 |
-
| 0.4273 | 5500 | 0.0237 |
|
387 |
-
| 0.4661 | 6000 | 0.0156 |
|
388 |
-
| 0.5050 | 6500 | 0.022 |
|
389 |
-
| 0.5438 | 7000 | 0.0257 |
|
390 |
-
| 0.5827 | 7500 | 0.031 |
|
391 |
-
| 0.6215 | 8000 | 0.0123 |
|
392 |
-
| 0.6603 | 8500 | 0.0174 |
|
393 |
-
| 0.6992 | 9000 | 0.0107 |
|
394 |
-
| 0.7380 | 9500 | 0.0166 |
|
395 |
-
| 0.7769 | 10000 | 0.0185 |
|
396 |
-
| 0.8157 | 10500 | 0.0087 |
|
397 |
-
| 0.8546 | 11000 | 0.0128 |
|
398 |
-
| 0.8934 | 11500 | 0.007 |
|
399 |
-
| 0.9323 | 12000 | 0.0048 |
|
400 |
-
| 0.9711 | 12500 | 0.0267 |
|
401 |
|
402 |
|
403 |
### Framework Versions
|
|
|
6 |
- generated_from_trainer
|
7 |
- dataset_size:25743
|
8 |
- loss:MultipleNegativesRankingLoss
|
9 |
+
base_model: am-azadi/bilingual-embedding-large_Fine_Tuned
|
10 |
widget:
|
11 |
+
- source_sentence: Anabel Hernandez This morning I received this in an envelope to
|
12 |
+
my name that said "We're coming for you." Both me and my family we feel safe.
|
13 |
+
This happens when the government Instead of defending the press is the first that
|
14 |
+
we attack. 00| After exhibiting negotiations between the Morenoite government
|
15 |
+
and organized crime, drug trafficking expert Anabel Hernández denounces being
|
16 |
+
harassed and receiving death threats. We demand to activate the protection protocol
|
17 |
+
for journalists
|
18 |
sentences:
|
19 |
+
- Covid-19 vaccines contain tracking devices
|
20 |
+
- The Mexican journalist Anabel Hernández denounced on Twitter a bullet threat received
|
21 |
+
in an envelope
|
22 |
+
- Video shows Indian PM Modi giving a speech in January 2022 in Goa
|
23 |
+
- source_sentence: Richard German 19 min. According to a publication of portal Plain
|
24 |
+
Words, the mother of the Chapo Guzman, Maria Consuelo Loera Pérez contributed
|
25 |
+
to Morena a total of 900 millions. brunette pr FOUNDER BONG $900,000,000 fometre.
|
26 |
+
Marma of mine ... September 12 48149 To Yo K R 89 art Rib Parts Hodon yesWITHOUT
|
27 |
+
WORDS
|
28 |
sentences:
|
29 |
+
- Members of the Vox political party in Spain hold a banner against a sex toy
|
30 |
+
- Image shows Islamic missionaries killed in Pakistan
|
31 |
+
- The mother of "Chapo" Guzmán donated 900 million pesos to Morena
|
32 |
+
- source_sentence: EXA playground for children and children Stay from 7am to 8pm permitted.
|
33 |
+
head of state Hanover subject area Un and city green Languatastro17 2016 ver Telepo,
|
34 |
+
160-439 11It really is time to stop giving these idiots any more leeway!
|
|
|
35 |
sentences:
|
36 |
+
- The city of Hanover changed the word "children" to "children" on a playground
|
37 |
+
sign
|
38 |
+
- Russia Did Target Africa With Missiles For Talking About Its War With Ukraine
|
39 |
+
- Periodic table assembled to say “F**k Boris” in front of the UK PM while he attends
|
40 |
+
a press conference
|
41 |
+
- source_sentence: on the encounter Why is Faith sad?When Telangana Police killed
|
42 |
+
the victims of Hyderabad rape and murder case in an encounter, Kavi Kumar Vishwas
|
43 |
+
gave a big statement! , , ,
|
44 |
sentences:
|
45 |
+
- Photo of suspects killed by police in Hyderabad rape-murder case
|
46 |
+
- A video in which the water washes away several buildings, supposedly in Turkey,
|
47 |
+
has been shared more than 220,000 times on Facebook. In reality, it is an altered
|
48 |
+
version of a video recorded during the tsunami that hit Japan in 2011. In 9 minutes
|
49 |
+
an entire neighborhood in Turkey disappears
|
50 |
+
- “vaccinated individuals carry 251 times the load of COVID-19 viruses in their
|
51 |
+
nostrils compared to the unvaccinated”
|
52 |
+
- source_sentence: Well it is Australian Dr's point of view that Queen Elizabeth may
|
53 |
+
benefit from Ivermectin, that's right common old horse paste.
|
54 |
sentences:
|
55 |
+
- Australian doctor told news programme that Queen Elizabeth II should use ivermectin
|
56 |
+
to treat Covid-19
|
57 |
+
- These 12 rights are no longer in the proposed Chilean Constitution
|
58 |
+
- “If you want to know who controls you, just look at whom you cannot criticize,”
|
59 |
+
Voltaire said.
|
60 |
pipeline_tag: sentence-similarity
|
61 |
library_name: sentence-transformers
|
62 |
---
|
63 |
|
64 |
+
# SentenceTransformer based on am-azadi/bilingual-embedding-large_Fine_Tuned
|
65 |
|
66 |
+
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [am-azadi/bilingual-embedding-large_Fine_Tuned](https://huggingface.co/am-azadi/bilingual-embedding-large_Fine_Tuned). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
|
67 |
|
68 |
## Model Details
|
69 |
|
70 |
### Model Description
|
71 |
- **Model Type:** Sentence Transformer
|
72 |
+
- **Base model:** [am-azadi/bilingual-embedding-large_Fine_Tuned](https://huggingface.co/am-azadi/bilingual-embedding-large_Fine_Tuned) <!-- at revision 0b9b4a8c5e6212588461d6df7c9428f571b1c823 -->
|
73 |
- **Maximum Sequence Length:** 512 tokens
|
74 |
- **Output Dimensionality:** 1024 dimensions
|
75 |
- **Similarity Function:** Cosine Similarity
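
The card lists cosine similarity as the similarity function for the 1024-dimensional embeddings. As a minimal sketch of what that score computes, the snippet below uses plain Python on short made-up vectors. `cosine_similarity` is a hypothetical helper written for illustration, not part of sentence-transformers, and real embeddings would come from `model.encode`.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy 3-dimensional stand-ins for embeddings (assumed values, not model output).
post = [1.0, 2.0, 0.0]
matching_claim = [2.0, 4.0, 0.0]   # same direction as `post`
unrelated_claim = [0.0, 0.0, 3.0]  # orthogonal to `post`

print(cosine_similarity(post, matching_claim))
print(cosine_similarity(post, unrelated_claim))
```

Because the score depends only on direction, scaling a vector does not change it, which is why the second toy vector scores as a perfect match for the first.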
|
|
|
111 |
model = SentenceTransformer("sentence_transformers_model_id")
|
112 |
# Run inference
|
113 |
sentences = [
|
114 |
+
"Well it is Australian Dr's point of view that Queen Elizabeth may benefit from Ivermectin, that's right common old horse paste.",
|
115 |
+
'Australian doctor told news programme that Queen Elizabeth II should use ivermectin to treat Covid-19',
|
116 |
+
'These 12 rights are no longer in the proposed Chilean Constitution',
|
117 |
]
|
118 |
embeddings = model.encode(sentences)
|
119 |
print(embeddings.shape)
|
|
|
173 |
| | sentence_0 | sentence_1 | label |
|
174 |
|:--------|:------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------|
|
175 |
| type | string | string | float |
|
176 |
+
| details | <ul><li>min: 2 tokens</li><li>mean: 116.09 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 19.96 tokens</li><li>max: 130 tokens</li></ul> | <ul><li>min: 1.0</li><li>mean: 1.0</li><li>max: 1.0</li></ul> |
|
177 |
* Samples:
|
178 |
+
| sentence_0 | sentence_1 | label |
|
179 |
+
|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
|
180 |
+
| <code>10 66% 8:06 PM M<R RIVALS Extinction Rebellion Member charged with starting Green Wattle Creek bushfi n Hussey AAP/7NEWS Sunday, 5 January 2020 12:44 pm FILE IMAGE: A 26-year-old Man from Sydney has been charged with intentionally lighting four fire Credit: KELLY BARNES/AAP A 26-year-old Man from Sydney has been charged with intentionally lighting four grass and scrub fires in the state's southeast in recent days Police will allege the man lit fires on December 15 and January 1, and th two on Saturday, all around the Anderson Fire Trail, Wentworth Falls In the video above: Bushfires continue to rage across Australia It's all about the dessert.This taken from another page.. I don't know if it's true or not but this is what it said.. "This photo was taken of a story CH7 did and posted to their FB page. Now, that post has been removed and no trace of it can be found on google. Share it far and wide "</code> | <code>Extinction Rebellion member charged with bushfire arson</code> | <code>1.0</code> |
|
181 |
+
| <code>Even with the coronavirus, no one wants to eat Vegan</code> | <code>A photo shows that, even with the coronavirus, no one wants to eat vegan</code> | <code>1.0</code> |
|
182 |
+
| <code>CONFUSION IN BENI: For the Population of this Corner, the Mask has just Fallen from the MONUSCO-ADF Marriage. Attached, the Adf surprised in the Monusco facilities with FARDC and PNC outfits. Let's figure it out.</code> | <code>Members of the armed group of Ugandan origin of the Allied Democratic Forces (ADF), surprised in the UN installations with the uniforms of the Congolese army</code> | <code>1.0</code> |
|
183 |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
184 |
```json
|
185 |
{
|
|
|
321 |
### Training Logs
|
322 |
| Epoch | Step | Training Loss |
|
323 |
|:------:|:-----:|:-------------:|
|
324 |
+
| 0.0388 | 500 | 0.0125 |
|
325 |
+
| 0.0777 | 1000 | 0.0072 |
|
326 |
+
| 0.1165 | 1500 | 0.0209 |
|
327 |
+
| 0.1554 | 2000 | 0.0112 |
|
328 |
+
| 0.1942 | 2500 | 0.0268 |
|
329 |
+
| 0.2331 | 3000 | 0.017 |
|
330 |
+
| 0.2719 | 3500 | 0.0211 |
|
331 |
+
| 0.3108 | 4000 | 0.039 |
|
332 |
+
| 0.3496 | 4500 | 0.013 |
|
333 |
+
| 0.3884 | 5000 | 0.0225 |
|
334 |
+
| 0.4273 | 5500 | 0.0182 |
|
335 |
+
| 0.4661 | 6000 | 0.0208 |
|
336 |
+
| 0.5050 | 6500 | 0.0071 |
|
337 |
+
| 0.5438 | 7000 | 0.0071 |
|
338 |
+
| 0.5827 | 7500 | 0.0132 |
|
339 |
+
| 0.6215 | 8000 | 0.0101 |
|
340 |
+
| 0.6603 | 8500 | 0.015 |
|
341 |
+
| 0.6992 | 9000 | 0.0062 |
|
342 |
+
| 0.7380 | 9500 | 0.0037 |
|
343 |
+
| 0.7769 | 10000 | 0.0061 |
|
344 |
+
| 0.8157 | 10500 | 0.0056 |
|
345 |
+
| 0.8546 | 11000 | 0.0084 |
|
346 |
+
| 0.8934 | 11500 | 0.0208 |
|
347 |
+
| 0.9323 | 12000 | 0.0052 |
|
348 |
+
| 0.9711 | 12500 | 0.0081 |
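
The Epoch and Step columns in the log above are consistent with roughly 12,872 optimizer steps per epoch, which for `dataset_size:25743` would correspond to an effective batch size of 2. The batch size is not visible in this diff, so treat it as an inference. The sketch below only checks the arithmetic; `EFFECTIVE_BATCH_SIZE` and `epoch_at_step` are names assumed for illustration.

```python
import math

DATASET_SIZE = 25743        # from the `dataset_size:25743` tag in the card metadata
EFFECTIVE_BATCH_SIZE = 2    # assumption inferred from the log, not stated in the diff

# Number of optimizer steps needed to see every training pair once.
steps_per_epoch = math.ceil(DATASET_SIZE / EFFECTIVE_BATCH_SIZE)


def epoch_at_step(step):
    """Fraction of an epoch completed after `step` optimizer steps."""
    return step / steps_per_epoch


# Reproduce a few Epoch values from the Step column of the training log.
for step in (500, 6500, 12500):
    print(f"{epoch_at_step(step):.4f} | {step}")
```

If the assumed batch size were wrong, the printed fractions would drift away from the logged Epoch values as the step count grows.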
|
349 |
|
350 |
|
351 |
### Framework Versions
|
config.json
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
{
|
2 |
-
"_name_or_path": "
|
3 |
"architectures": [
|
4 |
"BilingualModel"
|
5 |
],
|
|
|
1 |
{
|
2 |
+
"_name_or_path": "am-azadi/bilingual-embedding-large_Fine_Tuned",
|
3 |
"architectures": [
|
4 |
"BilingualModel"
|
5 |
],
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 2239607176
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d5027ccc9b2e2301ff74039dbd9841c268da1b7877d167ab214a86fa5392173f
|
3 |
size 2239607176
|