Rubyando59 commited on
Commit
3fcf8f5
1 Parent(s): 938b19c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -944
README.md CHANGED
@@ -1,687 +1,35 @@
1
- ---
2
- base_model: BAAI/bge-base-en-v1.5
3
- datasets: []
4
- language: []
5
- library_name: sentence-transformers
6
- metrics:
7
- - cosine_accuracy@1
8
- - cosine_accuracy@3
9
- - cosine_accuracy@5
10
- - cosine_accuracy@10
11
- - cosine_precision@1
12
- - cosine_precision@3
13
- - cosine_precision@5
14
- - cosine_precision@10
15
- - cosine_recall@1
16
- - cosine_recall@3
17
- - cosine_recall@5
18
- - cosine_recall@10
19
- - cosine_ndcg@10
20
- - cosine_mrr@10
21
- - cosine_map@100
22
- - dot_accuracy@1
23
- - dot_accuracy@3
24
- - dot_accuracy@5
25
- - dot_accuracy@10
26
- - dot_precision@1
27
- - dot_precision@3
28
- - dot_precision@5
29
- - dot_precision@10
30
- - dot_recall@1
31
- - dot_recall@3
32
- - dot_recall@5
33
- - dot_recall@10
34
- - dot_ndcg@10
35
- - dot_mrr@10
36
- - dot_map@100
37
- pipeline_tag: sentence-similarity
38
- tags:
39
- - sentence-transformers
40
- - sentence-similarity
41
- - feature-extraction
42
- - generated_from_trainer
43
- - dataset_size:98400
44
- - loss:MultipleNegativesRankingLoss
45
- widget:
46
- - source_sentence: How did the actions of central banks influence investor sentiment
47
- and market expectations regarding future interest rates?
48
- sentences:
49
- - We may be adversely affected by increased governmental and regulatory scrutiny
50
- or negative publicity. Governmental scrutiny from regulators, legislative bodies
51
- and law enforcement agencies with respect to matters relating to compensation,
52
- our business practices, our past actions and other matters remains at high levels.
53
- Political and public sentiment regarding financial institutions has in the past
54
- resulted and may in the future result in a significant amount of adverse press
55
- coverage, as well as adverse statements or charges by regulators or other government
56
- officials. Press coverage and other public statements that assert some form of
57
- wrongdoing (including, in some cases, press coverage and public statements that
58
- do not directly involve us) often result in some type of investigation by regulators,
59
- legislators and law enforcement officials or in lawsuits. Responding to these
60
- investigations and lawsuits, regardless of the ultimate outcome of the proceeding,
61
- is time-consuming and expensive and can divert the time and effort of our senior
62
- management from our business. Penalties and fines sought by regulatory authorities
63
- have increased substantially and certain regulators have been more likely in recent
64
- years to commence enforcement actions or to support legislation targeted at the
65
- financial services industry. Governmental authorities may also be more likely
66
- to pursue criminal or other actions, including seeking admissions of wrongdoing
67
- or guilty pleas, in connection with the resolution of an inquiry or investigation
68
- to the extent a company is viewed as having previously engaged in criminal, regulatory
69
- or other misconduct. Adverse publicity, governmental scrutiny and legal and enforcement
70
- proceedings can also have a negative impact on our reputation and on the morale
71
- and performance of our employees, which could adversely affect our businesses
72
- and results of operations. Further, we are subject to regulatory settlements,
73
- orders and feedback that require significant remediation activities and enhancements
74
- to existing controls, systems and procedures, which has required and will require
75
- us to commit significant resources, including hiring, as well as testing the operation
76
- and effectiveness of new controls, policies and procedures. The failure to complete
77
- these remediation activities in a timely manner could lead to higher operating
78
- expenses, reputational damage and other negative consequences.The financial services
79
- industry generally and our businesses in particular have been subject to negative
80
- publicity. Our reputation and businesses may be adversely affected by negative
81
- publicity or information regarding our businesses and personnel, whether or not
82
- accurate or true, that may be posted on social media or other internet forums
83
- or published by news organizations. Postings on these types of forums may also
84
- adversely impact risk positions of our clients and other parties that owe us money,
85
- securities or other assets and increase the chance that they will not perform
86
- their obligations to us or reduce the revenues we receive from their use of our
87
- services. The speed and pervasiveness with which information can be disseminated
88
- through these channels, in particular social media, may magnify risks relating
89
- to negative publicity. The rapid dissemination of negative information through
90
- social media, in part, is believed to have led to the collapse of Silicon Valley
91
- Bank (SVB). SVB suffered a level of deposit withdrawals within a time period not
92
- previously experienced by a financial institution. We could also be subject to
93
- rapid deposit withdrawals or other outflows as a result of negative social media
94
- posts or other negative publicity . Substantial civil or criminal liability or
95
- significant regulatory action against us could have material adverse financial
96
- effects or cause us significant reputational harm, which in turn could seriously
97
- harm our business prospects. We face significant legal risks in our businesses,
98
- and the volume of claims and amount of damages and penalties claimed in litigation
99
- and regulatory proceedings against financial institutions remain high. See Notes
100
- 18 and 27 to the consolidated financial statements in Part II, Item 8 of this
101
- Form 10-K for information about certain of our legal and regulatory proceedings
102
- and investigations. We have seen legal claims by consumers and clients increase
103
- in a market downturn and employment-related claims increase following periods
104
- in which we have reduced our headcount. Additionally, governmental entities have
105
- been plaintiffs and are parties in certain of our legal proceedings, and we may
106
- face future civil or criminal actions or claims by the same or other governmental
107
- entities, as well as follow-on civil litigation that is often commenced after
108
- regulatory settlements.THE GOLDMAN SACHS GROUP, INC. AND SUBSIDIARIES Goldman
109
- Sachs 2023 Form 10-K 49
110
- - 'Vanguard U.K. Gilt UCITS ETF Managed by Vanguard Global Advisers, LLC.713 Investment
111
- Objective Vanguard U.K. Gilt UCITS ETF seeks to track the performance of the Bloomberg
112
- Sterling Gilt Float Adjusted Index, a market-weighted index of UK government fixed
113
- income securities denominated in pound sterling. Performance Summary (unaudited)
114
- The Performance Summary does not form part of the Financial Statements. • Inflation
115
- and policymakers’ efforts to rein it in took centre stage for the financial markets
116
- during much of the 12 months ended 30 June 2023. • Early in the period, energy
117
- prices continued to cool amid an outlook for slower economic growth, but price
118
- increases then broadened to other categories, notably the services sector, which
119
- felt the effects of tight labour markets. Central banks including the US Federal
120
- Reserve, the European Central Bank and the Bank of England reacted to the prospect
121
- of inflation remaining stubbornly high by aggressively hiking interest rates even
122
- as their actions fanned fears of a global recession down the road. • Although
123
- progress was slow, signs of inflation moderating later in the period led several
124
- major central banks to slow the pace of their interest rate hikes or even hit
125
- the pause button. • Bonds suffered early in the fiscal year amid aggressive rate
126
- hiking and later when markets began to anticipate that rates would remain higher
127
- for longer. With rising yields pushing prices down, global bonds ended the 12
128
- months in negative territory. • In this environment, the ETF ’s benchmark index
129
- returned –16.59% for the period. • Returns were negative across all maturities.
130
- Gilts maturing in less than five years posted single-digit declines, while all
131
- others posted declines in the double digits. • Gilts underperformed the broad
132
- UK investment-grade bond market. Benchmark returns in the commentary above are
133
- in British pounds. Benchmark: Bloomberg Sterling Gilt Float Adjusted Index Total
134
- Returns Periods Ended 30 June 2023 (Annualised for periods over one year) One
135
- Year Five YearsTen Years or Since Inception1 EUR-Hedged Accumulating -17.43 %
136
- — -13.62 % Benchmark -17.47 — -13.64 Tracking Difference* 0.04 GBP Accumulating
137
- -16.61 % — -5.94 % Benchmark -16.59 — -5.89 Tracking Difference* -0.02 GBP Distributing
138
- -16.61 % -4.88 % 0.07 % Benchmark2 -16.59 -4.83 0.14 Tracking Difference* -0.02
139
- Each benchmark is in its respective currency. Sources: Vanguard Global Advisers,
140
- LLC, and Bloomberg. Returns are based on NAV with income reinvested. All of the
141
- returns in this report represent past performance, which is not a guarantee of
142
- future results that may be achieved by the fund. For performance data current
143
- to the most recent month-end, which may be higher or lower than that cited, visit
144
- our website at http://global.vanguard.com. Note, too, that both investment returns
145
- and principal value can fluctuate widely, so an investor ''s shares, when sold,
146
- could be worth more or less than their original cost. * The tracking difference
147
- between the fund return and the index return over a stated period of time can
148
- be attributed to a number of factors, including, without limitation, small differences
149
- in weightings, trading activity, transaction costs and differences in the valuation
150
- and withholding tax treatment between the fund and the index vendor. 1 Since-inception
151
- returns: EUR-Hedged Accumulating, 28 August 2020; GBP Accumulating, 19 February
152
- 2019. 2 Bloomberg Barclays Global Aggregate U.K. Government Float Adjusted Index
153
- through 31 March 2016, Bloomberg Sterling Gilt Float Adjusted Index thereafter.'
154
- - 818 Vanguard USD Corporate Bond UCITS ETF Principal US Dollars ($) CouponMaturity
155
- DateFair Value US Dollars ($)% of Total Net Assets Capital One Financial Corp.
156
- 235,000 3.27% 1/3/2030 200,081 0.01% Bank of New York Mellon Corp. 210,000 3.85%
157
- 28/4/2028 200,072 0.01% T-Mobile USA, Inc. 250,000 2.25% 15/11/2031 200,037 0.01%
158
- Astrazeneca Finance LLC 200,000 4.88% 3/3/2028 199,915 0.01% Aetna, Inc. 233,000
159
- 4.50% 15/5/2042 199,838 0.01% MassMutual Global Funding II 200,000 5.05% 7/12/2027
160
- 199,614 0.01% Dell International LLC/EMC Corp. 200,000 5.25% 1/2/2028 199,599
161
- 0.01% Newmont Corp. 243,000 2.25% 1/10/2030 199,544 0.01% Caterpillar Financial
162
- Services Corp. 200,000 4.80% 6/1/2026 199,539 0.01% Nucor Corp. 210,000 3.95%
163
- 1/5/2028 199,494 0.01% Pfizer, Inc. 211,000 2.75% 3/6/2026 199,437 0.01% HSBC
164
- USA, Inc. 200,000 5.63% 17/3/2025 199,421 0.01% Equinix, Inc. 223,000 1.45% 15/5/2026
165
- 199,361 0.01% UnitedHealth Group, Inc. 200,000 5.00% 15/10/2024 199,331 0.01%
166
- Altria Group, Inc. 255,000 4.25% 9/8/2042 199,273 0.01% JPMorgan Chase & Co. 210,000
167
- 4.85% 1/2/2044 199,236 0.01% Fifth Third Bancorp 200,000 6.36% 27/10/2028 199,177
168
- 0.01% Corebridge Financial, Inc. 209,000 3.50% 4/4/2025 199,170 0.01% Energy Transfer
169
- LP 200,000 5.50% 1/6/2027 199,121 0.01% Apple, Inc. 200,000 4.42% 8/5/2026 199,094
170
- 0.01% Walt Disney Co. 220,000 2.20% 13/1/2028 199,079 0.01% Georgia-Pacific LLC
171
- 225,000 0.95% 15/5/2026 199,066 0.01% Philip Morris International, Inc. 200,000
172
- 5.00% 17/11/2025 199,057 0.01% Foundry JV Holdco LLC 200,000 5.88% 25/1/2034 198,995
173
- 0.01% Apple, Inc. 200,000 4.30% 10/5/2033 198,990 0.01% John Deere Capital Corp.
174
- 200,000 4.70% 10/6/2030 198,715 0.01% Crown Castle, Inc. 205,000 3.20% 1/9/2024
175
- 198,654 0.01% Nestle Holdings, Inc.
176
- - source_sentence: Calculate the total value of bonds issued by companies with a coupon
177
- rate of 1.50% and discuss the potential attractiveness of these bonds to investors.
178
- sentences:
179
- - '774 Vanguard USD Corporate Bond UCITS ETF Principal US Dollars ($) CouponMaturity
180
- DateFair Value US Dollars ($)% of Total Net Assets TotalEnergies Capital International
181
- SA 150,000 2.99% 29/6/2041 114,035 0.01% Banque Federative du Credit Mutuel SA
182
- 100,000 4.52% 13/7/2025 97,283 0.01% TotalEnergies Capital International SA 125,000
183
- 3.39% 29/6/2060 91,711 0.01% BPCE SA 135,000 3.58% 19/10/2042 89,713 0.01% Air
184
- Liquide Finance SA 115,000 3.50% 27/9/2046 88,618 0.01% WEA Finance LLC/Westfield
185
- UK & Europe Finance plc 110,000 4.75% 17/9/2044 77,253 0.00% Pernod Ricard SA
186
- 75,000 3.25% 8/6/2026 72,024 0.00% Societe Generale SA 75,000 1.38% 8/7/2025 68,268
187
- 0.00% Societe Generale SA 75,000 5.63% 24/11/2045 60,538 0.00% BPCE SA 75,000
188
- 3.65% 14/1/2037 59,575 0.00% Legrand France SA 50,000 8.50% 15/2/2025 52,327 0.00%
189
- Societe Generale SA 50,000 3.65% 8/7/2035 39,945 0.00% Lafarge SA 35,000 7.13%
190
- 15/7/2036 38,180 0.00% - - - - 27,649,631 1.69% Germany 1.17% (30 June 2022: 1.46%)
191
- Deutsche Telekom International Finance BV 760,000 8.75% 15/6/2030 910,393 0.06%
192
- Bayer US Finance II LLC 715,000 4.38% 15/12/2028 678,134 0.04% Bayer US Finance
193
- II LLC 550,000 4.25% 15/12/2025 529,590 0.03% Mercedes-Benz Finance North America
194
- LLC 325,000 8.50% 18/1/2031 403,840 0.03% Volkswagen Group of America Finance
195
- LLC 410,000 4.75% 13/11/2028 396,008 0.02% Deutsche Bank AG 450,000 2.31% 16/11/2027
196
- 387,091 0.02% Deutsche Bank AG 401,000 3.96% 26/11/2025 382,263 0.02% Deutsche
197
- Bank AG 430,000 2.13% 24/11/2026 381,773 0.02% Volkswagen Group of America Finance
198
- LLC 410,000 1.25% 24/11/2025 369,307 0.02% EMD Finance LLC 360,000 3.25% 19/3/2025
199
- 345,003 0.02% BMW US Capital LLC 350,000 3.90% 9/4/2025 341,419 0.02% Deutsche
200
- Bank AG 405,000 3.55% 18/9/2031 336,477 0.02% Deutsche Bank AG 335,000 6.72% 18/1/2029
201
- 336,195 0.02% BMW US Capital LLC 350,000 4.15% 9/4/2030 334,517 0.02% Siemens
202
- Financieringsmaatschappij NV 350,000 2.35% 15/10/2026 322,332 0.02% Siemens Financieringsmaatschappij
203
- NV 350,000 4.'
204
- - '300,000 0.50% 15/9/2031 219,934 0.01% Nestle Finance International Ltd. 320,000
205
- 0.88% 14/6/2041 214,208 0.01% Raiffeisen Schweiz Genossenschaft 200,000 4.84%
206
- 3/11/2028 200,305 0.01% Holcim Finance Luxembourg SA 250,000 0.63% 6/4/2030 199,592
207
- 0.01% Holcim Finance Luxembourg SA 200,000 1.50% 6/4/2025 190,741 0.01% Givaudan
208
- Finance Europe BV 200,000 1.00% 22/4/2027 180,794 0.01% Argentum Netherlands BV
209
- for Zurich Insurance Co., Ltd. 200,000 2.75% 19/2/2049 174,999 0.01% Sika Capital
210
- BV 200,000 1.50% 29/4/2031 169,245 0.01% Givaudan Finance Europe BV 200,000 1.63%
211
- 22/4/2032 167,542 0.01% Tyco Electronics Group SA 200,000 0.00% 16/2/2029 163,007
212
- 0.01% ABB Finance BV 200,000 0.00% 19/1/2030 158,826 0.01% Nestle Finance International
213
- Ltd. 200,000 0.63% 14/2/2034 152,119 0.01% ELM BV for Swiss Life Insurance & Pension
214
- Group 160,000 4.50% Perpetual 152,096 0.01% Novartis Finance SA 100,000 1.63%
215
- 9/11/2026 93,717 0.01% Holcim Finance Luxembourg SA 100,000 0.13% 19/7/2027 87,370
216
- 0.01% Adecco International Financial Services BV 100,000 1.00% 21/3/2082 76,895
217
- 0.00% Zurich Finance Ireland Designated Activity Co. 100,000 1.63% 17/6/2039 74,378
218
- 0.00% Nestle Finance International Ltd. 100,000 0.00% 3/3/2033 73,483 0.00% Nestle
219
- Finance International Ltd. 100,000 0.38% 3/12/2040 61,797 0.00% - - - - 54,507,790
220
- 3.47% United Kingdom 8.27% (30 June 2022: 9.16%) BP Capital Markets plc 2,100,000
221
- 3.25% Perpetual 1,935,372 0.12% HSBC Holdings plc 2,060,000 0.31% 13/11/2026 1,863,645
222
- 0.12%'
223
- - 'PART II Item 8 68 (In millions) Fair Value Level Adjusted Cost Basis Unrealized
224
- Gains Unrealized Losses Recorded Basis Cash and Cash Equivalents Short -term Investments
225
- Equity Investments June 30, 2022 Changes in Fair Value Recorded in Other Comprehensive
226
- Income Commercial paper Level 2 $ 2,500 $ 0 $ 0 $ 2,500 $ 2,498 $ 2 $ 0 Certificates
227
- of deposit Level 2 2,071 0 0 2,071 2,032 39 0 U.S. government securities Level
228
- 1 79,696 29 (2,178 ) 77,547 9 77,538 0 U.S. agency securities Level 2 419 0 (9
229
- ) 410 0 410 0 Foreign government bonds Level 2 506 0 (24 ) 482 0 482 0 Mortgage
230
- - and asset -backed securities Level 2 727 1 (30 ) 698 0 698 0 Corporate notes
231
- and bonds Level 2 11,661 4 (554 ) 11,111 0 11,111 0 Corporate notes and bonds
232
- Level 3 67 0 0 67 0 67 0 Municipal securities Level 2 368 19 (13 ) 374 0 374 0
233
- Municipal securities Level 3 103 0 (6 ) 97 0 97 0 Total debt investments $ 98,118
234
- $ 53 $ (2,814 ) $ 95,357 $ 4,539 $ 90,818 $ 0 Changes in Fair Value Recorded in
235
- Net Income Equity investments Level 1 $ 1,590 $ 1,134 $ 0 $ 456 Equity investments
236
- Other 6,435 0 0 6,435 Total equity investments $ 8,025 $ 1,134 $ 0 $ 6,891 Cash
237
- $ 8,258 $ 8,258 $ 0 $ 0 Derivatives, net (a) 8 0 8 0 Total $ 111,648 $ 13,931
238
- $ 90,826 $ 6,891 (a) Refer to Note 5 – Derivatives for further information on
239
- the fair value of our derivative instruments. Equity investments presented as
240
- “Other” in the tables above include investments without readily determinable fair
241
- values measured using the equity method or measured at cost with adjustments for
242
- observable changes in price or impairments, and investments m easured at fair
243
- value using net asset value as a practical expedient which are not categorized
244
- in the fair value hierarchy. As of June 30, 2023 and 2022 , equity investments
245
- without readily determinable fair values measured at cost with adjustments for
246
- obse rvable changes in price or impairments were $ 4.2 billion and $3.8 billion
247
- , respectively. Unrealized Losses on Debt Investments Debt investments with continuous
248
- unrealized losses for less than 12 months and 12 months or greater and their related
249
- fair values were as follows: Less than 12 Months 12 Months or Greater Total Unrealized
250
- Losses (In millions) Fair Value Unrealized Losses Fair Value Unrealized Losses
251
- Total Fair Value June 30 , 2023 U.S.'
252
- - source_sentence: Identify the bond with the longest maturity date from the provided
253
- list and state its issuer and interest rate.
254
- sentences:
255
- - Liquidity Risk. Liquidity risk is the risk that sufficient cash cannot be raised
256
- to meet liabilities when due. One of the key liquidity factors influencing the
257
- Company and the Funds is exposure to cash redemptions of redeemable participating
258
- shares. Hence the Company, through the Funds, invests the large majority of its
259
- assets in investments that are traded in active markets and can ordinarily be
260
- readily disposed. However, liquidity risk will occur if an issuer becomes credit-impaired
261
- or if the relevant market becomes illiquid. In such a case, it may not be possible
262
- to initiate or liquidate a position at a price deemed by the Investment Manager
263
- to be demonstrating fair value. Liquidity risk may be temporary or may last for
264
- extended periods. The Company, through the Funds, invests in securities that form
265
- part of the benchmark indices. Benchmark indices are constructed from index rules
266
- requiring securities to have a specified minimum trading volume, which, although
267
- not guaranteeing liquidity, provides indication of the liquid nature of the securities
268
- underlying the Funds. The Funds are exposed to withdrawals and contributions that
269
- are invested to ensure that exposure to the benchmark indices is maintained to
270
- meet the investment objective of the Funds. All the Funds ’ financial liabilities,
271
- based on contractual maturities, fall due within three months. Additionally, the
272
- Funds may use index futures contracts to a limited extent, to maintain full exposure
273
- to the index, maintain liquidity and minimise transaction costs. Funds may purchase
274
- futures contracts to immediately invest incoming cash in the market, or sell futures
275
- in response to cash outflows, thereby simulating a fully invested position in
276
- the underlying index while maintaining a cash balance for liquidity.
277
- - $100,000 2.20% 17/6/2025 94,044 0.00% Wells Fargo & Co. £100,000 2.13% 24/9/2031
278
- 94,000 0.00% New York Life Global Funding $100,000 1.45% 14/1/2025 93,999 0.00%
279
- PACCAR Financial Corp. $100,000 0.90% 8/11/2024 93,978 0.00% Revvity, Inc. $105,000
280
- 3.30% 15/9/2029 93,910 0.00% Freeport-McMoRan, Inc. $100,000 4.13% 1/3/2028 93,897
281
- 0.00% Wells Fargo Commercial Mortgage Trust 2018 $100,000 4.30% 15/1/2052 93,895
282
- 0.00% Altria Group, Inc. €100,000 3.13% 15/6/2031 93,889 0.00% New York Life Global
283
- Funding $100,000 0.90% 29/10/2024 93,860 0.00% JPMorgan Chase & Co. $100,000 2.95%
284
- 1/10/2026 93,859 0.00% Exelon Corp. $100,000 4.05% 15/4/2030 93,734 0.00% Wells
285
- Fargo & Co. $100,000 2.19% 30/4/2026 93,726 0.00% Morgan Stanley $100,000 3.13%
286
- 27/7/2026 93,704 0.00% Southwest Airlines Co. $110,000 2.63% 10/2/2030 93,681
287
- 0.00% JPMorgan Chase & Co. $100,000 1.56% 10/12/2025 93,650 0.00% Equitable Financial
288
- Life Global Funding $100,000 1.10% 12/11/2024 93,632 0.00% Walt Disney Co. $110,000
289
- 2.00% 1/9/2029 93,622 0.00% Citigroup, Inc. $100,000 3.20% 21/10/2026 93,617 0.00%
290
- PayPal Holdings, Inc. $100,000 1.65% 1/6/2025 93,598 0.00% Brixmor Operating Partnership
291
- LP $100,000 4.13% 15/6/2026 93,553 0.00% Public Service Electric & Gas Co. $100,000
292
- 3.00% 15/5/2027 93,549 0.00% High Street Funding Trust I $100,000 4.11% 15/2/2028
293
- 93,547 0.00% Invesco Finance plc $96,000 5.38% 30/11/2043 93,545 0.00% Philip
294
- Morris International, Inc. $100,000 1.50% 1/5/2025 93,534 0.00% VICI Properties
295
- LP $100,000 5.13% 15/5/2032 93,481 0.00% Citigroup Commercial Mortgage Trust 2018
296
- $100,000 4.23% 10/6/2051 93,422 0.00% Air Products & Chemicals, Inc. €100,000
297
- 0.50% 5/5/2028 93,418 0.00% Duke Energy Progress LLC $150,000 2.50% 15/8/2050
298
- 93,415 0.00% General Motors Co. $100,000 5.95% 1/4/2049 93,381 0.00% Verizon Communications,
299
- Inc.
300
- - $65,000 5.00% 15/7/2032 64,094 0.00% Fifth Third Bancorp $75,000 1.71% 1/11/2027
301
- 64,024 0.00% Devon Energy Corp. $64,000 5.88% 15/6/2028 63,783 0.00% Pacific Gas
302
- & Electric Co. $100,000 3.50% 1/8/2050 63,779 0.00% Medtronic Global Holdings
303
- SCA $65,000 4.50% 30/3/2033 63,708 0.00% Franklin Resources, Inc. $100,000 2.95%
304
- 12/8/2051 63,638 0.00% AT&T, Inc. $75,000 4.55% 9/3/2049 63,608 0.00% Enterprise
305
- Products Operating LLC $75,000 4.25% 15/2/2048 63,582 0.00% Morgan Stanley Bank
306
- of America Merrill Lynch Trust 2016 $67,746 3.27% 15/1/2049 63,553 0.00% Visa,
307
- Inc. $75,000 3.65% 15/9/2047 63,523 0.00% New York City Municipal Water Finance
308
- Authority $60,000 5.44% 15/6/2043 63,518 0.00% Willis-Knighton Medical Center
309
- $100,000 3.07% 1/3/2051 63,500 0.00% Charter Communications Operating LLC/Charter
310
- Communications Operating Capital $100,000 3.70% 1/4/2051 63,461 0.00% Johnson
311
- & Johnson $60,000 4.95% 15/5/2033 63,221 0.00% Federal Home Loan Mortgage Corp.
312
- $75,000 2.07% 25/1/2031 63,215 0.00% CVS Health Corp. $75,000 4.13% 1/4/2040 63,134
313
- 0.00% United States Treasury Note $68,000 1.88% 31/7/2026 62,958 0.00% Enterprise
314
- Products Operating LLC $75,000 4.20% 31/1/2050 62,879 0.00% Commonwealth Edison
315
- Co. $75,000 4.00% 1/3/2048 62,849 0.00% Uniform Mortgage Backed Securities $68,505
316
- 2.50% 1/6/2034 62,837 0.00% Federal Home Loan Mortgage Corp. $75,000 2.02% 25/3/2031
317
- 62,832 0.00% Brighthouse Financial, Inc. $65,000 5.63% 15/5/2030 62,820 0.00%
318
- Eli Lilly & Co. €100,000 1.13% 14/9/2051 62,721 0.00% Stryker Corp. $75,000 1.95%
319
- 15/6/2030 62,690 0.00% Cox Communications, Inc. $100,000 2.95% 1/10/2050 62,543
320
- 0.00%
321
- - source_sentence: Which issuer has the highest number of bonds listed in the context
322
- information, and what are the details of those bonds?
323
- sentences:
324
- - 517 Vanguard EUR Corporate Bond UCITS ETF Principal EUR (€) CouponMaturity DateFair
325
- Value EUR (€)% of Total Net Assets Deutsche Telekom AG 350,000 1.75% 25/3/2031
326
- 312,584 0.02% Volkswagen International Finance NV 400,000 1.25% 23/9/2032 309,383
327
- 0.02% Covestro AG 300,000 4.75% 15/11/2028 305,772 0.02% Hamburg Commercial Bank
328
- AG 300,000 4.88% 17/3/2025 298,161 0.02% MTU Aero Engines AG 300,000 3.00% 1/7/2025
329
- 296,850 0.02% Adidas AG 300,000 3.00% 21/11/2025 294,853 0.02% Knorr-Bremse AG
330
- 300,000 3.25% 21/9/2027 294,564 0.02% Eurogrid GmbH 300,000 3.28% 5/9/2031 290,932
331
- 0.02% Volkswagen Financial Services AG 300,000 1.50% 1/10/2024 290,663 0.02% Henkel
332
- AG & Co. KGaA 300,000 2.63% 13/9/2027 290,397 0.02% Muenchener Hypothekenbank
333
- eG 300,000 0.88% 11/7/2024 289,863 0.02% Conti-Gummi Finance BV 300,000 1.13%
334
- 25/9/2024 289,074 0.02% Volkswagen Leasing GmbH 300,000 0.00% 19/7/2024 287,414
335
- 0.02% HOCHTIEF AG 300,000 1.75% 3/7/2025 286,772 0.02% Santander Consumer Bank
336
- AG 300,000 0.25% 15/10/2024 285,058 0.02% Merck KGaA 300,000 1.63% 25/6/2079 284,407
337
- 0.02% Vonovia Finance BV 400,000 2.75% 22/3/2038 283,910 0.02% E.ON SE 300,000
338
- 1.00% 7/10/2025 282,765 0.02% Heraeus Finance GmbH 300,000 2.63% 9/6/2027 281,328
339
- 0.02% LEG Immobilien SE 400,000 1.00% 19/11/2032 278,523 0.02% Covestro AG 300,000
340
- 0.88% 3/2/2026 278,048 0.02% Deutsche Post AG 279,000 2.88% 11/12/2024 276,566
341
- 0.02% Fresenius Medical Care AG & Co.
342
- - This Annual Report on Form 10-K (“Form 10-K”) contains forward-looking statements,
343
- within the meaning of the Private Securities Litigation Reform Act of 1995, that
344
- involve risks and uncertainties. Many of the forward-looking statements are located
345
- in Part I, Item 1 of this Form 10-K under the heading “Business” and Part II,
346
- Item 7 of this Form 10-K under the heading “Management’s Discussion and Analysis
347
- of Financial Condition and Results of Operations.” Forward-looking statements
348
- provide current expectations of future events based on certain assumptions and
349
- include any statement that does not directly relate to any historical or current
350
- fact. For example, statements in this Form 10-K regarding the potential future
351
- impact of macroeconomic conditions on the Company’s business and results of operations
352
- are forward-looking statements. Forward- looking statements can also be identified
353
- by words such as “future,” “anticipates,” “believes,” “estimates,” “expects,”
354
- “intends,” “plans,” “predicts,” “will,” “would,” “could,” “can,” “may,” and similar
355
- terms. Forward-looking statements are not guarantees of future performance and
356
- the Company’s actual results may differ significantly from the results discussed
357
- in the forward-looking statements. Factors that might cause such differences include,
358
- but are not limited to, those discussed in Part I, Item 1A of this Form 10-K under
359
- the heading “Risk Factors.” The Company assumes no obligation to revise or update
360
- any forward-looking statements for any reason, except as required by law. Unless
361
- otherwise stated, all information presented herein is based on the Company’s fiscal
362
- calendar, and references to particular years, quarters, months or periods refer
363
- to the Company’s fiscal years ended in September and the associated quarters,
364
- months and periods of those fiscal years. Each of the terms the “Company” and
365
- “Apple” as used herein refers collectively to Apple Inc. and its wholly owned
366
- subsidiaries, unless otherwise stated. PART I Item 1. Business Company Background
367
- The Company designs, manufactures and markets smartphones, personal computers,
368
- tablets, wearables and accessories, and sells a variety of related services. The
369
- Company’s fiscal year is the 52- or 53-week period that ends on the last Saturday
370
- of September. Products iPhone iPhone® is the Company’s line of smartphones based
371
- on its iOS operating system. The iPhone line includes iPhone 15 Pro, iPhone 15,
372
- iPhone 14, iPhone 13 and iPhone SE®. Mac Mac® is the Company’s line of personal
373
- computers based on its macOS® operating system. The Mac line includes laptops
374
- MacBook Air® and MacBook Pro®, as well as desktops iMac®, Mac mini®, Mac Studio®
375
- and Mac Pro®. iPad iPad® is the Company’s line of multipurpose tablets based on
376
- its iPadOS® operating system. The iPad line includes iPad Pro®, iPad Air®, iPad
377
- and iPad mini®. Wearables, Home and Accessories Wearables includes smartwatches
378
- and wireless headphones. The Company’s line of smartwatches, based on its watchOS®
379
- operating system, includes Apple Watch Ultra™ 2, Apple Watch® Series 9 and Apple
380
- Watch SE®. The Company’s line of wireless headphones includes AirPods®, AirPods
381
- Pro®, AirPods Max™ and Beats® products. Home includes Apple TV®, the Company’s
382
- media streaming and gaming device based on its tvOS® operating system, and HomePod®
383
- and HomePod mini®, high-fidelity wireless smart speakers. Accessories includes
384
- Apple-branded and third-party accessories. Apple Inc. | 2023 Form 10-K | 1
385
- - Refer to “Note 2 Accounting for the acquisition of the Credit Suisse Group” in
386
- the “Cons olidated financial statements” section of the UBS Group Annual Report
387
- 2023, available under “Annual reporting” at ubs.com/investors, for more information.
388
- 2 Refer to the “Share information and earnings per share” section of the UBS Group
389
- first quarter 2024 report, available under “Quarterly reporting” at ubs.com/investors,
390
- for more information. 3 Refer to the “Targets, capital guidance and ambitions”
391
- section of the UBS Group Annual Report 2023, available under “Annual reporting”
392
- at ubs.com/investors, for more information about our performance targets. 4 Refer
393
- to “Alternative performance measures” in the appendix to the UBS Group first quarter
394
- 2024 report, available under “Quarterly reporting” at ubs.com/investors, for the
395
- definition and calculation method. 5 Profit or loss information for each of the
396
- first quarter of 2024 and the fourth quarter of 2023 is presented on a consolidated
397
- basis, including for each quarter Credit Suisse data for three months and for
398
- the purpose of the calculatio n of return measures has been annualized multiplying
399
- such by four. Profit or loss information for the first quarter of 2023 includes
400
- pre-acquisition UBS data for three months and for the purpose of the calculation
401
- of return measures has been annualized multiplying such by four. 6 Refer to the
402
- “Group performance” section of the UBS Group first quarter 2024 report, available
403
- under “Quarterly reporting” at ubs.com/investors, for more information about underlying
404
- results. 7 The effective tax rate for the fourth quarter of 2023 is not a meaningful
405
- measure, due to the distortive effect of current unbenefited tax losses at the
406
- form er Credit Suisse entities. 8 Based on the Swiss systemically relevant bank
407
- framework as of 1 January 2020. Refer to the “Capital management” section of the
408
- UBS Group first quarter 2024 report, available under “Quarterly reporting” at
409
- ubs.com/inv estors, for more information. 9 The disclosed ratios represent quarterly
410
- averages for the quarters presented and are calculated based on an average of
411
- 61 data points in the first quarter of 2024, 63 data points in the fourth quarter
412
- of 2023 and 64 data points i n the first quarter of 2023. Refer to the “Liquidity
413
- and funding management” section of the UBS Group first quarter 2024 repo rt, available
414
- under “Quarterly reporting” at ubs.com/investors, for more information. 10 Consists
415
- of invested assets for Global Wealth Management, Asset Management and Personal
416
- & Corporate Banking. Refer to “Note 32 Invested assets and net new money” in the
417
- “Consolidated financial statements” section of the UBS Group Annual Report 2023,
418
- available under “Annual reporting” at ubs.co m/investors, for more information.
419
- 11 Starting with the second quarter of 2023, invested assets include invested
420
- assets from associates in the Asset Management business division, to better reflect
421
- the business strategy. Comparative figures have been rest ated to reflect this
422
- change. 12 In the second quarter of 2023, the calculation of market capitalization
423
- was amended to reflect total shares issued multiplied by the share price at the
424
- end of the period. The calculation was previously based on total shar es outstanding
425
- multiplied by the share price at the end of the period. Market capitalization
426
- was increased by USD 10.0bn as of 31 March 2023 as a result.
427
- - source_sentence: What factors contributed to the negative performance of global
428
- bonds over the 12-month period?
429
- sentences:
430
- - Vanguard Global Aggregate Bond UCITS ETF Managed by Vanguard Global Advisers,
431
- LLC.561 Investment Objective Vanguard Global Aggregate Bond UCITS ETF seeks to
432
- track the performance of the Bloomberg Global Aggregate Float Adjusted and Scaled
433
- Index, a widely recognised benchmark designed to reflect the characteristics of
434
- the global aggregate bond universe. Performance Summary (unaudited) The Performance
435
- Summary does not form part of the Financial Statements. • Inflation and policymakers’
436
- efforts to rein it in took centre stage for the financial markets during much
437
- of the 12 months ended 30 June 2023. • Early in the period, energy prices continued
438
- to cool amid an outlook for slower economic growth, but price increases then broadened
439
- to other categories, notably the services sector, which felt the effects of tight
440
- labour markets. Central banks including the US Federal Reserve, the European Central
441
- Bank and the Bank of England reacted to the prospect of inflation remaining stubbornly
442
- high by aggressively hiking interest rates even as their actions fanned fears
443
- of a global recession down the road. • Although progress was slow, signs of inflation
444
- moderating later in the period led several major central banks to slow the pace
445
- of their interest rate hikes or even hit the pause button. • Bonds suffered early
446
- in the fiscal year amid aggressive rate hiking and later when markets began to
447
- anticipate that rates would remain higher for longer. With rising yields pushing
448
- prices down, global bonds ended the 12 months in negative territory. • In this
449
- environment, the ETF ’s benchmark index returned –0.09% for the fiscal year. •
450
- By country, the United States, the largest constituent in the index by far, underperformed
451
- the index, returning –0.94%. Belgium, the UK and the Netherlands were also among
452
- the laggards. Canada, Japan, China and South Korea outperformed and were in positive
453
- territory. • By sector, corporate and non-corporate bonds fared better than government
454
- bonds and US mortgage-backed securities. • By credit quality, bonds on the bottom
455
- rung of the investment-grade ladder tended to perform better than higher-rated
456
- bonds. By maturity, longer-dated bonds lagged. Benchmark returns in the commentary
457
- above are in US dollars.
458
- - $25,000 4.55% 1/3/2029 23,455 0.00% Eaton Corp. $25,000 3.10% 15/9/2027 23,451
459
- 0.00% Penske Truck Leasing Co. LP/PTL Finance Corp. $25,000 4.20% 1/4/2027 23,448
460
- 0.00% Tri-State Generation & Transmission Association, Inc. $25,000 6.00% 15/6/2040
461
- 23,446 0.00% AEP Transmission Co. LLC $25,000 3.10% 1/12/2026 23,440 0.00% Target
462
- Corp. $25,000 3.38% 15/4/2029 23,439 0.00% JM Smucker Co. $25,000 3.38% 15/12/2027
463
- 23,419 0.00% AmerisourceBergen Corp. $25,000 3.45% 15/12/2027 23,403 0.00% Lowe's
464
- Cos, Inc. $25,000 2.50% 15/4/2026 23,399 0.00% McCormick & Co., Inc. $25,000 3.40%
465
- 15/8/2027 23,399 0.00% Wells Fargo Commercial Mortgage Trust 2018 $25,000 4.18%
466
- 15/6/2051 23,399 0.00% Raytheon Technologies Corp. $25,000 3.13% 4/5/2027 23,398
467
- 0.00% Wells Fargo Commercial Mortgage Trust 2018 $25,000 4.15% 15/8/2051 23,398
468
- 0.00% Masco Corp. $25,000 3.50% 15/11/2027 23,394 0.00% Southern Power Co. $25,000
469
- 5.15% 15/9/2041 23,379 0.00% Kirby Corp. $25,000 4.20% 1/3/2028 23,378 0.00% UBS
470
- Commercial Mortgage Trust 2018 $25,000 4.34% 15/12/2051 23,376 0.00% UDR, Inc.
471
- $25,000 3.50% 1/7/2027 23,375 0.00% Citigroup Commercial Mortgage Trust 2016 $25,000
472
- 3.35% 10/2/2049 23,368 0.00% BANK 2018 $25,000 4.05% 15/3/2061 23,365 0.00% Northwestern
473
- Mutual Life Insurance Co. $30,000 3.85% 30/9/2047 23,364 0.00% Southern California
474
- Edison Co. $25,000 3.65% 1/3/2028 23,357 0.00% Citigroup Commercial Mortgage Trust
475
- 2016 $25,000 3.62% 10/2/2049 23,344 0.00% Morgan Stanley Capital I Trust 2016
476
- $24,970 3.33% 15/3/2049 23,332 0.00% Verizon Communications, Inc. $25,000 4.02%
477
- 3/12/2029 23,321 0.00% Raytheon Technologies Corp. $25,000 4.80% 15/12/2043 23,289
478
- 0.00% Georgia Power Co. $25,000 3.25% 30/3/2027 23,286 0.00% Bio-Rad Laboratories,
479
- Inc. $25,000 3.30% 15/3/2027 23,269 0.00% FedEx Corp.
480
- - 'Note 26. Credit Concentrations The firm’s concentrations of credit risk arise
481
- from its market- making, client facilitation, investing, underwriting, lending
482
- and collateralized transactions, and cash management activities, and may be impacted
483
- by changes in economic, industry or political factors. These activities expose
484
- the firm to many different industries and counterparties, and may also subject
485
- the firm to a concentration of credit risk to a particular central bank, counterparty,
486
- borrower or issuer, including sovereign issuers, or to a particular clearinghouse
487
- or exchange. The firm seeks to mitigate credit risk by actively monitoring exposures
488
- and obtaining collateral from counterparties as deemed appropriate. The firm measures
489
- and monitors its credit exposure based on amounts owed to the firm after taking
490
- into account risk mitigants that the firm considers when determining credit risk.
491
- Such risk mitigants include netting and collateral arrangements and economic hedges,
492
- such as credit derivatives, futures and forward contracts. Netting and collateral
493
- agreements permit the firm to offset receivables and payables with such counterparties
494
- and/or enable the firm to obtain collateral on an upfront or contingent basis.
495
- The table below presents the credit concentrations included in trading cash instruments
496
- and investments. As of December $ in millions 2023 2022 U.S. government and agency
497
- obligations $ 260,531 $ 205,935 Percentage of total assets 15.9% 14.3% Non-U.S.
498
- government and agency obligations $ 90,681 $ 40,334 Percentage of total assets
499
- 5.5% 2.8% In addition, the firm had $206.07 billion as of December 2023 and $208.53
500
- billion as of December 2022 of cash deposits held at central banks (included in
501
- cash and cash equivalents), of which $105.66 billion as of December 2023 and $165.77
502
- billion as of December 2022 was held at the Federal Reserve. As of both December
503
- 2023 and December 2022 , the firm did not have credit exposure to any other counterparty
504
- that exceeded 2% of total assets. Collateral obtained by the firm related to derivative
505
- assets is principally cash and is held by the firm or a third-party custodian.
506
- Collateral obtained by the firm related to resale agreements and securities borrowed
507
- transactions is primarily U.S. government and agency obligations, and non-U.S.
508
- government and agency obligations. See Note 11 for further information about collateralized
509
- agreements and financings. The table below presents U.S. government and agency
510
- obligations, and non-U.S. government and agency obligations that collateralize
511
- resale agreements and securities borrowed transactions. As of December $ in millions
512
- 2023 2022 U.S. government and agency obligations $ 154,056 $ 164,897 Non-U.S.
513
- government and agency obligations $ 92,833 $ 76,456 In the table above: •Non-U.S.
514
- government and agency obligations primarily consists of securities issued by the
515
- governments of the U.K., Japan, Germany, France and Italy. •Given that the firm’s
516
- primary credit exposure on such transactions is to the counterparty to the transaction,
517
- the firm would be exposed to the collateral issuer only in the event of counterparty
518
- default. Note 27. Legal Proceedings The firm is involved in a number of judicial,
519
- regulatory and arbitration proceedings (including those described below) concerning
520
- matters arising in connection with the conduct of the firm’s businesses. Many
521
- of these proceedings are in early stages, and many of these cases seek an indeterminate
522
- amount of damages. Under ASC 450, an event is “reasonably possible” if “the chance
523
- of the future event or events occurring is more than remote but less than likely”
524
- and an event is “remote” if “the chance of the future event or events occurring
525
- is slight.” Thus, references to the upper end of the range of reasonably possible
526
- loss for cases in which the firm is able to estimate a range of reasonably possible
527
- loss mean the upper end of the range of loss for cases for which the firm believes
528
- the risk of loss is more than slight. THE GOLDMAN SACHS GROUP, INC. AND SUBSIDIARIES
529
- Notes to Consolidated Financial Statements 216 Goldman Sachs 2023 Form 10-K'
530
- model-index:
531
- - name: SentenceTransformer based on BAAI/bge-base-en-v1.5
532
- results:
533
- - task:
534
- type: information-retrieval
535
- name: Information Retrieval
536
- dataset:
537
- name: Unknown
538
- type: unknown
539
- metrics:
540
- - type: cosine_accuracy@1
541
- value: 0.4418505942275042
542
- name: Cosine Accuracy@1
543
- - type: cosine_accuracy@3
544
- value: 0.6752971137521222
545
- name: Cosine Accuracy@3
546
- - type: cosine_accuracy@5
547
- value: 0.7625919637804188
548
- name: Cosine Accuracy@5
549
- - type: cosine_accuracy@10
550
- value: 0.8507357102433503
551
- name: Cosine Accuracy@10
552
- - type: cosine_precision@1
553
- value: 0.4418505942275042
554
- name: Cosine Precision@1
555
- - type: cosine_precision@3
556
- value: 0.22509903791737407
557
- name: Cosine Precision@3
558
- - type: cosine_precision@5
559
- value: 0.15251839275608375
560
- name: Cosine Precision@5
561
- - type: cosine_precision@10
562
- value: 0.08507357102433502
563
- name: Cosine Precision@10
564
- - type: cosine_recall@1
565
- value: 0.4418505942275042
566
- name: Cosine Recall@1
567
- - type: cosine_recall@3
568
- value: 0.6752971137521222
569
- name: Cosine Recall@3
570
- - type: cosine_recall@5
571
- value: 0.7625919637804188
572
- name: Cosine Recall@5
573
- - type: cosine_recall@10
574
- value: 0.8507357102433503
575
- name: Cosine Recall@10
576
- - type: cosine_ndcg@10
577
- value: 0.6432422811456076
578
- name: Cosine Ndcg@10
579
- - type: cosine_mrr@10
580
- value: 0.5770132656911126
581
- name: Cosine Mrr@10
582
- - type: cosine_map@100
583
- value: 0.5834691562837875
584
- name: Cosine Map@100
585
- - type: dot_accuracy@1
586
- value: 0.4418505942275042
587
- name: Dot Accuracy@1
588
- - type: dot_accuracy@3
589
- value: 0.6752971137521222
590
- name: Dot Accuracy@3
591
- - type: dot_accuracy@5
592
- value: 0.7625919637804188
593
- name: Dot Accuracy@5
594
- - type: dot_accuracy@10
595
- value: 0.8507357102433503
596
- name: Dot Accuracy@10
597
- - type: dot_precision@1
598
- value: 0.4418505942275042
599
- name: Dot Precision@1
600
- - type: dot_precision@3
601
- value: 0.22509903791737407
602
- name: Dot Precision@3
603
- - type: dot_precision@5
604
- value: 0.15251839275608375
605
- name: Dot Precision@5
606
- - type: dot_precision@10
607
- value: 0.08507357102433502
608
- name: Dot Precision@10
609
- - type: dot_recall@1
610
- value: 0.4418505942275042
611
- name: Dot Recall@1
612
- - type: dot_recall@3
613
- value: 0.6752971137521222
614
- name: Dot Recall@3
615
- - type: dot_recall@5
616
- value: 0.7625919637804188
617
- name: Dot Recall@5
618
- - type: dot_recall@10
619
- value: 0.8507357102433503
620
- name: Dot Recall@10
621
- - type: dot_ndcg@10
622
- value: 0.6432422811456076
623
- name: Dot Ndcg@10
624
- - type: dot_mrr@10
625
- value: 0.5770132656911126
626
- name: Dot Mrr@10
627
- - type: dot_map@100
628
- value: 0.5834691562837875
629
- name: Dot Map@100
630
- ---
631
 
632
- # SentenceTransformer based on BAAI/bge-base-en-v1.5
633
 
634
- This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
635
 
636
- ## Model Details
 
 
637
 
638
- ### Model Description
639
  - **Model Type:** Sentence Transformer
640
- - **Base model:** [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5) <!-- at revision a5beb1e3e68b9ab74eb54cfd186867f64f240e1a -->
 
641
  - **Maximum Sequence Length:** 512 tokens
642
- - **Output Dimensionality:** 768 tokens
643
  - **Similarity Function:** Cosine Similarity
644
- <!-- - **Training Dataset:** Unknown -->
645
- <!-- - **Language:** Unknown -->
646
- <!-- - **License:** Unknown -->
647
-
648
- ### Model Sources
649
-
650
- - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
651
- - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
652
- - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
653
-
654
- ### Full Model Architecture
655
-
656
- ```
657
- SentenceTransformer(
658
- (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
659
- (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
660
- (2): Normalize()
661
- )
662
- ```
663
-
664
- ## Usage
665
-
666
- ### Direct Usage (Sentence Transformers)
667
 
668
- First install the Sentence Transformers library:
669
 
670
- ```bash
671
- pip install -U sentence-transformers
672
- ```
673
 
674
- Then you can load this model and run inference.
675
  ```python
676
  from sentence_transformers import SentenceTransformer
677
 
678
  # Download from the 🤗 Hub
679
- model = SentenceTransformer("sujet-ai/Marsilia-Embedding-EN-base")
 
680
  # Run inference
681
  sentences = [
682
- 'What factors contributed to the negative performance of global bonds over the 12-month period?',
683
- 'Vanguard Global Aggregate Bond UCITS ETF Managed by Vanguard Global Advisers, LLC.561 Investment Objective Vanguard Global Aggregate Bond UCITS ETF seeks to track the performance of the Bloomberg Global Aggregate Float Adjusted and Scaled Index, a widely recognised benchmark designed to reflect the characteristics of the global aggregate bond universe. Performance Summary (unaudited) The Performance Summary does not form part of the Financial Statements. • Inflation and policymakers’ efforts to rein it in took centre stage for the financial markets during much of the 12 months ended 30 June 2023. Early in the period, energy prices continued to cool amid an outlook for slower economic growth, but price increases then broadened to other categories, notably the services sector, which felt the effects of tight labour markets. Central banks including the US Federal Reserve, the European Central Bank and the Bank of England reacted to the prospect of inflation remaining stubbornly high by aggressively hiking interest rates even as their actions fanned fears of a global recession down the road. • Although progress was slow, signs of inflation moderating later in the period led several major central banks to slow the pace of their interest rate hikes or even hit the pause button. Bonds suffered early in the fiscal year amid aggressive rate hiking and later when markets began to anticipate that rates would remain higher for longer. With rising yields pushing prices down, global bonds ended the 12 months in negative territory. • In this environment, the ETF ’s benchmark index returned –0.09% for the fiscal year. • By country, the United States, the largest constituent in the index by far, underperformed the index, returning –0.94%. Belgium, the UK and the Netherlands were also among the laggards. Canada, Japan, China and South Korea outperformed and were in positive territory. • By sector, corporate and non-corporate bonds fared better than government bonds and US mortgage-backed securities. By credit quality, bonds on the bottom rung of the investment-grade ladder tended to perform better than higher-rated bonds. By maturity, longer-dated bonds lagged. Benchmark returns in the commentary above are in US dollars.',
684
- 'Note 26. Credit Concentrations The firm’s concentrations of credit risk arise from its market- making, client facilitation, investing, underwriting, lending and collateralized transactions, and cash management activities, and may be impacted by changes in economic, industry or political factors. These activities expose the firm to many different industries and counterparties, and may also subject the firm to a concentration of credit risk to a particular central bank, counterparty, borrower or issuer, including sovereign issuers, or to a particular clearinghouse or exchange. The firm seeks to mitigate credit risk by actively monitoring exposures and obtaining collateral from counterparties as deemed appropriate. The firm measures and monitors its credit exposure based on amounts owed to the firm after taking into account risk mitigants that the firm considers when determining credit risk. Such risk mitigants include netting and collateral arrangements and economic hedges, such as credit derivatives, futures and forward contracts. Netting and collateral agreements permit the firm to offset receivables and payables with such counterparties and/or enable the firm to obtain collateral on an upfront or contingent basis. The table below presents the credit concentrations included in trading cash instruments and investments. As of December $ in millions 2023 2022 U.S. government and agency obligations $ 260,531 $ 205,935 Percentage of total assets 15.9% 14.3% Non-U.S. government and agency obligations $ 90,681 $ 40,334 Percentage of total assets 5.5% 2.8% In addition, the firm had $206.07 billion as of December 2023 and $208.53 billion as of December 2022 of cash deposits held at central banks (included in cash and cash equivalents), of which $105.66 billion as of December 2023 and $165.77 billion as of December 2022 was held at the Federal Reserve. As of both December 2023 and December 2022 , the firm did not have credit exposure to any other counterparty that exceeded 2% of total assets. Collateral obtained by the firm related to derivative assets is principally cash and is held by the firm or a third-party custodian. Collateral obtained by the firm related to resale agreements and securities borrowed transactions is primarily U.S. government and agency obligations, and non-U.S. government and agency obligations. See Note 11 for further information about collateralized agreements and financings. The table below presents U.S. government and agency obligations, and non-U.S. government and agency obligations that collateralize resale agreements and securities borrowed transactions. As of December $ in millions 2023 2022 U.S. government and agency obligations $ 154,056 $ 164,897 Non-U.S. government and agency obligations $ 92,833 $ 76,456 In the table above: •Non-U.S. government and agency obligations primarily consists of securities issued by the governments of the U.K., Japan, Germany, France and Italy. •Given that the firm’s primary credit exposure on such transactions is to the counterparty to the transaction, the firm would be exposed to the collateral issuer only in the event of counterparty default. Note 27. Legal Proceedings The firm is involved in a number of judicial, regulatory and arbitration proceedings (including those described below) concerning matters arising in connection with the conduct of the firm’s businesses. Many of these proceedings are in early stages, and many of these cases seek an indeterminate amount of damages. Under ASC 450, an event is “reasonably possible” if “the chance of the future event or events occurring is more than remote but less than likely” and an event is “remote” if “the chance of the future event or events occurring is slight.” Thus, references to the upper end of the range of reasonably possible loss for cases in which the firm is able to estimate a range of reasonably possible loss mean the upper end of the range of loss for cases for which the firm believes the risk of loss is more than slight. THE GOLDMAN SACHS GROUP, INC. AND SUBSIDIARIES Notes to Consolidated Financial Statements 216 Goldman Sachs 2023 Form 10-K',
685
  ]
686
  embeddings = model.encode(sentences)
687
  print(embeddings.shape)
@@ -693,253 +41,31 @@ print(similarities.shape)
693
  # [3, 3]
694
  ```
695
 
696
- <!--
697
- ### Direct Usage (Transformers)
698
-
699
- <details><summary>Click to see the direct usage in Transformers</summary>
700
-
701
- </details>
702
- -->
703
-
704
- <!--
705
- ### Downstream Usage (Sentence Transformers)
706
-
707
- You can finetune this model on your own dataset.
708
-
709
- <details><summary>Click to expand</summary>
710
-
711
- </details>
712
- -->
713
-
714
- <!--
715
- ### Out-of-Scope Use
716
-
717
- *List how the model may foreseeably be misused and address what users ought not to do with the model.*
718
- -->
719
-
720
- ## Evaluation
721
-
722
- ### Metrics
723
-
724
- #### Information Retrieval
725
-
726
- * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
727
 
728
- | Metric | Value |
729
- |:--------------------|:-----------|
730
- | cosine_accuracy@1 | 0.4419 |
731
- | cosine_accuracy@3 | 0.6753 |
732
- | cosine_accuracy@5 | 0.7626 |
733
- | cosine_accuracy@10 | 0.8507 |
734
- | cosine_precision@1 | 0.4419 |
735
- | cosine_precision@3 | 0.2251 |
736
- | cosine_precision@5 | 0.1525 |
737
- | cosine_precision@10 | 0.0851 |
738
- | cosine_recall@1 | 0.4419 |
739
- | cosine_recall@3 | 0.6753 |
740
- | cosine_recall@5 | 0.7626 |
741
- | cosine_recall@10 | 0.8507 |
742
- | cosine_ndcg@10 | 0.6432 |
743
- | cosine_mrr@10 | 0.577 |
744
- | **cosine_map@100** | **0.5835** |
745
- | dot_accuracy@1 | 0.4419 |
746
- | dot_accuracy@3 | 0.6753 |
747
- | dot_accuracy@5 | 0.7626 |
748
- | dot_accuracy@10 | 0.8507 |
749
- | dot_precision@1 | 0.4419 |
750
- | dot_precision@3 | 0.2251 |
751
- | dot_precision@5 | 0.1525 |
752
- | dot_precision@10 | 0.0851 |
753
- | dot_recall@1 | 0.4419 |
754
- | dot_recall@3 | 0.6753 |
755
- | dot_recall@5 | 0.7626 |
756
- | dot_recall@10 | 0.8507 |
757
- | dot_ndcg@10 | 0.6432 |
758
- | dot_mrr@10 | 0.577 |
759
- | dot_map@100 | 0.5835 |
760
 
761
- <!--
762
- ## Bias, Risks and Limitations
763
 
764
- *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
765
- -->
766
 
767
- <!--
768
- ### Recommendations
769
-
770
- *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
771
- -->
772
-
773
- ## Training Details
774
-
775
- ### Training Dataset
776
-
777
- #### Unnamed Dataset
778
-
779
-
780
- * Size: 98,400 training samples
781
- * Columns: <code>sentence_0</code> and <code>sentence_1</code>
782
- * Approximate statistics based on the first 1000 samples:
783
- | | sentence_0 | sentence_1 |
784
- |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
785
- | type | string | string |
786
- | details | <ul><li>min: 13 tokens</li><li>mean: 24.55 tokens</li><li>max: 57 tokens</li></ul> | <ul><li>min: 26 tokens</li><li>mean: 468.34 tokens</li><li>max: 512 tokens</li></ul> |
787
- * Samples:
788
- | sentence_0 | sentence_1 |
789
- |:---------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
790
- | <code>What is the potential impact of converting Series A Preferred Stock on the Company's capital structure?</code> | <code>Company’s subsidiaries or consolidated affiliates) and implementing capital standards published by the Basel Committee on Banking Supervision, the SEC, the Board of Governors of the Federal Reserve System (the “Federal Reserve Board”) or any other United States national governmental body, or any other applicable regime based on capital standards published by the Basel Committee on Banking Supervision or its successor, or (iii) provides for a type of capital that in the Company’s judgment (after consultation with counsel of recognized standing) is substantially equivalent to such “Tier 1” capital (such capital described in either (ii) or (iii) is referred to below as “Tier 1 Capital Equivalent”), and • the Company affirmatively elects to qualify the Series A Preferred Stock for such Allowable Capital or Tier 1 Capital Equivalent treatment without any sublimit or other quantitative restriction on the inclusion of the Series A Preferred Stock in Allowable Capital or Tier 1 Capital Equivalent (other than any limitation the Company elects to accept and any limitation requiring that common equity or a specified form of common equity constitute the dominant form of Allowable Capital or Tier 1 Capital Equivalent) under such regulations, then, upon such affirmative election, the Series A Preferred Stock shall be convertible at the Company’s option into a new series of preferred stock having terms and provisions substantially identical to those of the Series A Preferred Stock, except that such new series may have such additional or modified rights, preferences, privileges and voting powers, and such limitations and restrictions thereof, as are necessary, in the Company’s judgment (after consultation with counsel of recognized standing), to comply with the Required Unrestricted Capital Provisions (defined below), provided that the Company will not cause any such conversion unless the Company determines that the rights, preferences, privileges and voting powers of such new series of preferred stock, taken as a whole, are not materially less favorable to the holders thereof than the rights, preferences, privileges and voting powers of the Series A Preferred Stock, taken as a whole. For example, the Company could agree to restrict its ability to pay dividends on or redeem the new series of preferred stock for a specified period or indefinitely, to the extent permitted by the terms and provisions of the new series of preferred stock, since such a restriction would be permitted in the Company’s discretion under the terms and provisions of the Series A Preferred Stock. The Company will provide notice to holders of the Series A Preferred Stock of any election to qualify the Series A Preferred Stock for Allowable Capital or Tier 1 Capital Equivalent treatment and of any determination to convert the Series A Preferred Stock into a new series of preferred stock, promptly upon the effectiveness of any such election or determination. A copy of any such notice and of the relevant regulations will be on file at the Company’s principal offices and, upon request, will be made available to any stockholder. As used above, the term “Required Unrestricted Capital Provisions” means the terms that are, in the Company’s judgment (after consultation with counsel of recognized standing), required for preferred stock to be treated as Allowable Capital or Tier 1 Capital Equivalent, as applicable, without any sublimit or other quantitative restriction on the inclusion of such preferred stock in Allowable Capital or Tier 1 Capital Equivalent (other than any limitation the Company elects to accept and any limitation requiring that common equity or a specified form of common equity constitute the dominant form of Allowable Capital or Tier 1 Capital Equivalent) pursuant to applicable regulations. Voting Rights Except as provided below, the holders of the Series A Preferred Stock have no voting rights. Whenever dividends on any shares of the Series A Preferred Stock shall have not been declared and paid for the equivalent of six or more dividend payments, whether or not for consecutive dividend periods (as used in this section, a “Nonpayment”), the holders of such shares, voting together as a class with holders of any and all other series of voting preferred stock (as defined below) then outstanding, will be entitled to vote for the election of a total of two additional members of the Company’s board of directors (as used in this section, the “Preferred Stock Directors”), provided that the election of any such directors shall not cause the Company to violate the corporate governance requirement of the New York Stock Exchange (or any other exchange on which the Company’s securities may be listed) that listed companies must have a majority of independent directors.</code> |
791
- | <code>What is the age limit for general partners to serve in their capacity according to the Partnership Agreement?</code> | <code>PART III 74 ITEM 10. DIRECTORS, EXECUTIVE OF FICERS AND CORPORATE GOVERNANCE JFC does not have a board of directors. As of February 23, 2024, the Partners hip was composed of 33,857 individual partners, many of whom hold more than one type of partnership interest. Of those individuals, as of February 23, 2024, 610 were general partners, and 33,687 were limited par tners and 720 were subordinated limited partners. Managing Partner. Under the terms of the Partner ship Agreement, the Managing Partne r has primary responsibility for administering the Partnership’s business, determining its policies, and controlling the management and conduct of the Partnership’s business. Under the terms of the Partnership Agreement, the Managi ng Partner's powers include, without limitation, the power to admit and dismiss gene ral partners and the power to adjust t he proportion of their respective interest s in the Partnership. The Managing Partner serves for an indefin ite term and may be removed by a majority vote of the ELT (as discussed below) or a vote of the general partners holding a majority percentage ownership in the Partnership. If at any time the office of the Managing Partner is vacant, the ELT will succeed to all the powers and duties of the Managing Partner until a new Managing Partner is elected by a majority of the EL T. The Partnership’s operating subsidiaries are managed by JFC, under the leadership of the Managing Part ner, pursuant to services agreements. Enterprise Leadership Team. The ELT consists of the Managing Partner and up to 15 additional general partners appointed by the Managing Partner, with the specific number determined by the Managing Partner. Under the terms of the Partnership Agreement, the members of the ELT are the executive officers of the Partnership. The purpose of the ELT is to provide counsel and advice to the Managing Partner in discharging thei r functions, including the consideration of ownership of Partnership capital, ensuring the Part nership’s business risks are managed approp riately and helping to establish the strategic direction of the Partne rship. In addition, the ELT takes an active ro le in identifying, measuring and controlling the risks to which the Partnership is subject. ELT members serve for an indefinite term and may be removed by the Managing Partner or a vote of general partners holding a majority percent age ownership in the Partnership. Furthermore, in the event the position of Managing Partner is vacant, the ELT shall su cceed to all of the powers and duties of the Managing Partner until a new Managing Partner is elected by a majority of the ELT. The Partnership does not have a formal code of ethics that applies to its ELT members, as it relies on the core values and beliefs of the Partnership, as well as the Partnership Agreement. Throughout all of 2023, the ELT included Penny Pennington, Chairman, Andrew Miedler, Kenneth Cella, Jr., David Chubak, Lisa Dolan, David Gunn, Lena Haas, Tina Hrevus, Kristin Johnson, Francis LaQuinta, Hasan Malik, Suzan McDaniel, Timothy Rea and Wayne Roberts. Chris Lewis also serv ed as a member of the ELT prior to his retirement effective March 1, 2023. Effective January 8, 2024, the Managing Part ner appointed Keir Gumbs, General Counsel, to the ELT. The following table is a listing as of February 23, 2024 of t he members of the ELT, the year in which each member became a general partner and each member’s area of responsibility. Under the terms of the Partnership Agreement, all general partners, including the members of the ELT, are required to reti re in their capacity as general partners by the end of the calendar year during which they turn the age of 65. The members’ biographies are below. Enterprise General Name Age Leadership Team Partner Area of Responsibilit y Penn y Pennin gton 60 2014 2006 Mana ging Partner Andrew Miedler 46 2021 2011 Chief Financial Officer Kenneth Cella, Jr. 54 2014 2002 Head of External Affairs and Communit y Engagemen t David Chubak 43 2022 2022 Head of U.S.</code> |
792
- | <code>How does the face value of the bond issued by Biogen, Inc. compare to that of the bond issued by KLA Corp.?</code> | <code>$20,000 1.65% 11/5/2030 16,844 0.00% Wyeth LLC $15,000 6.50% 1/2/2034 16,841 0.00% VeriSign, Inc. $20,000 2.70% 15/6/2031 16,655 0.00% Walgreens Boots Alliance, Inc. $20,000 4.80% 18/11/2044 16,573 0.00% Amgen, Inc. $20,000 4.20% 22/2/2052 16,571 0.00% UnitedHealth Group, Inc. $20,000 3.75% 15/10/2047 16,447 0.00% Motorola Solutions, Inc. $20,000 2.75% 24/5/2031 16,400 0.00% DH Europe Finance II Sarl $20,000 3.25% 15/11/2039 16,294 0.00% Corning, Inc. $20,000 4.38% 15/11/2057 16,262 0.00% Citigroup, Inc. CAD22,000 4.09% 9/6/2025 16,050 0.00% Humana, Inc. $20,000 2.15% 3/2/2032 15,781 0.00% Aptiv plc $25,000 3.10% 1/12/2051 15,700 0.00% JPMorgan Chase & Co. $15,000 5.50% 15/10/2040 15,465 0.00% Citigroup, Inc. $19,000 2.52% 3/11/2032 15,313 0.00% Bank of New York Mellon Corp. $15,000 5.80% 25/10/2028 15,280 0.00% QUALCOMM, Inc. $20,000 3.25% 20/5/2050 15,032 0.00% Biogen, Inc. $15,000 5.20% 15/9/2045 14,997 0.00% KLA Corp. $15,000 4.65% 15/7/2032 14,947 0.00% Stanley Black & Decker, Inc. $25,000 2.75% 15/11/2050 14,896 0.00% Simon Property Group LP $20,000 3.80% 15/7/2050 14,796 0.00% Wachovia Corp. $15,000 5.50% 1/8/2035 14,693 0.00% Alexandria Real Estate Equities, Inc. $20,000 1.88% 1/2/2033 14,636 0.00% Crown Castle, Inc. $15,000 3.20% 1/9/2024 14,536 0.00% Motorola Solutions, Inc. $15,000 4.60% 23/5/2029 14,515 0.00% Morgan Stanley $15,000 3.88% 27/1/2026 14,476 0.00% Truist Financial Corp. $15,000 2.50% 1/8/2024 14,466 0.00% Royalty Pharma plc $20,000 3.30% 2/9/2040 14,239 0.00% Boston Properties LP $15,000 3.20% 15/1/2025 14,227 0.00% Texas Instruments, Inc. $15,000 1.38% 12/3/2025 14,092 0.00% American Express Credit Corp. $15,000 3.30% 3/5/2027 14,079 0.00% McCormick & Co., Inc.</code> |
793
- * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
794
- ```json
795
- {
796
- "scale": 20.0,
797
- "similarity_fct": "cos_sim"
798
- }
799
- ```
800
 
801
  ### Training Hyperparameters
802
- #### Non-Default Hyperparameters
803
-
804
- - `eval_strategy`: steps
805
- - `per_device_train_batch_size`: 200
806
- - `per_device_eval_batch_size`: 200
807
- - `num_train_epochs`: 10
808
- - `batch_sampler`: no_duplicates
809
- - `multi_dataset_batch_sampler`: round_robin
810
-
811
- #### All Hyperparameters
812
- <details><summary>Click to expand</summary>
813
-
814
- - `overwrite_output_dir`: False
815
- - `do_predict`: False
816
- - `eval_strategy`: steps
817
- - `prediction_loss_only`: True
818
- - `per_device_train_batch_size`: 200
819
- - `per_device_eval_batch_size`: 200
820
- - `per_gpu_train_batch_size`: None
821
- - `per_gpu_eval_batch_size`: None
822
- - `gradient_accumulation_steps`: 1
823
- - `eval_accumulation_steps`: None
824
- - `learning_rate`: 5e-05
825
- - `weight_decay`: 0.0
826
- - `adam_beta1`: 0.9
827
- - `adam_beta2`: 0.999
828
- - `adam_epsilon`: 1e-08
829
- - `max_grad_norm`: 1
830
- - `num_train_epochs`: 10
831
- - `max_steps`: -1
832
- - `lr_scheduler_type`: linear
833
- - `lr_scheduler_kwargs`: {}
834
- - `warmup_ratio`: 0.0
835
- - `warmup_steps`: 0
836
- - `log_level`: passive
837
- - `log_level_replica`: warning
838
- - `log_on_each_node`: True
839
- - `logging_nan_inf_filter`: True
840
- - `save_safetensors`: True
841
- - `save_on_each_node`: False
842
- - `save_only_model`: False
843
- - `restore_callback_states_from_checkpoint`: False
844
- - `no_cuda`: False
845
- - `use_cpu`: False
846
- - `use_mps_device`: False
847
- - `seed`: 42
848
- - `data_seed`: None
849
- - `jit_mode_eval`: False
850
- - `use_ipex`: False
851
- - `bf16`: False
852
- - `fp16`: False
853
- - `fp16_opt_level`: O1
854
- - `half_precision_backend`: auto
855
- - `bf16_full_eval`: False
856
- - `fp16_full_eval`: False
857
- - `tf32`: None
858
- - `local_rank`: 0
859
- - `ddp_backend`: None
860
- - `tpu_num_cores`: None
861
- - `tpu_metrics_debug`: False
862
- - `debug`: []
863
- - `dataloader_drop_last`: False
864
- - `dataloader_num_workers`: 0
865
- - `dataloader_prefetch_factor`: None
866
- - `past_index`: -1
867
- - `disable_tqdm`: False
868
- - `remove_unused_columns`: True
869
- - `label_names`: None
870
- - `load_best_model_at_end`: False
871
- - `ignore_data_skip`: False
872
- - `fsdp`: []
873
- - `fsdp_min_num_params`: 0
874
- - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
875
- - `fsdp_transformer_layer_cls_to_wrap`: None
876
- - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
877
- - `deepspeed`: None
878
- - `label_smoothing_factor`: 0.0
879
- - `optim`: adamw_torch
880
- - `optim_args`: None
881
- - `adafactor`: False
882
- - `group_by_length`: False
883
- - `length_column_name`: length
884
- - `ddp_find_unused_parameters`: None
885
- - `ddp_bucket_cap_mb`: None
886
- - `ddp_broadcast_buffers`: False
887
- - `dataloader_pin_memory`: True
888
- - `dataloader_persistent_workers`: False
889
- - `skip_memory_metrics`: True
890
- - `use_legacy_prediction_loop`: False
891
- - `push_to_hub`: False
892
- - `resume_from_checkpoint`: None
893
- - `hub_model_id`: None
894
- - `hub_strategy`: every_save
895
- - `hub_private_repo`: False
896
- - `hub_always_push`: False
897
- - `gradient_checkpointing`: False
898
- - `gradient_checkpointing_kwargs`: None
899
- - `include_inputs_for_metrics`: False
900
- - `eval_do_concat_batches`: True
901
- - `fp16_backend`: auto
902
- - `push_to_hub_model_id`: None
903
- - `push_to_hub_organization`: None
904
- - `mp_parameters`:
905
- - `auto_find_batch_size`: False
906
- - `full_determinism`: False
907
- - `torchdynamo`: None
908
- - `ray_scope`: last
909
- - `ddp_timeout`: 1800
910
- - `torch_compile`: False
911
- - `torch_compile_backend`: None
912
- - `torch_compile_mode`: None
913
- - `dispatch_batches`: None
914
- - `split_batches`: None
915
- - `include_tokens_per_second`: False
916
- - `include_num_input_tokens_seen`: False
917
- - `neftune_noise_alpha`: None
918
- - `optim_target_modules`: None
919
- - `batch_eval_metrics`: False
920
- - `eval_on_start`: False
921
- - `batch_sampler`: no_duplicates
922
- - `multi_dataset_batch_sampler`: round_robin
923
-
924
- </details>
925
-
926
- ### Training Logs
927
- | Epoch | Step | Training Loss | cosine_map@100 |
928
- |:------:|:----:|:-------------:|:--------------:|
929
- | 0.2033 | 100 | - | 0.5090 |
930
- | 0.4065 | 200 | - | 0.5376 |
931
- | 0.6098 | 300 | - | 0.5487 |
932
- | 0.8130 | 400 | - | 0.5595 |
933
- | 1.0 | 492 | - | 0.5716 |
934
- | 1.0163 | 500 | 2.1266 | 0.5707 |
935
- | 1.2195 | 600 | - | 0.5745 |
936
- | 1.4228 | 700 | - | 0.5784 |
937
- | 1.6260 | 800 | - | 0.5789 |
938
- | 1.8293 | 900 | - | 0.5807 |
939
- | 2.0 | 984 | - | 0.5835 |
940
 
 
 
 
 
 
 
 
 
 
 
941
 
942
  ### Framework Versions
 
943
  - Python: 3.10.13
944
  - Sentence Transformers: 3.0.1
945
  - Transformers: 4.42.3
@@ -948,49 +74,27 @@ You can finetune this model on your own dataset.
948
  - Datasets: 2.20.0
949
  - Tokenizers: 0.19.1
950
 
951
- ## Citation
952
 
953
- ### BibTeX
954
 
955
- #### Sentence Transformers
956
- ```bibtex
957
- @inproceedings{reimers-2019-sentence-bert,
958
- title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
959
- author = "Reimers, Nils and Gurevych, Iryna",
960
- booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
961
- month = "11",
962
- year = "2019",
963
- publisher = "Association for Computational Linguistics",
964
- url = "https://arxiv.org/abs/1908.10084",
965
- }
966
- ```
967
 
968
- #### MultipleNegativesRankingLoss
969
- ```bibtex
970
- @misc{henderson2017efficient,
971
- title={Efficient Natural Language Response Suggestion for Smart Reply},
972
- author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
973
- year={2017},
974
- eprint={1705.00652},
975
- archivePrefix={arXiv},
976
- primaryClass={cs.CL}
977
- }
978
- ```
979
-
980
- <!--
981
- ## Glossary
982
 
983
- *Clearly define terms in order to be accessible across audiences.*
984
- -->
985
 
986
- <!--
987
- ## Model Card Authors
988
 
989
- *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
990
- -->
 
 
 
 
 
 
991
 
992
- <!--
993
- ## Model Card Contact
994
 
995
- *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
996
- -->
 
1
+ # Model Card: sujet-ai/Marsilia-Embeddings-EN-Base 🚀
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
+ ## Introduction 🌟
4
 
5
+ **Marsilia-Embeddings-EN-Base** is an English language embedding model specifically designed for financial domain tasks. This model serves as a proof of concept, demonstrating the critical importance of fine-tuning embedding models for specific tasks in Retrieval-Augmented Generation (RAG) applications.
6
 
7
+ By focusing on the financial domain, Marsilia-Embeddings-EN-Base achieves performance that surpasses even closed-source models like OpenAI's embeddings, while offering a more cost-effective solution. This showcases how targeted fine-tuning can dramatically enhance the capabilities of open-source models, making them competitive with or even superior to proprietary alternatives in specialized domains.
8
+
9
+ ## Model Details 📊
10
 
 
11
  - **Model Type:** Sentence Transformer
12
+ - **Language:** English 🇬🇧
13
+ - **Base Model:** [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5)
14
  - **Maximum Sequence Length:** 512 tokens
15
+ - **Output Dimensionality:** 768
16
  - **Similarity Function:** Cosine Similarity
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
 
18
+ ## Usage 💻
19
 
20
+ To use this model with the Sentence Transformers library:
 
 
21
 
 
22
  ```python
23
  from sentence_transformers import SentenceTransformer
24
 
25
  # Download from the 🤗 Hub
26
+ model = SentenceTransformer("sujet-ai/Marsilia-Embeddings-EN-Base")
27
+
28
  # Run inference
29
  sentences = [
30
+ 'What are the key factors affecting the performance of corporate bonds in the current market?',
31
+ 'The corporate bond market has been influenced by several factors in recent months. Interest rates set by central banks have a significant impact, as rising rates tend to decrease bond prices and increase yields. Economic indicators such as GDP growth, inflation rates, and employment figures also play a role in shaping investor sentiment and corporate financial health. Industry-specific trends and individual company performance are crucial, with factors like earnings reports, credit ratings, and debt levels affecting bond valuations. Global events, including geopolitical tensions and trade policies, can create market volatility. Liquidity in the bond market and overall investor risk appetite are additional considerations. It's important for investors to monitor these various factors when assessing corporate bond performance.',
32
+ 'CORPORATE BOND HOLDINGS (Continued) Principal Amount (000) Coupon Rate Maturity Date Market Value ($000) Vanguard Short-Term Corporate Bond ETF Bank of America Corp. 2,285 5.015% 1/22/24 2,285 JPMorgan Chase & Co. 2,250 3.875% 2/1/24 2,249 Goldman Sachs Group Inc. 2,200 3.750% 2/25/24 2,197 Morgan Stanley 2,190 3.875% 1/27/24 2,189 Citigroup Inc. 2,145 3.875% 3/26/24 2,141 Wells Fargo & Co. 2,100 3.750% 1/24/24 2,099 Bank of America Corp. 2,050 4.000% 4/1/24 2,047 Truist Bank 2,000 3.800% 10/30/23 2,000 PNC Bank NA 1,950 3.800% 7/25/23 1,950 U.S. Bancorp 1,900 3.375% 2/5/24 1,896 Bank of America Corp. 1,850 4.125% 1/22/24 1,850 Morgan Stanley 1,800 3.737% 4/24/24 1,795 Citigroup Inc. 1,750 3.668% 7/24/24 1,740 Goldman Sachs Group Inc. 1,700 3.625% 1/22/23 1,700 Wells Fargo & Co. 1,650 3.550% 8/14/23 1,650 JPMorgan Chase & Co. 1,600 3.875% 9/10/24 1,593'
33
  ]
34
  embeddings = model.encode(sentences)
35
  print(embeddings.shape)
 
41
  # [3, 3]
42
  ```
43
 
44
+ ## Intended Use 🎯
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
45
 
46
+ This model is designed for generating sentence embeddings for English text, particularly in the financial domain. It can be used for various natural language processing tasks such as semantic search, clustering, and information retrieval.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
47
 
48
+ ## Training Data 📚
 
49
 
50
+ The model was fine-tuned on the [sujet-ai/Sujet-Financial-RAG-EN-Dataset](https://huggingface.co/datasets/sujet-ai/Sujet-Financial-RAG-EN-Dataset). This dataset consists of question-context pairs in English, focusing on financial topics.
 
51
 
52
+ ## Training Procedure 🛠️
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
53
 
54
  ### Training Hyperparameters
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
55
 
56
+ - **Loss Function:** MultipleNegativesRankingLoss
57
+ - Scale: 20.0
58
+ - Similarity Function: Cosine Similarity
59
+ - **Evaluation Strategy:** Steps
60
+ - **Per Device Train Batch Size:** 200
61
+ - **Per Device Eval Batch Size:** 200
62
+ - **Number of Train Epochs:** 10
63
+ - **Batch Sampler:** no_duplicates
64
+ - **Multi Dataset Batch Sampler:** round_robin
65
+ - **Scheduler:** Warmup cosine
66
 
67
  ### Framework Versions
68
+
69
  - Python: 3.10.13
70
  - Sentence Transformers: 3.0.1
71
  - Transformers: 4.42.3
 
74
  - Datasets: 2.20.0
75
  - Tokenizers: 0.19.1
76
 
77
+ ## Evaluation 📈
78
 
79
+ The model was evaluated using the `InformationRetrievalEvaluator` on the test split of the [sujet-ai/Sujet-Financial-RAG-EN-Dataset](https://huggingface.co/datasets/sujet-ai/Sujet-Financial-RAG-EN-Dataset).
80
 
81
+ ## Limitations ⚠️
 
 
 
 
 
 
 
 
 
 
 
82
 
83
+ The model is specifically trained on English financial texts and may not perform optimally on other domains or languages. Users should be aware of potential biases present in the training data.
 
 
 
 
 
 
 
 
 
 
 
 
 
84
 
85
+ ## Citation 📄
 
86
 
87
+ If you use this model in your research or applications, please cite:
 
88
 
89
+ ```bibtex
90
+ @software{Marsilia-Embeddings-EN-Base,
91
+ author = {Sujet AI, Allaa Boutaleb, Hamed Rahimi},
92
+ title = {Marsilia-Embeddings-EN-Base: A fine-tuned English embedding model for financial texts},
93
+ year = {2024},
94
+ url = {https://huggingface.co/sujet-ai/Marsilia-Embeddings-EN-Base}
95
+ }
96
+ ```
97
 
98
+ ## Contact Information 📧
 
99
 
100
+ For questions, feedback, or collaborations, please reach out to us on [LinkedIn](https://www.linkedin.com/company/sujet-ai/) or visit our website [https://sujet.ai](https://sujet.ai).