SentenceTransformer based on BAAI/bge-base-en-v1.5

This is a sentence-transformers model finetuned from BAAI/bge-base-en-v1.5. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: BAAI/bge-base-en-v1.5
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
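
The Pooling and Normalize stages above can be illustrated with a minimal sketch in plain Python. It assumes toy 4-dimensional token vectors rather than real BertModel outputs, and the helper names `cls_pool` and `l2_normalize` are hypothetical, not part of the library. With `pooling_mode_cls_token` set to True, the sentence embedding is the vector of the first ([CLS]) token, which Normalize() then scales to unit length.

```python
import math

def cls_pool(token_embeddings):
    """Return the vector of the first ([CLS]) token as the sentence embedding."""
    return token_embeddings[0]

def l2_normalize(vector):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]

# Toy token embeddings for one sentence: 3 tokens, 4 dimensions (illustrative values)
token_embeddings = [
    [3.0, 0.0, 4.0, 0.0],  # [CLS] token
    [1.0, 1.0, 1.0, 1.0],
    [0.5, 2.0, 0.0, 1.0],
]

sentence_embedding = l2_normalize(cls_pool(token_embeddings))
print(sentence_embedding)  # [0.6, 0.0, 0.8, 0.0]
```

Because the output has unit length, the cosine similarity between two such embeddings equals their dot product.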

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("aritrasen/bge-base-en-v1.5-finetuned_ragds_v1")
# Run inference
sentences = [
    # The passage is written as two adjacent string literals, which Python concatenates
    'Early this year, there was a buzz on Motorola inviting T-Mobile Moto X owners to take part in a soak test for possible future update. Motorola seemed skeptical in disclosing facts at that point of time but since Moto G was recently upgraded to Android 4.4.2; enthusiasts anticipated the same for T-Mobile Moto X. And it turned out to be true.\nNews Update\nThis T-Mobile version of Moto X is now receiving the upgrade which is a file size of 147.6 MB. The Android 4.4.2 is the latest version of KitKat that includes all the goodies from the earlier installments, plus a few additions. The good news is, Motorola has customized the whole package and made a few tweaks into the update. The Software Version bumped to 161.44.25 and the notable changes are listed as below:\n- It added substantial support for services like printing photos, Google Docs, Gmail messages and other such content via Wi-Fi, Bluetooth and hosted services such as HP ePrinters and Google Cloud Print.\n- It fixed all the bugs identified during the preliminary runs, including the ones that caused a few users to experience short battery life after upgrading to KitKat.\n- Another bug that caused delays in synchronizing email services like Microsoft Exchange was resolved, thus adding to the convenience of the user.\nThis is a noteworthy upgrade, considering the fact that bugs and errors were fixed. Mobile addicts across the world will rejoice, for they can experience the smartness of Android KitKat flawlessly in their devices. This is significant development in terms of update.\nThis variant is an unlocked GSM device so chances are, you can use it on networks of other service providers. In all probability, the update should not be affected and the installation should hardly take much time. The T-Mobile Moto X Android update is now available for manual download. '
    'It is accessible in the following sequential way:\n- Click on Settings\n- Click on About Phone\n- Click on System Updates\n- Click on Download\nRecommendations\nFor ensuring a successful installation, it is highly recommended to install this update with at least 50% battery and a strong connectivity; preferably Wi-Fi. Follow the notification message and select download-> once the download is over, select Install-> Once the installation is over, and the phone will automatically restart. This marks the completion of the installation process. The phone is now updated to 161.44.25 – This build is same as the soak test.\nThis upgrade is free in the carrier network and Motorola and Google has collaborated for a back up service for those in trouble. In case of distress, a user can contact them through the Moto X web interface and avail the service. There is still no news on other carrier variants of this update but we can safely hope that it will roll out very soon. Though the upgrade doesn’t appeal in terms of version number but it is definitely significant for users to live with the latest KitKat.',
    'What are some of the notable changes in the T-Mobile Moto X update?',
    'Who is the editor of "The Routledge Handbook of Tourism Geographies"?',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
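
To make the shape and meaning of the similarity matrix concrete, here is a minimal sketch in plain Python. It assumes hypothetical 3-dimensional vectors in place of the real 768-dimensional embeddings, and `cosine_similarity` is an illustrative helper, not the library's implementation. Entry (i, j) of the matrix is the cosine similarity between sentence i and sentence j, so the first row can be used to rank the two questions against the passage.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical stand-ins for the three sentence embeddings (not real model output)
toy_embeddings = [
    [0.6, 0.8, 0.0],  # passage
    [0.8, 0.6, 0.0],  # question about the passage
    [0.0, 0.0, 1.0],  # unrelated question
]

# 3 x 3 matrix of pairwise scores, the same shape as `similarities` above
similarity_matrix = [
    [cosine_similarity(u, v) for v in toy_embeddings] for u in toy_embeddings
]

# Rank the two questions (indices 1 and 2) by similarity to the passage (index 0)
ranked = sorted([1, 2], key=lambda i: similarity_matrix[0][i], reverse=True)
print(ranked)  # [1, 2]
```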

Training Details

Training Dataset

Unnamed Dataset

  • Size: 9,598 training samples
  • Columns: positive and anchor
  • Approximate statistics based on the first 1000 samples:
              positive         anchor
    type      string           string
    min       172 tokens       7 tokens
    mean      467.87 tokens    18.68 tokens
    max       512 tokens       43 tokens
  • Samples:
    positive (passage text), then anchor (question, last line of each sample)
    Caption: Tasmanian berry grower Nic Hansen showing Macau chef Antimo Merone around his property as part of export engagement activities.
    THE RISE and rise of the Australian strawberry, raspberry and blackberry industries has seen the sectors redouble their international trade focus, with the release of a dedicated export plan to grow their global presence over the next 10 years.
    Driven by significant grower input, the Berry Export Summary 2028 maps the sectors’ current position, where they want to be, high-opportunity markets and next steps.
    Hort Innovation trade manager Jenny Van de Meeberg said the value and volume of raspberry and blackberry exports rose by 100 per cent between 2016 and 2017. She said the Australian strawberry industry experienced similar success with an almost 30 per cent rise in export volume and a 26 per cent rise in value to $32.6M over the same period.
    “Australian berry sectors are in a firm position at the moment,” she said. “Production, adoption of protected substrate cropping, improved genetics and an expanding geographic footprint have all helped put Aussie berries on a positive trajectory.
    “We are seeing a real transition point. Broad industry interest and a strong commercial appetite for export market development combined with the potential to capitalise on existing trade agreements and build new trade partnerships has created this perfect environment for growth.”
    High-income countries across Europe, North America and Northern Asia have been identified as having a palate for Australian grown berries with more than 4244 tonnes of fresh berries exported in the last financial year alone.
    The strategy identified the best short-term prospect markets for the Australian blackberry and raspberry industry as Hong Kong, Singapore, The United Arab Emirates and Canada. The strongest short-term trade options identified for the strawberry sector were Thailand, Malaysia, New Zealand and Macau.
    The strategy focuses heavily on growing the existing strawberry export market from 4 per cent to at least 8 per cent of national production by volume, in markets with a capacity and willingness to pay a premium for quality fruit. For raspberries and blackberries, the sectors aim to achieve a 5 per cent boost in exports assessed by volume across identified markets by 2021.
    Tasmanian raspberry exporter Nic Hansen said Australia offers some of the sweetest and most attractive berries in the world, and this combined with our stringent food safety standards across all stages of the supply chain puts growers in a solid position.
    “We have a great product, we are hungry to expand trade and now with this new plan in place, we have a clear roadmap towards driving growth,” Mr Hansen said.
    He said it is exciting to see new export market prospects for raspberries: “The more options we have for export the better. Now we just have to get on with the job of ensuring industry has all the tools it needs, such as supporting data and relationship building opportunities, to thrive in new markets.”
    This project was commissioned by Hort Innovation, and developed by market analysts and research consultants Auspex Strategic Advisory and AgInfinity. Hort Innovation will work now with berry sectors to determine levy-funded activities to support trade.
    See a summary of the strategy on the Hort Innovation website.
    For more information on the berry industries, refer to the Horticulture Statistics Handbook and the Strategic Investment Plans for strawberries, raspberries and blackberries. Growers seeking more information should email trade@horticulture.com.au
    What is the Berry Export Summary 2028 and what is its purpose?
    RWSN Collaborations
    Southern Africa Self-supply Study Review of Self-supply and its support services in African countries
    A lady in Zimbabwe proudly shows off her onions - watered from her self-supply well
    © 2015 André Olschewski • Skat
    Project starts: 2015
    Project finished: 2016
    Collaborators & Partners:.
    Project Description
    UNICEF and Skat have collaborated on a).
    Perspectives
    Reach and benefits:
    - Self-supply is practised by millions of rural households in Sub-Sahara Africa as well as in Europe, USA and other areas of the world.
    - Benefits reported from having access to Self-supply water sources include convenience, less time spent for fetching water and access to more and better quality water. In some areas, Self-supply sources offer important added values such as water for productive use, income generation, family safety and improved food security.
    - Sustainability of services from Self-supply is high as there is strong ownership by people investing in own sources.
    - As Self-supply sources are shared sources, many people, including poor and vulnerable households, benefit from investments in Self-supply, often at no costs. This means that Self-supply can be effective in reaching the hard-to-reach.
    - For millions of people in rural areas of Africa, supported Self-supply will be the most cost effective service delivery model to provide access to safe water. This also includes those parts of the population which actually have poor access as they e.g. cannot afford water from communal supplies.
    - However, in areas where external support for Self-supply is lacking, only marginal improvements can usually be achieved, and the quality of services is lower than in areas where a dedicated support effort was made.
    Costs and business model for supported Self-supply
    - In many rural contexts, supported Self-supply is the most cost effective approach for water service delivery. However, as it is not applicable in all contexts, a blended approach combining communal water supply and supported Self-supply models should be followed.
    - Based on a Life Cycle Cost (LCC) analysis of different service delivery approaches, the LCC for communal supplies are about 40 US$/capita served in the study countries, whereas the LCC for supported Self-supply is about 10 U$/capita.
    - In sparsely populated areas, communal supplies (e.g. handpumps) are even more costly (up to 100 U$/capita served) as only few people can be served with one additional unit. Serving all rural people with communal supply is therefore not financially viable.
    - Considering the applicability of Self-supply technologies, in Zambia and Zimbabwe, the cost saving of following a blended approach using both communal supplies and supported Self-supply is almost 50% of the total LCC for reaching 100% of the population by 2030. These cost savings are equivalent to more than 330 million US$ in Zambia and more than 260 million US$ in Zimbabwe.
    Support services needed
    - Supported Self-supply is a service delivery model putting support services in place to improve Self-supply, so it is not about a particular technology.
    - Supported Self-supply is aligned with the Human Rights to Water and Sanitation, which allows a progressive realisation of the universal access to safe water. However, supported Self-supply is not a way to exempt government from its duties: Government has specific roles to play to ensure that everybody will have access to safe water finally.
    - To sustain and to take Self-supply to scale there is need for contextualised support as well as long-term engagement, capacity development at all levels, M&E and technical support, reliable funding and learning and sharing.
    - Interministerial cooperation and champions within government agencies are needed to ensure sustainable embedding and for taking Self-supply further, particularly in remote rural areas.
    - There is no-one-size-fits-all solution for supported Self-supply – for each programme, it needs a contextualized design and follow-up to achieve desired impact.
    - Hygiene promotion, including Household Water Treatment and Safe Storage (HWTS), is highly recommended for any non-piped water supply services, including Self-supply water sources.
    - The huge potential for substantially improving the level of water supply for millions of people in rural areas should be accessed through supported Self-supply. Some countries have endorsed supported Self-supply as service delivery model, such as Zimbabwe or Sierra Leone, and in Ethiopia, Self-supply is now being scaled up at national level.
    More Information
    » Review of Self-supply and its support services in African countries: Synthesis Report).
    What are some of the benefits reported from having access to Self-supply water sources?
    All Android applications categories
    Description
    Coolands for Twitter is a revolutionary twitter client. It has many unique features, gives you the best mobile twitter experience you never imagined before.
    The first unique feature is Real-Time.
    You can’t find any refresh button in this app, because you absolutely don’t need to. Every time you open it, you’ll get the latest tweets and while you’re reading, you’ll get incoming tweets in Real-Time. So if your friend mentioned you, you can reply instantly.
    The second unique feature is Avatar Indicator.
    Avatar Indicator is small avatars showed on the title bar to indicate that you’ve got new message/tweet/mention. Since it’s real-time, you’ll keep getting incoming tweets while you’re reading your older timeline, Avatar-Indicator will let you know who’s tweet you’ve just got, and decide whether to check it out right away.
    The third unique feature is Direct Link
    I think it is obviously the most intuitive and convenient way to open a link. When you want to open a link, just click it in the time line . You can also click a username to open a profile window, click a hash tag to open a search result window. Different kind of links displayed in different colors, you can change it to whatever color you like.
    The fourth unique feature is Smart Bookmark
    Have you ever experienced this scenario? When you are reading your home timeline, the app notified you that you’ve got some new tweets, you click “go-to-top” button to read the newest tweets, and then you want to get back to the previous position to continue your reading. How can you do this? In other twitter clients you have to scroll down all the way to find where you were, a lot of time wasted. But in this app, “go-to-top” button will appear when you scrolling up, click it, you can got to top, read the newest tweets. After that when you scrolling down, Smart Bookmark button will appear at the corner. Click it, you can get back to exactly where you were.
    The fifth unique feature is User Level Notification
    Notification for all your new tweets is meaningless, if you following more than a few users, you’ll get new tweets all the time. What if you only want to be notified when someone you most care about posted a new tweet? In this app, it’s easy. You can change your friend’s notification setting directly in his/her profile screen. You can also set different notification ringtone for your friends respectively. So when you heard a notification ringtone, you’ll know who he/she is without the need to open your phone. And you can manage all the enabled User-Level-Notification settings in one place.
    Also has most of the basic twitter client features, like post/delete tweets, retweet, retweet with comment, reply, quote, send/delete direct message, subscribe/unsubscribe lists, follow/unfollow user, multiple accounts support, append picture with your tweets, mention auto complete, recent search auto complete, conversation view.
    Please give me feedbacks if you have tried it, and I PROMISE to reply all your emails.
    1.30-1.36 update:
    *Ad-Free.
    *3 times faster when launching and loading older tweets.
    *Support unlimited accounts rather than 3 accounts.
    *Support notifications for all accounts rather than only for the current account.
    *Improved mention suggestion feature.
    *Use URL link to do RT with comment, so you can comment more characters.
    *Conversation view, click the orange(you can change the color) username in replied tweet to show conversation view.
    *Support longer tweet, longer tweet will be converted to a picture automatically. You really should try it out your self.
    *Refined tweet composing view, to support longer tweet.
    *Support handle text shared by other app.
    from 103 reviews
    Download Coolands for Twitter
    Free - V1.38 - 298K
    Sorry ...
    This app is no longer available.
    Share this app
    Screenshots
    What are the unique features of the Coolands for Twitter app?
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
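
    As an illustration of what these parameters control, the following is a minimal plain-Python sketch of a multiple-negatives ranking objective. It assumes toy 2-dimensional vectors, and `mnr_loss` is a hypothetical helper written for this card, not the library's code. Each anchor is scored against every positive in the batch with cosine similarity, the scores are multiplied by "scale", and a cross-entropy loss treats the anchor's own positive as the correct class, so the other positives in the batch act as negatives.

```python
import math

def cos_sim(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

def mnr_loss(anchors, positives, scale=20.0):
    """Mean cross-entropy over the batch; positives[i] is the target for anchors[i]."""
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [scale * cos_sim(anchor, p) for p in positives]
        # Numerically stable log-sum-exp over all candidates in the batch
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
        losses.append(log_sum_exp - logits[i])
    return sum(losses) / len(losses)

# Toy batch of two (anchor, positive) pairs
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]   # each positive is close to its own anchor
swapped = [[0.1, 0.9], [0.9, 0.1]]   # each positive is close to the wrong anchor

print(mnr_loss(anchors, aligned))  # small loss
print(mnr_loss(anchors, swapped))  # much larger loss
```

    A larger scale sharpens the softmax over in-batch candidates, so a well-aligned pair is rewarded more strongly and a misaligned pair is penalized more strongly.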
    

Evaluation Dataset

Unnamed Dataset

  • Size: 500 evaluation samples
  • Columns: positive and anchor
  • Approximate statistics based on the first 1000 samples:
              positive         anchor
    type      string           string
    min       188 tokens       10 tokens
    mean      460.02 tokens    18.91 tokens
    max       512 tokens       39 tokens
  • Samples:
    positive (passage text), then anchor (question, last line of each sample)
    Perhaps Not such a Good Idea
    I have found trying to run a blog is very time-consuming, and there are other calls on my time. I think it has been demonstrated that if enough people are unable to self-moderate, the nuggets of interest are swamped by the rubbish. Sadly I agree with Mark Frank's assessment. I had hoped more thread topics would be proposed nothing has been suggested by anyone for a while.
    My personal view is that, considering DaveScot's generally perceived blog persona, I have to admit that he hasn't been (on this site) quite the unmitigated disaster predicted. John Davison, on the other hand has conformed perfectly to predictions, which is a shame, but his choice.
    I am happy to let things run for a while, but would like to hear from anyone who has a suggestion for a thread topic. Post here or in the suggestions thread
    23 comments:
    How about an "ID: show me the research" thread?
    OK Rich, put some meat on the bones and I'll paste it.
    Of course I have. I have no respect for you or your cronies from Psnda's Thumb. What did you expect kudos? What do you want another thread for? No one has even attempted to answer my four challenges yet. You know why? I do. It is because they can't, because everything you and your Darwimpian cronies stand for is a myth, an illusion and a hoax. That's why. You might as well close down this flame pit while you are behind as it isn't going anywhere.
    It is hard to believe isn't it?
    I love it so!
    " "How about an "ID: show me the research" thread? "
    Alan Fox said...
    OK Rich, put some meat on the bones and I'll paste it. "
    Why is there no ID research, even on Dembski's blog?
    Why does ID consist solely of an opinion that some parts of human biology are designed?
    How would IDers actually prove that some parts of human biology are in fact designed?
    Expand a bit on the theme, Wonderpants and I'll start a thread if you like.
    "My personal view is that, considering DaveScot's generally perceived blog persona, I have to admit that he hasn't been (on this site) quite the unmitigated disaster predicted. "
    At a guess, it's because he can't duck or delete awkward subjects. I note from skimming through the threads though that he's been rather selective as to which ones he posts in, namely the ones that don't pose awkward ID questions. ;-)
    Well, we can't torture a confession out of him. As Lenny points out frequently, an absence of an answer is in itself an answer. What about a thread from you, entitled " My awkard questions for DaveScot"?
    I'm not sure how much meat can be put on the bones of a non-existent project, but here's my thought:
    I would really like to hear about actual research projects that can be / are being done. Without knowing of any that are running currently, I'm not sure if it would be a good thread to start, but maybe you could ask for ideas.
    It could be a thread dedicated to lab experiments. If X is designed, we will find Y. Here's how we find Y in the lab. Then we watch for the landslide of X and Y that get suggested and, of course, the methods that actually find these things.
    Why would they start posting it now, though, after years of keeping it secret?
    JAD: I have no respect for you or your cronies from Psnda's Thumb.
    Which raises the obvious question of why you hang around in forums like this. Why not submit your work to a technical journal where real scientists will read it?
    Well, unless and until Wonderpants or Blipey want to expand on it, I have framed a thread along the line suggested.
    I think a good thread would be "Place A Vote For or Against the Banning of Professor Emeritus John Davison".
    I invade the ephemeral meaningless world of cyberdom for amusement on the outside chance that I might find a rational mind once in a while, one like johndarius for example. Mostly I encounter mentally impaired ideologues with IQs in the room temperature range or hostile, rabid, certifiably deranged schizophrenic sociopaths like Spravid Dinger. This particular blog seems to be blessed with both varieties.
    Naturally -
    I love it so!
    I'll try and think of something tomorrow.
    Been watching the footie tonight.
    Props to France for winning aghainst Spain, Alan.
    Ah, the World Cup. Something else Mrs Fox and I disagree on. Yes there would have been a few glum faces at work tomorrow. Now if only France can beat Brazil, and England beat Portugal.
    JAD, you didn't answer my second question: Why not submit your work to a technical journal where real scientists will read it?
    Why would real scientists want to read the nonsensical ramblings of a pseudoscientist?
    Give me a shout if you need some "help."
    Naturally -
    I love it so!
    But JAD won't publish in a scientific journal any longer. According to his second post, we can assume that he visits Nature's website "for amusement on the outside chance that [he] might find a rational mind once in a while, one like a [creationist] for example. Mostly [he] encounters mentally impaired [evolutionary biologists] "
    I can't think of a single great scientist who wouldn't describe himself as a creationist, not one. Can anyone?
    I love it so!
    Democritus, Sagan, Darwin, Edison, Feynman, Curie, just to name a handful.
    Of course, they never managed to publish in Rivista... [snicker]
    Feynman once described scientific discovery as a religious experience. I agree entirely as I have had the same experience. That anyone could describe Darwin as a scientist is unthinkable. I didn't know that about Curie and tend not to accept it without some documentation.
    I love it so!
    What a sad little weasel you are. Feynman was an avowed atheist. Curie was raised Catholic but became an atheist on the death of her mother. Darwin was 100 times the scientist you are.
    Creationism is all but dead among true scientists; critical inquiry is poison to that superstitious twaddle.
    What is the author's personal view on DaveScot's blog persona?
    Age reduction Academic atmosphere Beef tendon bottom Straight buckle low-heel cowhide Lefu shoes Mary Jane shoes Spring and summer Women's shoes 0.73
    ins Chaopai shoes Women's Shoes Academic atmosphere Versatile Graffiti Frenulum gym shoes Harajuku leisure time Hip hop jointly skate shoes
    Air force one Men's shoes Low Gang summer skate shoes student Korean version Versatile leisure time gym shoes female Reflection Little white shoes
    autumn Clover ozweego Daddy shoes Jackson Yi Same men and women Reflection motion Running shoes EE6999
    Retro Britain Square head Frenulum Color matching motion Casual shoes 2021 new pattern Versatile Flat bottom Elastic band Little white shoes female
    Thick bottom British style Small leather shoes Women's shoes 2021 new pattern Big square head Spring and Autumn Lefu Autumn shoes black Single shoes
    U.S.A quality goods Jeffrey Campbell temperament crude high-heeled dollskill Buckles Low top shoes female widow
    quality goods Clover ozweego Black Warrior Dad Running shoes Night Walker Retro Men's Shoes Reflection increase Women's Shoes tide
    Internet celebrity Daddy shoes female 2021 summer new pattern ventilation comfortable leisure time gym shoes Retro Thick bottom increase Single shoes tide
    Sao Fen Paris Daddy shoes Three generations combination increase Thick bottom ins tide Single shoes Women's Shoes leisure time motion track3.0
    Paris Home B Daddy shoes one three generation triple s Thick bottom increase men and women lovers leisure time motion Fashion shoes Dirty shoes
    U.S.A quality goods Jeffrey Campbell temperament crude high-heeled dollskill Buckles Low top shoes female widow
    D1G New products anniversary Graffiti high-heeled shoes Internet celebrity Show Sharp point Fine heel Women's shoes Europe Versatile Retro Women's Shoes
    2021 Autumn and winter new pattern Low Gang Single shoes female genuine leather Flat bottom Frenulum Color matching motion Casual shoes male skate shoes tide
    Little white shoes female Josiny Spring and summer 2021 new pattern Korean version Versatile Leisure fashion ventilation student Flat bottom gym shoes
    European goods Forrest Gump Daddy shoes female tide 2021 autumn new pattern Small pretty waist gym shoes Frenulum Slope heel Single shoes Women's Shoes tide
    【 goods in stock 】 devil sisters Sheep puff Lolita original Halloween Thick bottom Women's Shoes hottie high-heeled Women's Shoes
    Zhou Yangqing Same 2021 Spring and summer new pattern Thick bottom Shoe of sponge cake motion leisure time lovers P family Daddy shoes female ins tide
    20 new pattern Internet celebrity Sharp point Single shoes female high-heeled genuine leather Fine heel Shallow mouth sexy Bridesmaid Women's Shoes Wedding shoes 6cm 10cm
    What type of shoes are mentioned as being suitable for both men and women?
    I just started a new blog on my ultralight gear. My gear list in all it's glory is located on: each item of gear, I'm writing an in-depth review for the item and how we have used it. Would love to get feedback and the site and our gear and/or comments from people on how we can fine tune.Currently my wifes pack is 7.5 lbs base weight, and mine is 10.5 lbs.Thanks!-Brett
    Edited by brettmarl on 09/09/2006 15:59:48 MDT.
    Brett, Your BLOG looks good.You should put the size of your items where their is one such as pants, shoes, jacket etc. There is a golf like "handicap' for anyone that wears larger then size medium or size 9.5 shoe. Sure.I think you might recheck some of your math. Not sure but some totals look low. Don't trust the posted weightof gear, weigh it yourself if you haven't.Why is your pack list so heavy?
    I agree, nice looking blog. Bill is right on listing the sizes, other than that....looks great!
    Brett - nice list, and nice format!(One small typo: it currently says "Cloudburt" for the tent.)
    Edited by slnsf on 09/09/2006 18:08:48 MDT.
    Great site with good info. I'm trying to decide between the GoLite Infinity and Jam and I think after reading your blog, that the Jam should be plenty big.I'm interested to see what's in your first aid kit.Also, any issues with the water purification tablets? I currently use a MSR miniworks pump and I'm looking to lighten up...
    At first I thought you might not be warm enough as I reviewed your North Cascade hike, then I recounted your layers. Very nice site! What I was a little confused about was the opening statement of getting four days of backpacking gear into a pack, yet at the bottom the food for two was estimated for three days. However I now understand that these are not mutually exclusive statements.
    thanks for the feedback.i fixed the cloudburt typo (thanks), and the 4 vs. 3 days. ai also completed all my gear posts - including the innards of my first aid kit and my experiences with the MicroPUR tabs.the weights listed should be the ones that i weighed myself (unless, i've mis-typed in some areas)bill - you say to check my math with "Why is your pack list so heavy?". not sure what you are refering to here.great idea on including the sizes.
    You must login to post.
    MEMBERSHIP IS REQUIRED TO POST: You must be a Forum, Annual or Lifetime Member to post messages in the backpackinglight.com forums.
    SUBSCRIBE NOW »
    What are the base weights of the blogger's and his wife's packs?
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 10
  • per_device_eval_batch_size: 10
  • num_train_epochs: 30
  • warmup_ratio: 0.1
  • fp16: True
  • batch_sampler: no_duplicates
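
These settings, together with the training set size of 9,598 samples, imply a rough training length. The arithmetic below is a sketch under stated assumptions: a single device, one optimizer step per batch (gradient_accumulation_steps is 1), and no batches lost to the no_duplicates sampler. The rounding used for the warmup step count is also an assumption and may differ slightly from what the trainer does internally.

```python
import math

train_samples = 9598   # size of the training dataset
batch_size = 10        # per_device_train_batch_size, assuming a single device
num_epochs = 30        # num_train_epochs
warmup_ratio = 0.1

# The last partial batch is kept (dataloader_drop_last is False), hence ceil
steps_per_epoch = math.ceil(train_samples / batch_size)
total_steps = steps_per_epoch * num_epochs
warmup_steps = round(total_steps * warmup_ratio)  # rounding choice is an assumption

print(steps_per_epoch, total_steps, warmup_steps)  # 960 28800 2880

# Fraction of an epoch completed after 50 optimizer steps
print(round(50 / steps_per_epoch, 4))  # 0.0521
```

The value 0.0521 agrees with the Epoch column reported for step 50 in the training logs, which suggests that 960 steps per epoch is the right figure for this run.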

All Hyperparameters

  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 10
  • per_device_eval_batch_size: 10
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 30
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: True
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • batch_sampler: no_duplicates
  • multi_dataset_batch_sampler: proportional
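
To make the interaction of `learning_rate: 5e-05`, `lr_scheduler_type: linear`, and `warmup_ratio: 0.1` concrete, here is a small sketch of a linear warmup followed by a linear decay. It assumes 28,800 total optimizer steps (the final step in the training log) and the usual shape of a linear schedule with warmup; the helper `linear_lr` is hypothetical and is not taken from the training code.

```python
import math

PEAK_LR = 5e-05       # learning_rate from the list above
WARMUP_RATIO = 0.1    # warmup_ratio from the list above
TOTAL_STEPS = 28800   # assumption: last step reported in the training log


def linear_lr(step, total_steps=TOTAL_STEPS, peak_lr=PEAK_LR, warmup_ratio=WARMUP_RATIO):
    """Learning rate at a given optimizer step under linear warmup and linear decay."""
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Ramp linearly from 0 up to the peak learning rate.
        return peak_lr * step / max(1, warmup_steps)
    # Decay linearly from the peak down to 0 at the final step.
    return peak_lr * max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


warmup_steps = math.ceil(TOTAL_STEPS * WARMUP_RATIO)
```

Under these assumptions the warmup covers the first three epochs of the 30-epoch run, after which the learning rate falls steadily to zero.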

Training Logs

Epoch Step Training Loss Validation Loss
0.0521 50 0.1346 -
0.1042 100 0.1325 0.0550
0.1562 150 0.0818 -
0.2083 200 0.0806 0.0297
0.2604 250 0.065 -
0.3125 300 0.0605 0.0190
0.3646 350 0.0618 -
0.4167 400 0.0609 0.0158
0.4688 450 0.0491 -
0.5208 500 0.0371 0.0137
0.5729 550 0.0435 -
0.625 600 0.0349 0.0144
0.6771 650 0.0459 -
0.7292 700 0.0507 0.0127
0.7812 750 0.028 -
0.8333 800 0.0197 0.0120
0.8854 850 0.0517 -
0.9375 900 0.0412 0.0145
0.9896 950 0.0326 -
1.0417 1000 0.033 0.0122
1.0938 1050 0.0204 -
1.1458 1100 0.0282 0.0105
1.1979 1150 0.0342 -
1.25 1200 0.0217 0.0131
1.3021 1250 0.017 -
1.3542 1300 0.0229 0.0119
1.4062 1350 0.0276 -
1.4583 1400 0.0164 0.0144
1.5104 1450 0.015 -
1.5625 1500 0.0093 0.0115
1.6146 1550 0.0116 -
1.6667 1600 0.0087 0.0127
1.7188 1650 0.0161 -
1.7708 1700 0.0041 0.0126
1.8229 1750 0.0059 -
1.875 1800 0.0155 0.0116
1.9271 1850 0.0048 -
1.9792 1900 0.0054 0.0194
2.0312 1950 0.005 -
2.0833 2000 0.0067 0.0141
2.1354 2050 0.0031 -
2.1875 2100 0.0064 0.0149
2.2396 2150 0.0021 -
2.2917 2200 0.0028 0.0212
2.3438 2250 0.0026 -
2.3958 2300 0.0027 0.0153
2.4479 2350 0.0023 -
2.5 2400 0.0018 0.0215
2.5521 2450 0.0018 -
2.6042 2500 0.0027 0.0214
2.6562 2550 0.0024 -
2.7083 2600 0.0019 0.0155
2.7604 2650 0.0015 -
2.8125 2700 0.0017 0.0123
2.8646 2750 0.0047 -
2.9167 2800 0.0038 0.0176
2.9688 2850 0.0021 -
3.0208 2900 0.0013 0.0150
3.0729 2950 0.001 -
3.125 3000 0.0015 0.0210
3.1771 3050 0.0008 -
3.2292 3100 0.009 0.0229
3.2812 3150 0.0012 -
3.3333 3200 0.0012 0.0215
3.3854 3250 0.0008 -
3.4375 3300 0.0011 0.0225
3.4896 3350 0.001 -
3.5417 3400 0.0006 0.0187
3.5938 3450 0.0006 -
3.6458 3500 0.0005 0.0199
3.6979 3550 0.0008 -
3.75 3600 0.0005 0.0215
3.8021 3650 0.0008 -
3.8542 3700 0.0008 0.0139
3.9062 3750 0.0011 -
3.9583 3800 0.0011 0.0287
4.0104 3850 0.0006 -
4.0625 3900 0.0006 0.0171
4.1146 3950 0.001 -
4.1667 4000 0.001 0.0169
4.2188 4050 0.0006 -
4.2708 4100 0.0006 0.0221
4.3229 4150 0.0006 -
4.375 4200 0.0005 0.0232
4.4271 4250 0.0004 -
4.4792 4300 0.0003 0.0251
4.5312 4350 0.0002 -
4.5833 4400 0.0003 0.0190
4.6354 4450 0.0005 -
4.6875 4500 0.0003 0.0232
4.7396 4550 0.0004 -
4.7917 4600 0.0003 0.0170
4.8438 4650 0.0003 -
4.8958 4700 0.0003 0.0194
4.9479 4750 0.0004 -
5.0 4800 0.0005 0.0305
5.0521 4850 0.0003 -
5.1042 4900 0.0002 0.0213
5.1562 4950 0.0002 -
5.2083 5000 0.0002 0.0201
5.2604 5050 0.0023 -
5.3125 5100 0.0002 0.0230
5.3646 5150 0.0001 -
5.4167 5200 0.0001 0.0237
5.4688 5250 0.0001 -
5.5208 5300 0.0002 0.0244
5.5729 5350 0.0002 -
5.625 5400 0.0001 0.0239
5.6771 5450 0.0001 -
5.7292 5500 0.0001 0.0237
5.7812 5550 0.0001 -
5.8333 5600 0.0001 0.0213
5.8854 5650 0.0001 -
5.9375 5700 0.0001 0.0225
5.9896 5750 0.0001 -
6.0417 5800 0.0001 0.0230
6.0938 5850 0.0001 -
6.1458 5900 0.0001 0.0231
6.1979 5950 0.0001 -
6.25 6000 0.0005 0.0443
6.3021 6050 0.0004 -
6.3542 6100 0.0012 0.0548
6.4062 6150 0.0013 -
6.4583 6200 0.0007 0.0509
6.5104 6250 0.0006 -
6.5625 6300 0.0007 0.0474
6.6146 6350 0.001 -
6.6667 6400 0.0007 0.0326
6.7188 6450 0.0008 -
6.7708 6500 0.0004 0.0275
6.8229 6550 0.0003 -
6.875 6600 0.0009 0.0291
6.9271 6650 0.0011 -
6.9792 6700 0.0024 0.0451
7.0312 6750 0.0008 -
7.0833 6800 0.0005 0.0353
7.1354 6850 0.0005 -
7.1875 6900 0.0012 0.0343
7.2396 6950 0.001 -
7.2917 7000 0.0017 0.0346
7.3438 7050 0.0015 -
7.3958 7100 0.0009 0.0350
7.4479 7150 0.001 -
7.5 7200 0.0009 0.0404
7.5521 7250 0.0015 -
7.6042 7300 0.0017 0.0433
7.6562 7350 0.0026 -
7.7083 7400 0.0004 0.0638
7.7604 7450 0.0009 -
7.8125 7500 0.0004 0.0388
7.8646 7550 0.0005 -
7.9167 7600 0.0006 0.0260
7.9688 7650 0.0004 -
8.0208 7700 0.0006 0.0375
8.0729 7750 0.0004 -
8.125 7800 0.0004 0.0420
8.1771 7850 0.0002 -
8.2292 7900 0.0002 0.0441
8.2812 7950 0.0001 -
8.3333 8000 0.0001 0.0416
8.3854 8050 0.0001 -
8.4375 8100 0.0001 0.0421
8.4896 8150 0.0004 -
8.5417 8200 0.0001 0.0575
8.5938 8250 0.0002 -
8.6458 8300 0.0001 0.0507
8.6979 8350 0.0001 -
8.75 8400 0.0001 0.0451
8.8021 8450 0.0001 -
8.8542 8500 0.0012 0.0387
8.9062 8550 0.0003 -
8.9583 8600 0.0001 0.0386
9.0104 8650 0.0002 -
9.0625 8700 0.0001 0.0393
9.1146 8750 0.0001 -
9.1667 8800 0.0001 0.0379
9.2188 8850 0.0001 -
9.2708 8900 0.0001 0.0348
9.3229 8950 0.0001 -
9.375 9000 0.0001 0.0373
9.4271 9050 0.0001 -
9.4792 9100 0.0001 0.0376
9.5312 9150 0.0001 -
9.5833 9200 0.0001 0.0387
9.6354 9250 0.0 -
9.6875 9300 0.0001 0.0388
9.7396 9350 0.0001 -
9.7917 9400 0.0001 0.0375
9.8438 9450 0.0001 -
9.8958 9500 0.0001 0.0370
9.9479 9550 0.0 -
10.0 9600 0.0 0.0359
10.0521 9650 0.0 -
10.1042 9700 0.0 0.0358
10.1562 9750 0.0 -
10.2083 9800 0.0 0.0368
10.2604 9850 0.0 -
10.3125 9900 0.0 0.0371
10.3646 9950 0.0 -
10.4167 10000 0.0 0.0370
10.4688 10050 0.0 -
10.5208 10100 0.0 0.0376
10.5729 10150 0.0 -
10.625 10200 0.0 0.0380
10.6771 10250 0.0 -
10.7292 10300 0.0 0.0381
10.7812 10350 0.0 -
10.8333 10400 0.0 0.0377
10.8854 10450 0.0 -
10.9375 10500 0.0 0.0376
10.9896 10550 0.0 -
11.0417 10600 0.0 0.0378
11.0938 10650 0.0 -
11.1458 10700 0.0 0.0383
11.1979 10750 0.0 -
11.25 10800 0.0 0.0393
11.3021 10850 0.0 -
11.3542 10900 0.0 0.0394
11.4062 10950 0.0 -
11.4583 11000 0.0 0.0400
11.5104 11050 0.0 -
11.5625 11100 0.0 0.0394
11.6146 11150 0.0 -
11.6667 11200 0.0 0.0393
11.7188 11250 0.0 -
11.7708 11300 0.0 0.0382
11.8229 11350 0.0 -
11.875 11400 0.0 0.0377
11.9271 11450 0.0 -
11.9792 11500 0.0 0.0373
12.0312 11550 0.0 -
12.0833 11600 0.0 0.0372
12.1354 11650 0.0 -
12.1875 11700 0.0 0.0377
12.2396 11750 0.0 -
12.2917 11800 0.0 0.0380
12.3438 11850 0.0 -
12.3958 11900 0.0 0.0381
12.4479 11950 0.0 -
12.5 12000 0.0 0.0388
12.5521 12050 0.0 -
12.6042 12100 0.0 0.0393
12.6562 12150 0.0 -
12.7083 12200 0.0 0.0398
12.7604 12250 0.0 -
12.8125 12300 0.0 0.0384
12.8646 12350 0.0 -
12.9167 12400 0.0 0.0378
12.9688 12450 0.0 -
13.0208 12500 0.0 0.0385
13.0729 12550 0.0 -
13.125 12600 0.0 0.0396
13.1771 12650 0.0 -
13.2292 12700 0.0 0.0391
13.2812 12750 0.0 -
13.3333 12800 0.0 0.0397
13.3854 12850 0.0 -
13.4375 12900 0.0 0.0404
13.4896 12950 0.0 -
13.5417 13000 0.0 0.0406
13.5938 13050 0.0 -
13.6458 13100 0.0 0.0405
13.6979 13150 0.0 -
13.75 13200 0.0 0.0399
13.8021 13250 0.0 -
13.8542 13300 0.0 0.0391
13.9062 13350 0.0 -
13.9583 13400 0.0 0.0389
14.0104 13450 0.0 -
14.0625 13500 0.0 0.0392
14.1146 13550 0.0 -
14.1667 13600 0.0 0.0409
14.2188 13650 0.0 -
14.2708 13700 0.0 0.0409
14.3229 13750 0.0 -
14.375 13800 0.0 0.0418
14.4271 13850 0.0 -
14.4792 13900 0.0 0.0421
14.5312 13950 0.0 -
14.5833 14000 0.0 0.0417
14.6354 14050 0.0 -
14.6875 14100 0.0 0.0427
14.7396 14150 0.0 -
14.7917 14200 0.0 0.0421
14.8438 14250 0.0 -
14.8958 14300 0.0 0.0415
14.9479 14350 0.0 -
15.0 14400 0.0 0.0421
15.0521 14450 0.0 -
15.1042 14500 0.0 0.0431
15.1562 14550 0.0 -
15.2083 14600 0.0 0.0436
15.2604 14650 0.0 -
15.3125 14700 0.0 0.0445
15.3646 14750 0.0 -
15.4167 14800 0.0 0.0443
15.4688 14850 0.0 -
15.5208 14900 0.0 0.0448
15.5729 14950 0.0 -
15.625 15000 0.0 0.0446
15.6771 15050 0.0 -
15.7292 15100 0.0 0.0446
15.7812 15150 0.0 -
15.8333 15200 0.0 0.0436
15.8854 15250 0.0 -
15.9375 15300 0.0 0.0437
15.9896 15350 0.0 -
16.0417 15400 0.0 0.0438
16.0938 15450 0.0 -
16.1458 15500 0.0 0.0454
16.1979 15550 0.0 -
16.25 15600 0.0 0.0457
16.3021 15650 0.0 -
16.3542 15700 0.0 0.0461
16.4062 15750 0.0 -
16.4583 15800 0.0 0.0458
16.5104 15850 0.0 -
16.5625 15900 0.0 0.0462
16.6146 15950 0.0 -
16.6667 16000 0.0 0.0466
16.7188 16050 0.0 -
16.7708 16100 0.0 0.0461
16.8229 16150 0.0 -
16.875 16200 0.0 0.0458
16.9271 16250 0.0 -
16.9792 16300 0.0 0.0458
17.0312 16350 0.0 -
17.0833 16400 0.0 0.0475
17.1354 16450 0.0 -
17.1875 16500 0.0 0.0475
17.2396 16550 0.0 -
17.2917 16600 0.0 0.0487
17.3438 16650 0.0 -
17.3958 16700 0.0 0.0501
17.4479 16750 0.0 -
17.5 16800 0.0 0.0507
17.5521 16850 0.0 -
17.6042 16900 0.0 0.0496
17.6562 16950 0.0 -
17.7083 17000 0.0 0.0493
17.7604 17050 0.0 -
17.8125 17100 0.0 0.0482
17.8646 17150 0.0 -
17.9167 17200 0.0 0.0494
17.9688 17250 0.0 -
18.0208 17300 0.0 0.0488
18.0729 17350 0.0 -
18.125 17400 0.0 0.0503
18.1771 17450 0.0 -
18.2292 17500 0.0 0.0504
18.2812 17550 0.0 -
18.3333 17600 0.0 0.0502
18.3854 17650 0.0 -
18.4375 17700 0.0 0.0496
18.4896 17750 0.0 -
18.5417 17800 0.0 0.0495
18.5938 17850 0.0 -
18.6458 17900 0.0 0.0494
18.6979 17950 0.0 -
18.75 18000 0.0 0.0491
18.8021 18050 0.0 -
18.8542 18100 0.0 0.0462
18.9062 18150 0.0 -
18.9583 18200 0.0 0.0468
19.0104 18250 0.0 -
19.0625 18300 0.0 0.0453
19.1146 18350 0.0 -
19.1667 18400 0.0 0.0481
19.2188 18450 0.0 -
19.2708 18500 0.0 0.0487
19.3229 18550 0.0 -
19.375 18600 0.0 0.0510
19.4271 18650 0.0 -
19.4792 18700 0.0 0.0514
19.5312 18750 0.0 -
19.5833 18800 0.0 0.0518
19.6354 18850 0.0 -
19.6875 18900 0.0 0.0523
19.7396 18950 0.0 -
19.7917 19000 0.0 0.0490
19.8438 19050 0.0 -
19.8958 19100 0.0 0.0493
19.9479 19150 0.0 -
20.0 19200 0.0 0.0488
20.0521 19250 0.0 -
20.1042 19300 0.0 0.0454
20.1562 19350 0.0 -
20.2083 19400 0.0 0.0465
20.2604 19450 0.0 -
20.3125 19500 0.0 0.0494
20.3646 19550 0.0 -
20.4167 19600 0.0 0.0497
20.4688 19650 0.0 -
20.5208 19700 0.0 0.0496
20.5729 19750 0.0 -
20.625 19800 0.0 0.0491
20.6771 19850 0.0 -
20.7292 19900 0.0 0.0491
20.7812 19950 0.0 -
20.8333 20000 0.0 0.0485
20.8854 20050 0.0 -
20.9375 20100 0.0 0.0489
20.9896 20150 0.0 -
21.0417 20200 0.0 0.0496
21.0938 20250 0.0 -
21.1458 20300 0.0 0.0518
21.1979 20350 0.0 -
21.25 20400 0.0 0.0525
21.3021 20450 0.0 -
21.3542 20500 0.0 0.0530
21.4062 20550 0.0 -
21.4583 20600 0.0 0.0511
21.5104 20650 0.0 -
21.5625 20700 0.0 0.0444
21.6146 20750 0.0 -
21.6667 20800 0.0 0.0435
21.7188 20850 0.0 -
21.7708 20900 0.0 0.0439
21.8229 20950 0.0 -
21.875 21000 0.0 0.0446
21.9271 21050 0.0 -
21.9792 21100 0.0 0.0447
22.0312 21150 0.0 -
22.0833 21200 0.0 0.0459
22.1354 21250 0.0 -
22.1875 21300 0.0 0.0482
22.2396 21350 0.0 -
22.2917 21400 0.0 0.0491
22.3438 21450 0.0 -
22.3958 21500 0.0 0.0495
22.4479 21550 0.0 -
22.5 21600 0.0 0.0508
22.5521 21650 0.0 -
22.6042 21700 0.0 0.0504
22.6562 21750 0.0 -
22.7083 21800 0.0 0.0501
22.7604 21850 0.0 -
22.8125 21900 0.0 0.0490
22.8646 21950 0.0 -
22.9167 22000 0.0 0.0487
22.9688 22050 0.0 -
23.0208 22100 0.0 0.0478
23.0729 22150 0.0 -
23.125 22200 0.0 0.0500
23.1771 22250 0.0 -
23.2292 22300 0.0 0.0503
23.2812 22350 0.0 -
23.3333 22400 0.0 0.0510
23.3854 22450 0.0 -
23.4375 22500 0.0 0.0514
23.4896 22550 0.0 -
23.5417 22600 0.0 0.0518
23.5938 22650 0.0 -
23.6458 22700 0.0 0.0521
23.6979 22750 0.0 -
23.75 22800 0.0 0.0529
23.8021 22850 0.0 -
23.8542 22900 0.0 0.0521
23.9062 22950 0.0 -
23.9583 23000 0.0 0.0523
24.0104 23050 0.0 -
24.0625 23100 0.0 0.0516
24.1146 23150 0.0 -
24.1667 23200 0.0 0.0532
24.2188 23250 0.0 -
24.2708 23300 0.0 0.0530
24.3229 23350 0.0 -
24.375 23400 0.0 0.0541
24.4271 23450 0.0 -
24.4792 23500 0.0 0.0543
24.5312 23550 0.0 -
24.5833 23600 0.0 0.0547
24.6354 23650 0.0 -
24.6875 23700 0.0 0.0558
24.7396 23750 0.0 -
24.7917 23800 0.0 0.0549
24.8438 23850 0.0 -
24.8958 23900 0.0 0.0549
24.9479 23950 0.0 -
25.0 24000 0.0 0.0543
25.0521 24050 0.0 -
25.1042 24100 0.0 0.0556
25.1562 24150 0.0 -
25.2083 24200 0.0 0.0572
25.2604 24250 0.0 -
25.3125 24300 0.0 0.0576
25.3646 24350 0.0 -
25.4167 24400 0.0 0.0583
25.4688 24450 0.0 -
25.5208 24500 0.0 0.0574
25.5729 24550 0.0 -
25.625 24600 0.0 0.0581
25.6771 24650 0.0 -
25.7292 24700 0.0 0.0578
25.7812 24750 0.0 -
25.8333 24800 0.0 0.0578
25.8854 24850 0.0 -
25.9375 24900 0.0 0.0576
25.9896 24950 0.0 -
26.0417 25000 0.0 0.0574
26.0938 25050 0.0 -
26.1458 25100 0.0 0.0577
26.1979 25150 0.0 -
26.25 25200 0.0 0.0577
26.3021 25250 0.0 -
26.3542 25300 0.0 0.0582
26.4062 25350 0.0 -
26.4583 25400 0.0 0.0579
26.5104 25450 0.0 -
26.5625 25500 0.0 0.0579
26.6146 25550 0.0 -
26.6667 25600 0.0 0.0577
26.7188 25650 0.0 -
26.7708 25700 0.0 0.0575
26.8229 25750 0.0 -
26.875 25800 0.0 0.0565
26.9271 25850 0.0 -
26.9792 25900 0.0 0.0564
27.0312 25950 0.0 -
27.0833 26000 0.0 0.0570
27.1354 26050 0.0 -
27.1875 26100 0.0 0.0579
27.2396 26150 0.0 -
27.2917 26200 0.0 0.0578
27.3438 26250 0.0 -
27.3958 26300 0.0 0.0577
27.4479 26350 0.0 -
27.5 26400 0.0 0.0583
27.5521 26450 0.0 -
27.6042 26500 0.0 0.0591
27.6562 26550 0.0 -
27.7083 26600 0.0 0.0599
27.7604 26650 0.0 -
27.8125 26700 0.0 0.0596
27.8646 26750 0.0 -
27.9167 26800 0.0 0.0594
27.9688 26850 0.0 -
28.0208 26900 0.0 0.0595
28.0729 26950 0.0 -
28.125 27000 0.0 0.0529
28.1771 27050 0.0 -
28.2292 27100 0.0 0.0537
28.2812 27150 0.0 -
28.3333 27200 0.0 0.0542
28.3854 27250 0.0 -
28.4375 27300 0.0 0.0550
28.4896 27350 0.0 -
28.5417 27400 0.0 0.0555
28.5938 27450 0.0 -
28.6458 27500 0.0 0.0559
28.6979 27550 0.0 -
28.75 27600 0.0 0.0563
28.8021 27650 0.0 -
28.8542 27700 0.0 0.0565
28.9062 27750 0.0 -
28.9583 27800 0.0 0.0565
29.0104 27850 0.0 -
29.0625 27900 0.0 0.0566
29.1146 27950 0.0 -
29.1667 28000 0.0 0.0569
29.2188 28050 0.0 -
29.2708 28100 0.0 0.0569
29.3229 28150 0.0 -
29.375 28200 0.0 0.0572
29.4271 28250 0.0 -
29.4792 28300 0.0 0.0572
29.5312 28350 0.0 -
29.5833 28400 0.0 0.0574
29.6354 28450 0.0 -
29.6875 28500 0.0 0.0576
29.7396 28550 0.0 -
29.7917 28600 0.0 0.0576
29.8438 28650 0.0 -
29.8958 28700 0.0 0.0577
29.9479 28750 0.0 -
30.0 28800 0.0 0.0577
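
A log like this can be scanned for the evaluation step with the lowest loss in the last column, and for the number of steps per epoch. The sketch below uses a handful of rows copied from the table (an excerpt, not the full log), keeping only the epoch, step, and last-column loss; the helper names are hypothetical.

```python
# (epoch, step, last-column loss) rows copied from the table above.
LOG_EXCERPT = [
    (0.1042, 100, 0.0550),
    (1.1458, 1100, 0.0105),
    (5.0, 4800, 0.0305),
    (10.0, 9600, 0.0359),
    (20.0, 19200, 0.0488),
    (30.0, 28800, 0.0577),
]


def best_row(rows):
    """Return the row with the lowest loss value."""
    return min(rows, key=lambda row: row[2])


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the final logged row."""
    epoch, step, _ = rows[-1]
    return round(step / epoch)


best = best_row(LOG_EXCERPT)
per_epoch = steps_per_epoch(LOG_EXCERPT)
```

In this excerpt the lowest value is at step 1100, early in the second epoch, while the final step reports a value several times higher. Because `load_best_model_at_end` is False in the hyperparameters, the trainer would not restore an earlier checkpoint on its own, so comparing checkpoints on a held-out set may be worthwhile.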

Framework Versions

  • Python: 3.10.13
  • Sentence Transformers: 3.0.1
  • Transformers: 4.42.3
  • PyTorch: 2.1.2
  • Accelerate: 0.27.0
  • Datasets: 2.20.0
  • Tokenizers: 0.19.1
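
When reproducing results, it can help to check the local environment against the versions listed above. The following is a minimal sketch using only the standard library; the distribution names in `PINNED` are the usual PyPI names and the helper `version_mismatches` is hypothetical.

```python
from importlib import metadata

# Versions copied from the list above, keyed by assumed PyPI distribution name.
PINNED = {
    "sentence-transformers": "3.0.1",
    "transformers": "4.42.3",
    "torch": "2.1.2",
    "accelerate": "0.27.0",
    "datasets": "2.20.0",
    "tokenizers": "0.19.1",
}


def version_mismatches(pinned):
    """Map each package whose installed version differs to that version (None if absent)."""
    mismatches = {}
    for name, wanted in pinned.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None
        # Ignore local build suffixes such as "+cu121" when comparing.
        if found is None or found.split("+")[0] != wanted:
            mismatches[name] = found
    return mismatches
```

Calling `version_mismatches(PINNED)` returns an empty dict when every listed package matches; otherwise it lists the packages to install or change.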

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply}, 
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

Model size (Safetensors): 109M params, tensor type F32