Daxtra's picture
Add new SentenceTransformer model
b3245d0 verified
metadata
base_model: sentence-transformers/all-MiniLM-L6-v2
library_name: sentence-transformers
metrics:
  - cosine_accuracy@10
  - cosine_precision@10
  - cosine_recall@10
  - cosine_ndcg@10
  - cosine_mrr@10
  - cosine_map@10
pipeline_tag: sentence-similarity
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - generated_from_trainer
  - dataset_size:149248
  - loss:MultipleNegativesRankingLoss
widget:
  - source_sentence: >-
      - Tax Accountant position, requiring a minimum of 5 years of accounting
      experience with 2 years in public accounting preferred.

      - Assist the Director of Finance in managing the general ledger and
      preparing financial reports.

      - Reconcile bank accounts, assist with month-end close, and coordinate
      capital projections.

      - Prepare tax returns, handle partners' capital account reconciliations,
      and coordinate tax workpaper preparation with external tax accountants.

      - Requires an accounting degree.

      - Proficiency in Microsoft Office, especially Excel and PowerPoint, is
      essential.
    sentences:
      - >-
        - Customer service professional with experience in sales, customer
        advising, and inventory management.

        - Experience in client services, client satisfaction, and effective
        communication in a retail sales environment.

        - Holds an Associate degree in Retail Sales and a Bachelor's degree in
        Fashion and Business.

        - Proficient in cross-selling, credit card management, and
        problem-solving in a fast-paced retail setting.

        - Demonstrated skills in time management, teamwork, and meeting
        deadlines.

        - Strong customer service skills and knowledge of global markets.

        - Completed a National Cadet Core program; highly motivated and
        hardworking.
      - >-
        - Experienced Special Events Producer with a strong background in event
        management and production.

        - Currently shadows the satellite team at Globe Cast UK Ltd, learning
        about media distribution.

        - Past roles include Booking Coordinator, managing schedules and
        coordinating with satellite networks.

        - Holds a Bachelor of Arts in Graphic Design and BTEC qualifications in
        Art & Design.

        - Proficient in Adobe Creative Suite and skilled in visual
        merchandising, social media, and 3D modeling.

        - Strong communication, decision-making, and organizational skills.
      - >-
        - Finance professional with experience in accounting, finance, and
        financial analysis roles across multiple industries in India.

        - Recently served as Finance Controller, successfully managed accounting
        functions, including payroll, tax compliance, and financial reporting,
        while reducing vendor payment aging from 120 to 30 days.

        - Improved revenue realization by implementing a close monitoring
        process with the AR team, reducing DSO by 7 days.

        - Holds an Executive Master's in Business Administration and a Master's
        in Accountancy, with certification as a CPA.

        - Strong skills in financial planning, budgeting, cost control, and
        analysis, with experience in tax, accounts receivable, and payable
        management.

        - Fluent in Tamil, Telugu, Hindi, and English; proficient in software
        like QuickBooks, NetSuite, Tableau, and SAP.
  - source_sentence: >-
      - Quality Assurance (QA) role, Entry-level, focusing on Java and MySQL
      skills, with a requirement for a Bachelor's Degree in Information Systems,
      Computer Science, or related fields.

      - Essential skills include a methodical and analytical mindset, hands-on
      software development experience, and basic understanding of software
      development lifecycle and automation.

      - Must be able to handle evolving requirements and collaborate effectively
      with teams to achieve common goals.

      - Experience in financial industry and working with Big Data, SQL, and
      programming languages like Java, Groovy, Perl, Python, JavaScript is
      desired.

      - Requires proficiency in core Java (0-5 scale), with 0 years of
      experience necessary; proficiency in Business Analysis is also required
      (0-5 scale).
    sentences:
      - >-
        - Investment Management Consultant with extensive experience in trading,
        client relations, and risk management, achieving 100% trading accuracy.

        - Manages client portfolios for capital growth aligned with risk
        tolerance, and executes multi-leg option trades.

        - Provides guidance on portfolio margin, intraday trading, and IRA
        rules; adept at using Fidelity and Active Trader Pro.

        - Proficient in fraud prevention, cybersecurity, and resolving complex
        financial requests.

        - Bachelor of Science in Accounting and CPQ certification.

        - Experience includes roles in equities, fixed income, and corporate
        actions.

        - Skills: Excel, PowerPoint, Word, Salesforce, and Microsoft Office
        suite.
      - >-
        - CNC Programmer with experience in CNC milling and part production to
        AS9100 standards using various CNC machines.

        - Responsibilities include CNC programming for first runs, maintenance,
        and housekeeping.

        - Skills: CNC Programming, 5S Methodology, Quality Control, Lean
        Manufacturing, CAD/CAM, Solidworks.

        - Experience with machine parts from various materials.

        - Certified in ISO 9001 and AS9100 standards.

        - Proficient in decision-making, equipment selection, and lean
        manufacturing principles.
      - >-
        - Full Stack Developer with 5 years of experience in Java/J2EE, Spring,
        and Node.js.

        - Developed responsive JEE Web Applications using Java 17, Spring
        Framework, and modern technologies.

        - Expertise in Java 17 features (Lambda, Streams, etc.), Spring
        Framework features (DI, Security, REST, etc.), and Spring Boot
        microservices.

        - Proficient in Spring Boot, Spring Security, Spring REST, and Spring
        Integration.

        - Experience with J2EE, Spring4, Spring Boot, and Java Persistence API
        (JPA) for database operations.

        - Skilled in designing applications with microservices architecture,
        using tools like Docker, Kubernetes, and AWS.

        - Strong background in web development, including HTML5, CSS3, and
        JavaScript, with proficiency in Angular 14 and Angular.js.

        - Excellent in Agile methodologies, with a track record in JIRA for
        defect reporting and collaboration.
  - source_sentence: >-
      - Chiller Technician with at least 5 years of experience in servicing and
      repairing various types of chillers, including air-cooled, water-cooled,
      screw, and centrifugal units.

      - Responsibilities include comprehensive chiller maintenance, repair of
      large RTUs, boiler and mechanical system repairs, and VRF systems
      troubleshooting.

      - Requires EPA Universal Certification and experience with RTUs (20-200+
      tons).

      - Preferred: Manufacturer training and certifications from Trane, Carrier,
      York, Daikin, or Mitsubishi.

      - Must have strong troubleshooting skills, a clean driving record, and
      excellent communication skills.

      - Local candidate with a stable work history is required.
    sentences:
      - >-
        - Delivery and installation specialist with experience in security
        services and account management.

        - Recently promoted to accounts, responsible for collections and
        customer agreements.

        - Previous roles include delivery driver, account manager, and security
        guard, handling inventory and customer interactions.

        - Skills: Microsoft Word, Microsoft Excel, customer service, inventory
        management, and safe handling of deliveries.

        - Educational background: High School Diploma.

        - Additional experience: Cashier, Cook, Auto Detailer, and
        Transportation roles.
      - >-
        - Aspiring Graduate Consultant with a strong background in data
        analytics, visualization, and asset management.

        - Experience as Operations Analyst includes liaising with 200+ customers
        weekly, achieving a 95% resolution rate.

        - Enhanced process efficiency by 25% and improved customer experience
        ratings by 20% over eight months.

        - Skilled in SQL, Python, Excel, Power BI, and Tableau, contributing to
        strategic decision-making and operational efficiencies.

        - Previous role as Business Analyst Manager, managing reporting and
        optimizing strategies using Tableau.

        - Supervised 15 interns and managed a 50:1 freelancer-to-project ratio
        with a 95% completion rate.

        - Holds a Master's Degree in Business Analytics and a Bachelor's Degree
        in Technology Electrical & Electronics Engineering.
      - >-
        - Experienced HVAC Maintenance Supervisor with comprehensive industry
        experience in maintenance, repair, and installation of HVAC systems.

        - Lead HVAC Technician with expertise in building maintenance, heat
        pumps, and water treatment.

        - Proficient in troubleshooting, repair, and maintenance across various
        HVAC components.

        - Certified in Building Management and Plumbing, with knowledge in
        central plant operation.

        - Skilled in high-voltage, mechanical, and office building maintenance.

        - Experience as a Project Coordinator and Maintenance Technician.
  - source_sentence: >-
      - Building Automation Service Technician role for candidates with at least
      three years of experience in HVAC systems in commercial buildings,
      requiring local presence and stable work history.

      - Essential skills include BACnet, DDC controls, ALC, Allerton, Tritium,
      Niagara, and DISTEC.

      - Focus on commercial buildings and local candidates are preferred.
    sentences:
      - >-
        - Senior public affairs professional with extensive experience in global
        client operations and executive search across financial sectors.

        - Directed public affairs at The Parliamentary Review, managing
        production and stakeholder engagement.

        - Expertise in research methodologies, including comprehensive support
        on the Equities Desk for top investment banks and hedge funds.

        - Specialized in mapping CTA funds, impacting Dodd-Frank regulations,
        and developing strategic plans in emerging markets.

        - Bachelor of Arts in Politics, with strong analytical, financial
        management, and stakeholder engagement skills.

        - Previous roles include Public Affairs Director, Researcher, and
        Graduate Search Assistant, with experience in event management and
        profit maximization.
      - >-
        - Full Stack .NET Developer with 10+ years of experience in agile
        development and test-driven development.

        - Proficient in .NET technologies, including .NET 5.0, .NET Core,
        ASP.NET, MVC, Angular, and TypeScript, with experience in UML diagrams
        and design patterns.

        - Skilled in web service development with ASP.NET Web API, cloud
        computing (Azure), and DevOps practices.

        - Experience with AngularJS, React JS, Node JS, Azure Cosmos, and SQL
        Server 2014.

        - Specialized in web services, microservices, and system integration for
        financial systems.

        - Strong background in SQL Server, Azure, and GIT for source code
        management.
      - >-
        - Experienced Project Manager and Systems Specialist II with a strong
        background in control systems installation, managing projects worth $3
        million in the past year.

        - Oversees planning, design, installation, commissioning, and customer
        satisfaction for projects, ensuring regulatory compliance and managing
        $3 million worth of controls work.

        - Skilled in bid management, subcontractor training, progress billing,
        and financial management.

        - Certified in Siemens, Johnson Controls, and Tridium systems; holds CPR
        and MCP certifications.

        - Proficient in Siemens Hardware, HVAC, Planned Maintenance, and network
        technologies (LON, BACNET, MODBUS, Ethernet).

        - Over 4 years of experience with Tridium Niagara AX and various
        software platforms (Microsoft Windows, Microsoft Internet Server, Sage
        Peachtree).
  - source_sentence: >-
      - Senior Business Development Executive with over 3+ years of experience
      in business development or a related field, focusing on expanding client
      businesses and maintaining strong relationships with MSPs and Resellers.

      - Responsibilities include client-facing roles, strategic planning, lead
      generation, CRM management using Pipedrive, market analysis, team
      collaboration, and reporting.

      - Requires proven experience in client-facing roles and proficiency in
      lead generation, closing deals, and using CRM tools.

      - Strategic mindset and excellent communication, negotiation, and
      presentation skills are essential.

      - Bachelor's degree in Business, Marketing, Sales, or a related field.
    sentences:
      - >-
        - Business Development Expert with over 4 years in financial services
        consulting and B2B cold-calling, focusing on market analysis and
        stakeholder relationships.

        - Current Role: Business Development Representative at Warehouse Club,
        achieving a 40% increase in sales and maintaining optimal stock levels.

        - Former Role: Brand Ambassador, leading 300% sales growth through
        persuasive communication and product demonstrations.

        - Skills: Leadership, communication, analytical, and stakeholder
        management.

        - Certifications: ILSSI Lean Six Sigma Green Belt, Business
        Analyst-Transfer Pricing, Taxation, and Financial Advisement.

        - Education: Post-graduate with a Master of Science in Management and
        Bachelor of Commerce.

        - Interests: Project execution, FMCG, and financial services.
      - >-
        - Senior Data/Informatica Engineer with over 15 years in IT,
        specializing in Big Data/Cloud and Data Warehousing.

        - Led IDMC migration projects, improving scalability and reducing
        operational costs.

        - Developed mappings for data integration, using Informatica for ETL
        tasks and Python for advanced transformations.

        - Skilled in Informatica parameter files, cloud transformations, and
        IDMC migration.

        - Expertise in Python, Informatica, Teradata, GCP, BigQuery, and other
        data integration tools.

        - Experience in data migration, performance tuning, and cloud
        integration.

        - Proficient in UNIX/Linux scripting for data processing.

        - Expertise in handling GCP buckets, BQ, HDFS, and HBase.

        - Strong skills in GIT, JIRA, Control M, BitBucket, Bamboo, and
        Informatica CICD.
      - >-
        - Sales and Communication Analyst with experience in international
        relations and media management.

        - Managed Instagram content growth of 574 followers in 5 months at Young
        Amnesty ESPOL.

        - Organized events for African youth on neo-colonialism and
        Pan-Africanism.

        - Master of Arts International Studies; Bachelor of Arts International
        Relations.

        - Skills: Media Strategy, International Law, Problem Solving, and Social
        Media Management.

        - Experience with door-to-door sales, branding, and community outreach.
model-index:
  - name: SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: vac res matcher
          type: vac-res-matcher
        metrics:
          - type: cosine_accuracy@10
            value: 0.37306201550387597
            name: Cosine Accuracy@10
          - type: cosine_precision@10
            value: 0.07616279069767443
            name: Cosine Precision@10
          - type: cosine_recall@10
            value: 0.11529940071242636
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.12058166863177461
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.1943515826873384
            name: Cosine Mrr@10
          - type: cosine_map@10
            value: 0.07246095764156223
            name: Cosine Map@10

SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2

This is a sentence-transformers model finetuned from sentence-transformers/all-MiniLM-L6-v2. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: sentence-transformers/all-MiniLM-L6-v2
  • Maximum Sequence Length: 128 tokens
  • Output Dimensionality: 384 tokens
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 128, 'do_lower_case': False}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("Daxtra/sbert-summaries-minilm-128-batch")
# Run inference
sentences = [
    "- Senior Business Development Executive with over 3+ years of experience in business development or a related field, focusing on expanding client businesses and maintaining strong relationships with MSPs and Resellers.\n- Responsibilities include client-facing roles, strategic planning, lead generation, CRM management using Pipedrive, market analysis, team collaboration, and reporting.\n- Requires proven experience in client-facing roles and proficiency in lead generation, closing deals, and using CRM tools.\n- Strategic mindset and excellent communication, negotiation, and presentation skills are essential.\n- Bachelor's degree in Business, Marketing, Sales, or a related field.",
    '- Business Development Expert with over 4 years in financial services consulting and B2B cold-calling, focusing on market analysis and stakeholder relationships.\n- Current Role: Business Development Representative at Warehouse Club, achieving a 40% increase in sales and maintaining optimal stock levels.\n- Former Role: Brand Ambassador, leading 300% sales growth through persuasive communication and product demonstrations.\n- Skills: Leadership, communication, analytical, and stakeholder management.\n- Certifications: ILSSI Lean Six Sigma Green Belt, Business Analyst-Transfer Pricing, Taxation, and Financial Advisement.\n- Education: Post-graduate with a Master of Science in Management and Bachelor of Commerce.\n- Interests: Project execution, FMCG, and financial services.',
    '- Senior Data/Informatica Engineer with over 15 years in IT, specializing in Big Data/Cloud and Data Warehousing.\n- Led IDMC migration projects, improving scalability and reducing operational costs.\n- Developed mappings for data integration, using Informatica for ETL tasks and Python for advanced transformations.\n- Skilled in Informatica parameter files, cloud transformations, and IDMC migration.\n- Expertise in Python, Informatica, Teradata, GCP, BigQuery, and other data integration tools.\n- Experience in data migration, performance tuning, and cloud integration.\n- Proficient in UNIX/Linux scripting for data processing.\n- Expertise in handling GCP buckets, BQ, HDFS, and HBase.\n- Strong skills in GIT, JIRA, Control M, BitBucket, Bamboo, and Informatica CICD.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@10 0.3731
cosine_precision@10 0.0762
cosine_recall@10 0.1153
cosine_ndcg@10 0.1206
cosine_mrr@10 0.1944
cosine_map@10 0.0725

Training Details

Training Dataset

Unnamed Dataset

  • Size: 149,248 training samples
  • Columns: sentence_0 and sentence_1
  • Approximate statistics based on the first 1000 samples:
    sentence_0 sentence_1
    type string string
    details
    • min: 47 tokens
    • mean: 116.19 tokens
    • max: 128 tokens
    • min: 54 tokens
    • mean: 119.22 tokens
    • max: 128 tokens
  • Samples:
    sentence_0 sentence_1
    - Public Relations Account Executive role for graduates with interest in media relations and corporate PR.
    - Key responsibilities include researching media data, managing media relationships, drafting reports, and coordinating media features.
    - Support client teams, manage social media, and contribute to SEO.
    - Require a 2.1 degree from a leading university, preferably in Economics, Finance, Business, English, or Communications/Media.
    - Strong understanding of financial and professional services industries.
    - Essential skills: excellent writing, trend analysis, integrity, proactive teamwork, and leadership in account support.
    - Previous PR experience is desirable.
    - Experienced Social Media and Productions Manager with a Master’s in Digital Communication and Marketing.
    - Led the creation of a multi-channel social media platform, generating Rs 900K annual revenue for a beauty brand.
    - Managed digital asset creation for Palmolive Color Naturals, overseeing talent acquisition to post-production.
    - Expertise in market segmentation, social media management, and post-production processes.
    - Proficient in Microsoft Office, PowerPoint, Excel, and Adobe Suite; holds GCSE and O Levels in Math and Economics.
    - Bilingual in Urdu and English; advanced knowledge in Advertising, Brand Marketing Strategy, and Public Relations.
    - Fire/Safety Senior Sales Executive, requiring either 2-5 years of experience for Sr. Sales Executive or 5+ years for Account Executive.
    - Responsible for developing sales strategies, managing contractor and end-user relationships, and executing sophisticated deals within established guidelines for fire and life safety in Iowa and Nebraska.
    - Build scope development, develop proposals, interact with customers, and provide value through communication on product and installation risks.
    - Coordinate estimating efforts and manage multiple projects.
    - Develop a market understanding, identify new business opportunities, and position Siemens as a leader.
    - Spend minimum 50% of time in customer-facing activities and travel 20% for training and business development.
    - Required: High School Diploma or GED, 2+ years/5+ years experience, working knowledge of fire and life safety systems, and experience with building codes.
    - Must be 21 years old and hold a valid driver's license.
    - Preferred experience includes selling to contractors, design services, and experience in vertical markets.
    - Experienced banking professional with over 7 years in Payments Coordination and Project Management.
    - Currently seeking a role in Project Management, with strong skills in accurate and timely payment processing, customer service, and system monitoring.
    - Proficient in handling customer inquiries, credit card fraud detection, and documentation processes.
    - Holds a Master's Degree in Business Administration and Management, and a Bachelor's in Business.
    - Certified Associate in Project Management (CAPM) and licensed insurance agent.
    - Fluent in English, French, and proficient in ArcGIS, Microsoft Access, and various banking applications.
    - Quality Assurance (QA) role, Entry-level, focusing on Java and MySQL skills, with a requirement for a Bachelor's Degree in Information Systems, Computer Science, or related fields.
    - Essential skills include a methodical and analytical mindset, hands-on software development experience, and basic understanding of software development lifecycle and automation.
    - Must be able to handle evolving requirements and collaborate effectively with teams to achieve common goals.
    - Experience in financial industry and working with Big Data, SQL, and programming languages like Java, Groovy, Perl, Python, JavaScript is desired.
    - Requires proficiency in core Java (0-5 scale), with 0 years of experience necessary; proficiency in Business Analysis is also required (0-5 scale).
    - Test Automation Engineer with over 6 years of experience in Agile/Scrum environments, skilled in Java and Selenium WebDriver for web-based application testing.
    - Developed test automation frameworks using Maven, JUnit, and Page Object Model design pattern.
    - Proficient in Cucumber BDD features, steps, and runner packages for feature testing, as well as data-driven and cross-browser testing.
    - Executed RESTful API testing with Postman and REST Assured, and database testing using SQL queries and JDBC.
    - Expertise in test planning, test scripting, defect tracking, and test reporting using Jira Xray and other tools.
    - Knowledgeable in Continuous Integration (CI/CD) and mentoring junior QA staff.
    - Proficient in Java, Selenium, Cucumber, Maven, JUnit, Postman, REST Assured, MS Excel, SQL, and JDBC.
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 128
  • per_device_eval_batch_size: 128
  • num_train_epochs: 1
  • multi_dataset_batch_sampler: round_robin

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 128
  • per_device_eval_batch_size: 128
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1
  • num_train_epochs: 1
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.0
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • eval_use_gather_object: False
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: round_robin

Training Logs

Epoch Step Training Loss vac-res-matcher_cosine_map@10
0.0995 116 - 0.0684
0.1990 232 - 0.0699
0.2985 348 - 0.0711
0.3979 464 - 0.0720
0.4288 500 2.6358 -
0.4974 580 - 0.0697
0.5969 696 - 0.0721
0.6964 812 - 0.0714
0.7959 928 - 0.0717
0.8576 1000 2.373 -
0.8954 1044 - 0.0722
0.9949 1160 - 0.0724
1.0 1166 - 0.0725

Framework Versions

  • Python: 3.10.12
  • Sentence Transformers: 3.2.1
  • Transformers: 4.44.2
  • PyTorch: 2.4.1+cu121
  • Accelerate: 0.34.2
  • Datasets: 3.0.1
  • Tokenizers: 0.19.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}