tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - generated_from_trainer
  - dataset_size:300
  - loss:MatryoshkaLoss
  - loss:MultipleNegativesRankingLoss
base_model: Snowflake/snowflake-arctic-embed-l
widget:
  - source_sentence: How many full-time staff members are employed at 787 Market & Cafe?
    sentences:
      - >-
        STORE ANALYSIS: 787 Market & Cafe (7858724)

        Location: 6105 Memphis Ave, Cleveland


        Store 7858724 - 787 Market & Cafe operates as a Open Store
        Supermarket-Conventional establishment at 6105 Memphis Ave, Cleveland,
        OH 441442252 (FIPS 39-35). Geographically precise at coordinates
        41.4399,-81.7293 (Geocoded to specific address), this location generates
        $2,028,000 in annual sales ($2,000,001 to $4,000,000) from its 2000.0
        square foot space. The operation employs 23 full-time staff across 3
        checkout lanes, yielding a sales density of $1,014.00/sqft. Owned by
        Independent (Family ID: 99999) as part of a 1 Store-location network,
        the store sources inventory through Small Supplier (Supplier ID: 888888,
        Family ID: 88888) to maintain its position in the Grocery sector's
        Supermarket-Conventional segment.
      - >-
        STORE ANALYSIS: 128 Teresa Grocery (729684)

        Location: 128 Audubon Ave, New York


        Store 729684 - 128 Teresa Grocery operates as a Open Store Superette
        establishment at 128 Audubon Ave, New York, NY 100322109 (FIPS 36-61).
        Geographically precise at coordinates 40.8427,-73.9369 (Geocoded to
        specific address), this location generates $1,560,000 in annual sales
        ($1,500,001 to $2,000,000) from its 2000.0 square foot space. The
        operation employs 7 full-time staff across 1 checkout lanes, yielding a
        sales density of $780.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
      - >-
        STORE ANALYSIS: 2 Star Grocery (1818510)

        Location: 1123 Dellwood Ave, Memphis


        Store 1818510 - 2 Star Grocery operates as a Open Store Superette
        establishment at 1123 Dellwood Ave, Memphis, TN 381277761 (FIPS 47-157).
        Geographically precise at coordinates 35.2105,-90.0267 (Geocoded to
        specific address), this location generates $1,664,000 in annual sales
        ($1,500,001 to $2,000,000) from its 1000.0 square foot space. The
        operation employs 7 full-time staff across 2 checkout lanes, yielding a
        sales density of $1,664.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
  - source_sentence: >-
      What is the annual sales figure for the 2031 Webster Food Court
      supermarket?
    sentences:
      - >-
        STORE ANALYSIS: 2031 Webster Food Court (2192571)

        Location: 2031 Webster Ave, Bronx


        Store 2192571 - 2031 Webster Food Court operates as a Open Store
        Supermarket-Conventional establishment at 2031 Webster Ave, Bronx, NY
        104572411 (FIPS 36-5). Geographically precise at coordinates
        40.8509,-73.8992 (Geocoded to specific address), this location generates
        $2,028,000 in annual sales ($2,000,001 to $4,000,000) from its 1000.0
        square foot space. The operation employs 14 full-time staff across 1
        checkout lanes, yielding a sales density of $2,028.00/sqft. Owned by
        Independent (Family ID: 99999) as part of a 1 Store-location network,
        the store sources inventory through Small Supplier (Supplier ID: 888888,
        Family ID: 88888) to maintain its position in the Grocery sector's
        Supermarket-Conventional segment.
      - >-
        STORE ANALYSIS: 4M Foods (1483149)

        Location: 6349 Macarthur Blvd, Oakland


        Store 1483149 - 4M Foods operates as a Open Store Superette
        establishment at 6349 Macarthur Blvd, Oakland, CA 946051635 (FIPS 6-1).
        Geographically precise at coordinates 37.7741,-122.1801 (Geocoded to
        specific address), this location generates $1,560,000 in annual sales
        ($1,500,001 to $2,000,000) from its 2000.0 square foot space. The
        operation employs 7 full-time staff across 2 checkout lanes, yielding a
        sales density of $780.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
      - >-
        STORE ANALYSIS: 16th Food Max (701011)

        Location: 9901 N 16th St, Tampa


        Store 701011 - 16th Food Max operates as a Open Store Superette
        establishment at 9901 N 16th St, Tampa, FL 336128233 (FIPS 12-57).
        Geographically precise at coordinates 28.0393,-82.4415 (Geocoded to
        specific address), this location generates $1,196,000 in annual sales
        ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The
        operation employs 6 full-time staff across 2 checkout lanes, yielding a
        sales density of $1,196.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
  - source_sentence: >-
      What is the annual sales figure for 103 Deli & Grocery located at 148 E
      103rd St, New York?
    sentences:
      - >-
        STORE ANALYSIS: 103 Deli & Grocery (970062)

        Location: 148 E 103rd St, New York


        Store 970062 - 103 Deli & Grocery operates as a Open Store Superette
        establishment at 148 E 103rd St, New York, NY 100295334 (FIPS 36-61).
        Geographically precise at coordinates 40.7902,-73.9476 (Geocoded to
        specific address), this location generates $1,352,000 in annual sales
        ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The
        operation employs 6 full-time staff across 1 checkout lanes, yielding a
        sales density of $1,352.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
      - >-
        STORE ANALYSIS: 1158 Grocery & Deli (1666158)

        Location: 1158 Saint Lawrence Ave, Bronx


        Store 1666158 - 1158 Grocery & Deli operates as a Open Store Superette
        establishment at 1158 Saint Lawrence Ave, Bronx, NY 104724612 (FIPS
        36-5). Geographically precise at coordinates 40.8295,-73.8666 (Geocoded
        to specific address), this location generates $1,352,000 in annual sales
        ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The
        operation employs 6 full-time staff across 2 checkout lanes, yielding a
        sales density of $1,352.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
      - >-
        STORE ANALYSIS: A & A Grocery (7345282)

        Location: 6776 Biggers Reyno Rd, Reyno


        Store 7345282 - A & A Grocery operates as a Open Store
        Supermarket-Conventional establishment at 6776 Biggers Reyno Rd, Reyno,
        AR 72462 (FIPS 5-121). Geographically precise at coordinates
        36.3629,-90.7534 (Geocoded to specific address), this location generates
        $4,160,000 in annual sales ($4,000,001 to $6,000,000) from its 2000.0
        square foot space. The operation employs 16 full-time staff across 2
        checkout lanes, yielding a sales density of $2,080.00/sqft. Owned by
        Independent (Family ID: 99999) as part of a 1 Store-location network,
        the store sources inventory through Small Supplier (Supplier ID: 888888,
        Family ID: 88888) to maintain its position in the Grocery sector's
        Supermarket-Conventional segment.
  - source_sentence: What is the annual sales figure for 12th St Mike Gourmet Deli?
    sentences:
      - >-
        STORE ANALYSIS: 8 Brothers Grocery Store (1639326)

        Location: 2120 N 29th St, Philadelphia


        Store 1639326 - 8 Brothers Grocery Store operates as a Open Store
        Superette establishment at 2120 N 29th St, Philadelphia, PA 191211234
        (FIPS 42-101). Geographically precise at coordinates 39.9886,-75.1805
        (Geocoded to specific address), this location generates $1,820,000 in
        annual sales ($1,500,001 to $2,000,000) from its 3000.0 square foot
        space. The operation employs 9 full-time staff across 1 checkout lanes,
        yielding a sales density of $606.67/sqft. Owned by Independent (Family
        ID: 99999) as part of a 1 Store-location network, the store sources
        inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888)
        to maintain its position in the Grocery sector's Superette segment.
      - >-
        STORE ANALYSIS: 12th St Mike Gourmet Deli (879406)

        Location: 1203 40th Ave, Long Island City


        Store 879406 - 12th St Mike Gourmet Deli operates as a Open Store
        Superette establishment at 1203 40th Ave, Long Island City, NY 111016107
        (FIPS 36-81). Geographically precise at coordinates 40.7561,-73.9425
        (Geocoded to specific address), this location generates $1,508,000 in
        annual sales ($1,500,001 to $2,000,000) from its 1000.0 square foot
        space. The operation employs 6 full-time staff across 1 checkout lanes,
        yielding a sales density of $1,508.00/sqft. Owned by Independent (Family
        ID: 99999) as part of a 1 Store-location network, the store sources
        inventory through Harold Levinson Associates (Supplier ID: 195994,
        Family ID: 2213) to maintain its position in the Grocery sector's
        Superette segment.
      - >-
        STORE ANALYSIS: A & B Naturals (5551436)

        Location: 101 Cottage St, Bar Harbor


        Store 5551436 - A & B Naturals operates as a Open Store
        Supermarket-Natural/Gourmet Foods establishment at 101 Cottage St, Bar
        Harbor, ME 46091442 (FIPS 23-9). Geographically precise at coordinates
        44.389,-68.2121 (Geocoded to specific address), this location generates
        $3,640,000 in annual sales ($2,000,001 to $4,000,000) from its 3000.0
        square foot space. The operation employs 15 full-time staff across 2
        checkout lanes, yielding a sales density of $1,213.33/sqft. Owned by
        Independent (Family ID: 99999) as part of a 1 Store-location network,
        the store sources inventory through United Natural Foods/Dist Ctr
        (Supplier ID: 74258, Family ID: 10540) to maintain its position in the
        Grocery sector's Supermarket-Natural/Gourmet Foods segment.
  - source_sentence: How many full-time staff members are employed at the 54 Royal Market?
    sentences:
      - >-
        STORE ANALYSIS: 114th Gourmet Deli & Grill (805951)

        Location: 11321 Jamaica Ave, Richmond Hill


        Store 805951 - 114th Gourmet Deli & Grill operates as a Open Store
        Superette establishment at 11321 Jamaica Ave, Richmond Hill, NY
        114182441 (FIPS 36-81). Geographically precise at coordinates
        40.6983,-73.835 (Geocoded to specific address), this location generates
        $1,820,000 in annual sales ($1,500,001 to $2,000,000) from its 1000.0
        square foot space. The operation employs 11 full-time staff across 1
        checkout lanes, yielding a sales density of $1,820.00/sqft. Owned by
        Independent (Family ID: 99999) as part of a 1 Store-location network,
        the store sources inventory through Small Supplier (Supplier ID: 888888,
        Family ID: 88888) to maintain its position in the Grocery sector's
        Superette segment.
      - >-
        STORE ANALYSIS: 54 Royal Market (728885)

        Location: 817 9th Ave, New York


        Store 728885 - 54 Royal Market operates as a Open Store Superette
        establishment at 817 9th Ave, New York, NY 100194401 (FIPS 36-61).
        Geographically precise at coordinates 40.7662,-73.9872 (Geocoded to
        specific address), this location generates $1,768,000 in annual sales
        ($1,500,001 to $2,000,000) from its 2000.0 square foot space. The
        operation employs 11 full-time staff across 3 checkout lanes, yielding a
        sales density of $884.00/sqft. Owned by Independent (Family ID: 99999)
        as part of a 1 Store-location network, the store sources inventory
        through Small Supplier (Supplier ID: 888888, Family ID: 88888) to
        maintain its position in the Grocery sector's Superette segment.
      - >-
        STORE ANALYSIS: 1683 Jimmy Deli Grocery (1816263)

        Location: 1683 Woodbine St, Ridgewood


        Store 1816263 - 1683 Jimmy Deli Grocery operates as a Open Store
        Superette establishment at 1683 Woodbine St, Ridgewood, NY 113853546
        (FIPS 36-81). Geographically precise at coordinates 40.7012,-73.9083
        (Geocoded to specific address), this location generates $1,456,000 in
        annual sales ($1,000,001 to $1,500,000) from its 1000.0 square foot
        space. The operation employs 8 full-time staff across 2 checkout lanes,
        yielding a sales density of $1,456.00/sqft. Owned by Independent (Family
        ID: 99999) as part of a 1 Store-location network, the store sources
        inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888)
        to maintain its position in the Grocery sector's Superette segment.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
  - cosine_accuracy@1
  - cosine_accuracy@3
  - cosine_accuracy@5
  - cosine_accuracy@10
  - cosine_precision@1
  - cosine_precision@3
  - cosine_precision@5
  - cosine_precision@10
  - cosine_recall@1
  - cosine_recall@3
  - cosine_recall@5
  - cosine_recall@10
  - cosine_ndcg@10
  - cosine_mrr@10
  - cosine_map@100
model-index:
  - name: SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: Unknown
          type: unknown
        metrics:
          - type: cosine_accuracy@1
            value: 0.85
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.93
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.96
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.98
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.85
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.30999999999999994
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.19199999999999995
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.09799999999999998
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.85
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.93
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.96
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.98
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.9172332496525142
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.8967619047619046
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.8976413373860181
            name: Cosine Map@100

SentenceTransformer based on Snowflake/snowflake-arctic-embed-l

This is a sentence-transformers model fine-tuned from Snowflake/snowflake-arctic-embed-l. It maps sentences and paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: Snowflake/snowflake-arctic-embed-l
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 1024 dimensions
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
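
As a rough illustration of what modules (1) and (2) in the architecture above do, the sketch below keeps the first ([CLS]) token vector as the sentence representation and scales it to unit length. The function names and the 4-dimensional toy input are assumptions made for illustration; the real model works on 1024-dimensional BertModel outputs.

```python
import math

def cls_pool(token_embeddings):
    """Return the first token's vector ([CLS]), mirroring pooling_mode_cls_token."""
    return list(token_embeddings[0])

def l2_normalize(vector):
    """Scale a vector to unit Euclidean length (the role of Normalize())."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0.0:
        return list(vector)  # leave an all-zero vector unchanged
    return [x / norm for x in vector]

# Toy "transformer output": 3 tokens, 4 dimensions each (hypothetical values).
tokens = [
    [3.0, 0.0, 4.0, 0.0],  # [CLS]
    [1.0, 1.0, 1.0, 1.0],
    [0.5, 2.0, 0.0, 1.0],
]
sentence_vector = l2_normalize(cls_pool(tokens))
print(sentence_vector)
```

Because the output is unit length, the cosine similarity between two such vectors equals their dot product.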

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("philocifer/banner-flip-arctic-embed-l")
# Run inference
sentences = [
    'How many full-time staff members are employed at the 54 Royal Market?',
    "STORE ANALYSIS: 54 Royal Market (728885)\nLocation: 817 9th Ave, New York\n\nStore 728885 - 54 Royal Market operates as a Open Store Superette establishment at 817 9th Ave, New York, NY 100194401 (FIPS 36-61). Geographically precise at coordinates 40.7662,-73.9872 (Geocoded to specific address), this location generates $1,768,000 in annual sales ($1,500,001 to $2,000,000) from its 2000.0 square foot space. The operation employs 11 full-time staff across 3 checkout lanes, yielding a sales density of $884.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector's Superette segment.",
    "STORE ANALYSIS: 1683 Jimmy Deli Grocery (1816263)\nLocation: 1683 Woodbine St, Ridgewood\n\nStore 1816263 - 1683 Jimmy Deli Grocery operates as a Open Store Superette establishment at 1683 Woodbine St, Ridgewood, NY 113853546 (FIPS 36-81). Geographically precise at coordinates 40.7012,-73.9083 (Geocoded to specific address), this location generates $1,456,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The operation employs 8 full-time staff across 2 checkout lanes, yielding a sales density of $1,456.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector's Superette segment.",
]
# Encode the three texts; the result has one 1024-dimensional row per input
embeddings = model.encode(sentences)
print(embeddings.shape)
# (3, 1024)

# Get the pairwise cosine similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# torch.Size([3, 3])
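
Because training used MatryoshkaLoss with dimensions 768, 512, 256, 128, and 64 (see Training Details), shorter prefixes of each embedding may also be usable for retrieval. The following is a minimal sketch that is not part of the original card: it truncates a vector to its first k values and re-normalizes it before comparing. The helper names and the 8-dimensional toy vectors are hypothetical stand-ins for real model output.

```python
import math

def truncate_and_normalize(embedding, dim):
    """Keep the first `dim` values and rescale them to unit length."""
    prefix = embedding[:dim]
    norm = math.sqrt(sum(x * x for x in prefix))
    return [x / norm for x in prefix] if norm > 0 else list(prefix)

def cosine(a, b):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy 8-dimensional "embeddings" standing in for the 1024-dimensional output.
query = [0.9, 0.1, 0.3, 0.0, 0.2, 0.1, 0.0, 0.1]
passage = [0.8, 0.2, 0.4, 0.1, 0.0, 0.3, 0.1, 0.0]

for dim in (8, 4, 2):
    q = truncate_and_normalize(query, dim)
    p = truncate_and_normalize(passage, dim)
    print(dim, round(cosine(q, p), 4))
```

In Sentence Transformers itself, passing truncate_dim when constructing SentenceTransformer is one way to obtain truncated embeddings directly. How much retrieval quality is retained at each size is not reported in this card and would need to be measured.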

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@1 0.85
cosine_accuracy@3 0.93
cosine_accuracy@5 0.96
cosine_accuracy@10 0.98
cosine_precision@1 0.85
cosine_precision@3 0.31
cosine_precision@5 0.192
cosine_precision@10 0.098
cosine_recall@1 0.85
cosine_recall@3 0.93
cosine_recall@5 0.96
cosine_recall@10 0.98
cosine_ndcg@10 0.9172
cosine_mrr@10 0.8968
cosine_map@100 0.8976
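
The values above are consistent with one relevant passage per query: precision@k equals accuracy@k divided by k (for example 0.93 / 3 = 0.31), and recall@k equals accuracy@k. The sketch below shows how these quantities can be computed from ranked results. It is an illustration under that assumption, with hypothetical document ids, not the evaluator used to produce the table.

```python
def ir_metrics(ranked_ids, relevant_ids, k):
    """Compute accuracy@k, precision@k, recall@k and MRR@k, averaged over queries.

    ranked_ids:   one ranked list of document ids per query (best first)
    relevant_ids: one set of relevant document ids per query
    """
    n = len(ranked_ids)
    accuracy = precision = recall = mrr = 0.0
    for ranking, relevant in zip(ranked_ids, relevant_ids):
        top_k = ranking[:k]
        hits = [doc for doc in top_k if doc in relevant]
        accuracy += 1.0 if hits else 0.0
        precision += len(hits) / k
        recall += len(hits) / len(relevant)
        # Reciprocal rank of the first relevant document, 0 if none in top k.
        for rank, doc in enumerate(top_k, start=1):
            if doc in relevant:
                mrr += 1.0 / rank
                break
    return {
        "accuracy": accuracy / n,
        "precision": precision / n,
        "recall": recall / n,
        "mrr": mrr / n,
    }

# Hypothetical toy run: 4 queries, one relevant passage each.
ranked = [["a", "b", "c"], ["x", "b", "y"], ["c", "a", "z"], ["q", "r", "s"]]
relevant = [{"a"}, {"b"}, {"z"}, {"d"}]
print(ir_metrics(ranked, relevant, k=3))
```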

Training Details

Training Dataset

Unnamed Dataset

  • Size: 300 training samples
  • Columns: sentence_0 and sentence_1
  • Approximate statistics based on the first 300 samples:
    sentence_0 (type: string): min 13 tokens, mean 19.23 tokens, max 29 tokens
    sentence_1 (type: string): min 200 tokens, mean 215.01 tokens, max 232 tokens
  • Samples:
    sentence_0 sentence_1
    sentence_0: How many full-time staff members are employed at 3 Rivers Grocery Market?
    sentence_1: STORE ANALYSIS: 3 Rivers Grocery Market (432489)
    Location: 9400 US Highway 60 W, Kevil

    Store 432489 - 3 Rivers Grocery Market operates as a Open Store Supermarket-Conventional establishment at 9400 US Highway 60 W, Kevil, KY 420539678 (FIPS 21-145). Geographically precise at coordinates 37.0624,-88.8028 (Geocoded to specific address), this location generates $4,160,000 in annual sales ($4,000,001 to $6,000,000) from its 13000.0 square foot space. The operation employs 18 full-time staff across 4 checkout lanes, yielding a sales density of $320.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Assoc Wholesale/Nashville Div (Supplier ID: 12115, Family ID: 4110) to maintain its position in the Grocery sector's Supermarket-Conventional segment.
    sentence_0: How many full-time staff members are employed at the 28th Street Supermarket?
    sentence_1: STORE ANALYSIS: 28th Street Supermarket (737932)
    Location: 2747 Cedar Ave, Cleveland

    Store 737932 - 28th Street Supermarket operates as a Open Store Superette establishment at 2747 Cedar Ave, Cleveland, OH 441152908 (FIPS 39-35). Geographically precise at coordinates 41.4988,-81.6687 (Geocoded to specific address), this location generates $1,404,000 in annual sales ($1,000,001 to $1,500,000) from its 1000.0 square foot space. The operation employs 4 full-time staff across 3 checkout lanes, yielding a sales density of $1,404.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through H T Hackney Co/Dist Ctr (Supplier ID: 36166, Family ID: 41880) to maintain its position in the Grocery sector's Superette segment.
    sentence_0: How many full-time staff members are employed at the 4th Street Market?
    sentence_1: STORE ANALYSIS: 4th Street Market (772013)
    Location: 301 4th St, Richmond

    Store 772013 - 4th Street Market operates as a Open Store Superette establishment at 301 4th St, Richmond, CA 948013001 (FIPS 6-13). Geographically precise at coordinates 37.9362,-122.3657 (Geocoded to specific address), this location generates $1,560,000 in annual sales ($1,500,001 to $2,000,000) from its 1000.0 square foot space. The operation employs 7 full-time staff across 2 checkout lanes, yielding a sales density of $1,560.00/sqft. Owned by Independent (Family ID: 99999) as part of a 1 Store-location network, the store sources inventory through Small Supplier (Supplier ID: 888888, Family ID: 88888) to maintain its position in the Grocery sector's Superette segment.
  • Loss: MatryoshkaLoss with these parameters:
    {
        "loss": "MultipleNegativesRankingLoss",
        "matryoshka_dims": [
            768,
            512,
            256,
            128,
            64
        ],
        "matryoshka_weights": [
            1,
            1,
            1,
            1,
            1
        ],
        "n_dims_per_step": -1
    }
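
To make the configuration above concrete, here is a rough pure-Python sketch of the idea, not the library's implementation. MultipleNegativesRankingLoss treats each (sentence_0, sentence_1) pair in a batch as a positive and the other sentence_1 entries as negatives, using cross-entropy over scaled cosine similarities. MatryoshkaLoss applies that base loss to embeddings truncated to each listed dimension and adds the results using the given weights. The scale of 20, the toy dimensions (4 and 2 instead of 768 down to 64), and the example vectors are assumptions.

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def mnr_loss(anchors, positives, scale=20.0):
    """In-batch negatives: positive i is the cross-entropy target for anchor i."""
    total = 0.0
    for i, anchor in enumerate(anchors):
        scores = [scale * cosine(anchor, pos) for pos in positives]
        max_score = max(scores)  # subtract the max for numerical stability
        log_sum_exp = max_score + math.log(sum(math.exp(s - max_score) for s in scores))
        total += log_sum_exp - scores[i]
    return total / len(anchors)

def matryoshka_loss(anchors, positives, dims, weights):
    """Weighted sum of the base loss over prefix-truncated embeddings."""
    loss = 0.0
    for dim, weight in zip(dims, weights):
        truncated_anchors = [a[:dim] for a in anchors]
        truncated_positives = [p[:dim] for p in positives]
        loss += weight * mnr_loss(truncated_anchors, truncated_positives)
    return loss

# Toy batch of 2 pairs with 4-dimensional embeddings (hypothetical values).
anchors = [[1.0, 0.0, 0.2, 0.1], [0.0, 1.0, 0.1, 0.3]]
positives = [[0.9, 0.1, 0.3, 0.0], [0.1, 0.8, 0.0, 0.4]]
print(matryoshka_loss(anchors, positives, dims=[4, 2], weights=[1, 1]))
```

Note that the listed matryoshka_dims stop at 768, while the model outputs 1024 dimensions.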
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 10
  • per_device_eval_batch_size: 10
  • num_train_epochs: 10
  • multi_dataset_batch_sampler: round_robin
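
With 300 training samples, a per-device batch size of 10, and 10 epochs, the step counts in the Training Logs can be reproduced with simple arithmetic. The sketch below assumes a single device and uses the gradient_accumulation_steps and dataloader_drop_last values listed under All Hyperparameters; the function name is hypothetical.

```python
import math

def training_steps(num_samples, batch_size, epochs, grad_accum=1, drop_last=False):
    """Return (optimizer steps per epoch, total optimizer steps) on one device."""
    if drop_last:
        batches = num_samples // batch_size
    else:
        batches = math.ceil(num_samples / batch_size)
    steps_per_epoch = max(batches // grad_accum, 1)
    return steps_per_epoch, steps_per_epoch * epochs

steps_per_epoch, total_steps = training_steps(300, 10, 10)
print(steps_per_epoch, total_steps)  # 30 300
```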

All Hyperparameters

  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 10
  • per_device_eval_batch_size: 10
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1
  • num_train_epochs: 10
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.0
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • eval_use_gather_object: False
  • average_tokens_across_devices: False
  • prompts: None
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: round_robin
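
The combination of learning_rate 5e-05, lr_scheduler_type linear, and zero warmup implies a learning rate that falls in a straight line from its initial value to zero over training. The sketch below illustrates that shape; it assumes 300 total steps (taken from the Training Logs) and is a simplified stand-in for the scheduler, not the trainer's own code.

```python
def linear_lr(step, base_lr=5e-05, total_steps=300, warmup_steps=0):
    """Learning rate at a given optimizer step for linear warmup then linear decay."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)

for step in (0, 150, 300):
    print(step, linear_lr(step))
```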

Training Logs

Epoch Step cosine_ndcg@10
1.0 30 0.9111
1.6667 50 0.9106
2.0 60 0.9058
3.0 90 0.9149
3.3333 100 0.9199
4.0 120 0.9185
5.0 150 0.9208
6.0 180 0.9172
6.6667 200 0.9172
7.0 210 0.9172
8.0 240 0.9172
8.3333 250 0.9172
9.0 270 0.9172
10.0 300 0.9172
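
The logged scores peak at step 150 (epoch 5.0) and then stay at 0.9172 from step 180 onward. Since load_best_model_at_end is False in the hyperparameters, the reported evaluation value of 0.9172 matches the final step rather than the peak. The short sketch below, with the log rows copied from the table above, finds the best logged step and its gap to the final score.

```python
# (epoch, step, cosine_ndcg@10) rows copied from the Training Logs table.
log_rows = [
    (1.0, 30, 0.9111),
    (1.6667, 50, 0.9106),
    (2.0, 60, 0.9058),
    (3.0, 90, 0.9149),
    (3.3333, 100, 0.9199),
    (4.0, 120, 0.9185),
    (5.0, 150, 0.9208),
    (6.0, 180, 0.9172),
    (6.6667, 200, 0.9172),
    (7.0, 210, 0.9172),
    (8.0, 240, 0.9172),
    (8.3333, 250, 0.9172),
    (9.0, 270, 0.9172),
    (10.0, 300, 0.9172),
]

best_epoch, best_step, best_score = max(log_rows, key=lambda row: row[2])
final_score = log_rows[-1][2]
print(best_epoch, best_step, best_score, round(best_score - final_score, 4))
```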

Framework Versions

  • Python: 3.11.11
  • Sentence Transformers: 3.4.1
  • Transformers: 4.48.3
  • PyTorch: 2.5.1+cu124
  • Accelerate: 1.3.0
  • Datasets: 3.3.2
  • Tokenizers: 0.21.0

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MatryoshkaLoss

@misc{kusupati2024matryoshka,
    title={Matryoshka Representation Learning},
    author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
    year={2024},
    eprint={2205.13147},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}