yaniseuranova/setfit-rag-hybrid-search-query-router-test

@@ -10,14 +10,15 @@ tags:
 - text-classification
 - generated_from_setfit_trainer
 widget:
-- text: How does technology impact our daily lives and what benefits can it bring
-    to various activities?
-- text: How do organizations effectively deploy and manage machine learning algorithms
-    to drive business value?
-- text: What are the key considerations for organizing and managing computer lab resources
-    and tracking their status?
-- text: How can batch processing improve the efficiency of data lake operations?
-- text: What is the purpose of setting up a CUPS on a server?
 inference: true
 model-index:
 - name: SetFit with sentence-transformers/all-MiniLM-L6-v2
@@ -31,7 +32,7 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.8947368421052632
       name: Accuracy
 ---
@@ -51,7 +52,7 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Sentence Transformer body:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
 - **Maximum Sequence Length:** 256 tokens
-- **Number of Classes:** 2 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
@@ -63,17 +64,19 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label    | Examples                                                                                                                                                                                                                                      |
-|:---------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| lexical  | <ul><li>"How does Happeo's search AI work to provide answers to user queries?"</li><li>'What are the primary areas of focus in the domain of Data Science and Analysis?'</li><li>'How can one organize a running event in Belgium?'</li></ul> |
-| semantic | <ul><li>'What changes can be made to a channel header?'</li><li>'How can hardware capabilities impact the accuracy of motion and object detections?'</li><li>'Who is responsible for managing guarantees and prolongations?'</li></ul>        |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
-| **all** | 0.8947   |
 ## Uses
@@ -93,7 +96,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("yaniseuranova/setfit-rag-hybrid-search-query-router-test")
 # Run inference
-preds = model("What is the purpose of setting up a CUPS on a server?")
 ```
 <!--
@@ -125,16 +128,18 @@ preds = model("What is the purpose of setting up a CUPS on a server?")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
-| Word count   | 4   | 13.7407 | 28  |
-| Label    | Training Sample Count |
-|:---------|:----------------------|
-| lexical  | 44                    |
-| semantic | 118                   |
 ### Training Hyperparameters
-- batch_size: (32, 32)
-- num_epochs: (1, 1)
 - max_steps: -1
 - sampling_strategy: oversampling
 - body_learning_rate: (2e-05, 1e-05)
@@ -150,20 +155,52 @@ preds = model("What is the purpose of setting up a CUPS on a server?")
 - load_best_model_at_end: True
 ### Training Results
-| Epoch   | Step    | Training Loss | Validation Loss |
-|:-------:|:-------:|:-------------:|:---------------:|
-| 0.0020  | 1       | 0.4064        | -               |
-| 0.0998  | 50      | 0.2177        | -               |
-| 0.1996  | 100     | 0.0437        | -               |
-| 0.2994  | 150     | 0.0057        | -               |
-| 0.3992  | 200     | 0.0034        | -               |
-| 0.4990  | 250     | 0.0009        | -               |
-| 0.5988  | 300     | 0.0009        | -               |
-| 0.6986  | 350     | 0.0007        | -               |
-| 0.7984  | 400     | 0.0007        | -               |
-| 0.8982  | 450     | 0.0009        | -               |
-| 0.9980  | 500     | 0.0005        | -               |
-| **1.0** | **501** | **-**         | **0.1811**      |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions

 - text-classification
 - generated_from_setfit_trainer
 widget:
+- text: What are the key components involved in developing a deep learning model for
+    handwritten digit recognition?
+- text: What is the purpose of the message posted by the CR?
+- text: How can researchers create and maintain public repositories for reproducible
+    research?
+- text: What are the key components involved in developing a deep learning model for
+    handwritten digit recognition?
+- text: How do you prioritize and delegate tasks to ensure efficient collaboration
+    and feedback?
 inference: true
 model-index:
 - name: SetFit with sentence-transformers/all-MiniLM-L6-v2
       split: test
     metrics:
     - type: accuracy
+      value: 0.5
       name: Accuracy
 ---
 - **Sentence Transformer body:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
 - **Maximum Sequence Length:** 256 tokens
+- **Number of Classes:** 4 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label         | Examples                                                                                                                                                                                                                                                                                                                                                                       |
+|:--------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| lexical       | <ul><li>'What are the key considerations when choosing an optimization method for a complex problem?'</li><li>'What are the challenges of being a remote mentor or sponsor?'</li><li>'How do researchers typically obtain information on the ranking of machine learning conferences?'</li></ul>                                                                               |
+| semantic      | <ul><li>'What are common issues that users may encounter when accessing a platform that uses JumpCloud for authentication?'</li><li>'What are the key components involved in developing a deep learning model for handwritten digit recognition?'</li><li>'How can machine learning and data enrichment be used to improve business outcomes in various industries?'</li></ul> |
+| very_semantic | <ul><li>"What are people's opinions on a particular topic?"</li><li>'What are the key considerations when proposing names for a project or initiative?'</li><li>'What are the key considerations for successful collaboration between industry and academia in research and development projects?'</li></ul>                                                                   |
+| very_lexical  | <ul><li>'How can one track and store keys in a Flink operator?'</li><li>'What role do companies like Solvay play in addressing key societal challenges through their business strategies and operations?'</li><li>'What is the purpose of the scoring methodology in determining RAI maturity?'</li></ul>                                                                      |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
+| **all** | 0.5      |
 ## Uses
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("yaniseuranova/setfit-rag-hybrid-search-query-router-test")
 # Run inference
+preds = model("What is the purpose of the message posted by the CR?")
 ```
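The four labels suggest a graded preference between keyword and vector retrieval. As a minimal sketch, assuming a hybrid search backend that accepts a single weight `alpha` in [0, 1] (0 meaning keyword-only, 1 meaning vector-only), a predicted label could be mapped to a weight as below. The `ALPHA_BY_LABEL` values and the `route_query` helper are hypothetical and not part of this repository; the classifier is passed in as any callable that returns a label.

```python
from typing import Callable

# Hypothetical weights (assumption): 0.0 = keyword-only, 1.0 = vector-only retrieval.
ALPHA_BY_LABEL = {
    "very_lexical": 0.0,
    "lexical": 0.25,
    "semantic": 0.75,
    "very_semantic": 1.0,
}


def route_query(query: str, classify: Callable[[str], str], default: float = 0.5) -> float:
    """Return a hybrid-search weight for the query.

    `classify` is any callable mapping a query string to one of the four labels.
    Unknown labels fall back to an even blend.
    """
    label = str(classify(query))
    return ALPHA_BY_LABEL.get(label, default)
```

With the model loaded as in the snippet above, `route_query("What is the purpose of the message posted by the CR?", model)` would return the weight for whichever label the model predicts. Given the reported test accuracy of 0.5, a conservative default for uncertain cases may be worth keeping.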
 <!--
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
+| Word count   | 8   | 14.4138 | 24  |
+| Label         | Training Sample Count |
+|:--------------|:----------------------|
+| lexical       | 32                    |
+| semantic      | 21                    |
+| very_lexical  | 10                    |
+| very_semantic | 24                    |
 ### Training Hyperparameters
+- batch_size: (8, 8)
+- num_epochs: (3, 3)
 - max_steps: -1
 - sampling_strategy: oversampling
 - body_learning_rate: (2e-05, 1e-05)
 - load_best_model_at_end: True
 ### Training Results
+| Epoch   | Step     | Training Loss | Validation Loss |
+|:-------:|:--------:|:-------------:|:---------------:|
+| 0.0015  | 1        | 0.268         | -               |
+| 0.0736  | 50       | 0.2649        | -               |
+| 0.1473  | 100      | 0.3352        | -               |
+| 0.2209  | 150      | 0.2516        | -               |
+| 0.2946  | 200      | 0.2438        | -               |
+| 0.3682  | 250      | 0.1808        | -               |
+| 0.4418  | 300      | 0.2365        | -               |
+| 0.5155  | 350      | 0.1337        | -               |
+| 0.5891  | 400      | 0.2263        | -               |
+| 0.6627  | 450      | 0.1936        | -               |
+| 0.7364  | 500      | 0.0612        | -               |
+| 0.8100  | 550      | 0.1664        | -               |
+| 0.8837  | 600      | 0.0987        | -               |
+| 0.9573  | 650      | 0.0736        | -               |
+| 1.0     | 679      | -             | 0.2288          |
+| 1.0309  | 700      | 0.0568        | -               |
+| 1.1046  | 750      | 0.0765        | -               |
+| 1.1782  | 800      | 0.1193        | -               |
+| 1.2518  | 850      | 0.199         | -               |
+| 1.3255  | 900      | 0.2734        | -               |
+| 1.3991  | 950      | 0.194         | -               |
+| 1.4728  | 1000     | 0.1085        | -               |
+| 1.5464  | 1050     | 0.1496        | -               |
+| 1.6200  | 1100     | 0.1673        | -               |
+| 1.6937  | 1150     | 0.2225        | -               |
+| 1.7673  | 1200     | 0.0503        | -               |
+| 1.8409  | 1250     | 0.1531        | -               |
+| 1.9146  | 1300     | 0.2287        | -               |
+| 1.9882  | 1350     | 0.1187        | -               |
+| **2.0** | **1358** | **-**         | **0.2055**      |
+| 2.0619  | 1400     | 0.0546        | -               |
+| 2.1355  | 1450     | 0.2072        | -               |
+| 2.2091  | 1500     | 0.1208        | -               |
+| 2.2828  | 1550     | 0.0837        | -               |
+| 2.3564  | 1600     | 0.0405        | -               |
+| 2.4300  | 1650     | 0.1334        | -               |
+| 2.5037  | 1700     | 0.1458        | -               |
+| 2.5773  | 1750     | 0.2189        | -               |
+| 2.6510  | 1800     | 0.0561        | -               |
+| 2.7246  | 1850     | 0.1656        | -               |
+| 2.7982  | 1900     | 0.1351        | -               |
+| 2.8719  | 1950     | 0.1826        | -               |
+| 2.9455  | 2000     | 0.1905        | -               |
+| 3.0     | 2037     | -             | 0.2273          |
 * The bold row denotes the saved checkpoint.
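The bold row can be cross-checked against the three evaluation rows in the table. The sketch below assumes the checkpoint is selected by lowest validation loss, which is consistent with `load_best_model_at_end: True` and with the bold row, though the exact selection metric is not stated in this card. The `best_checkpoint` helper is illustrative only.

```python
# (epoch, step, validation_loss) rows copied from the Training Results table.
eval_rows = [
    (1.0, 679, 0.2288),
    (2.0, 1358, 0.2055),
    (3.0, 2037, 0.2273),
]


def best_checkpoint(rows):
    """Return the evaluation row with the lowest validation loss."""
    return min(rows, key=lambda row: row[2])


epoch, step, loss = best_checkpoint(eval_rows)
print(f"checkpoints/step_{step}")  # prints checkpoints/step_1358
```

The result matches the `_name_or_path` value recorded in the updated config.json.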
 ### Framework Versions

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "checkpoints/step_501",
   "architectures": [
     "BertModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_1358",
   "architectures": [
     "BertModel"
   ],

config_setfit.json CHANGED

@@ -2,6 +2,8 @@
   "normalize_embeddings": false,
   "labels": [
     "lexical",
-    "semantic"
   ]
 }

   "normalize_embeddings": false,
   "labels": [
     "lexical",
+    "semantic",
+    "very_lexical",
+    "very_semantic"
   ]
 }
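The `labels` list is ordered, and that order matters when predictions are handled as integer class ids. A short sketch, assuming ids index into the list in the order shown (the JSON below is reconstructed from the hunk and only includes the keys visible in it):

```python
import json

# Post-change config_setfit.json, reconstructed from the keys visible in the diff.
config_text = """
{
  "normalize_embeddings": false,
  "labels": ["lexical", "semantic", "very_lexical", "very_semantic"]
}
"""

config = json.loads(config_text)

# Assumption: integer class ids follow the list order.
id2label = dict(enumerate(config["labels"]))
label2id = {label: idx for idx, label in id2label.items()}

print(id2label)
```

Note that the two new labels were appended, so the ids of `lexical` and `semantic` are unchanged between the old and new configuration.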

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5b08cb8ef3f4175acd6951cd1ea664172bc14585810f21c96ddec7fe51c2a3b8
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:7fac62744a83855a95a3e80c70bf8a4648a3c5a1cd0053760fa1ff330790c771
 size 90864192

model_head.pkl CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9580a3d3e74febc8f840a57e62654f417272cf7d39a382095caa7babcb979f74
-size 3983

 version https://git-lfs.github.com/spec/v1
+oid sha256:a5a2800b0ffabd217138abf7b9e4a3321ce002b79f4c83251f28a4f0a7a58788
+size 13367
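
The model.safetensors and model_head.pkl entries above are Git LFS pointer files: each line is a key followed by a space and a value. A minimal sketch for comparing the old and new head pointers follows; `parse_lfs_pointer` is a hypothetical helper, not part of any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its key/value lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


old_head = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:9580a3d3e74febc8f840a57e62654f417272cf7d39a382095caa7babcb979f74\n"
    "size 3983\n"
)
new_head = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:a5a2800b0ffabd217138abf7b9e4a3321ce002b79f4c83251f28a4f0a7a58788\n"
    "size 13367\n"
)

print(new_head["size"] - old_head["size"])  # prints 9384
```

The head grows by 9384 bytes while model.safetensors keeps its size of 90864192 bytes, which is what one would expect if only the classification head changes shape when moving from two to four classes.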