louisbrulenaudet/lemone-router-m

louisbrulenaudet committed on Oct 26

Commit 714ed93 • 1 Parent(s): 0d1959e

Update README.md

Files changed (1)

README.md +28 -0

@@ -34,6 +34,34 @@ datasets:
 # Lemone-Router: A Series of Fine-Tuned Classification Models for French Taxation
+Lemone-router is a series of classification models designed to support a multi-agent system covering the different branches of tax law. The models are trained on a corpus of 49k lines that combines authority documents with synthetic questions generated by GPT-4 Turbo and Llama 3.1 70B, the latter further refined through evol-instruction tuning and manual curation. They rely on an 8-category decomposition of the classification scheme derived from the Bulletin officiel des finances publiques - impôts:
+```python3
+label2id = {
+    "Bénéfices professionnels": 0,
+    "Contrôle et contentieux": 1,
+    "Dispositifs transversaux": 2,
+    "Fiscalité des entreprises": 3,
+    "Patrimoine et enregistrement": 4,
+    "Revenus particuliers": 5,
+    "Revenus patrimoniaux": 6,
+    "Taxes sur la consommation": 7
+}
+id2label = {
+    0: "Bénéfices professionnels",
+    1: "Contrôle et contentieux",
+    2: "Dispositifs transversaux",
+    3: "Fiscalité des entreprises",
+    4: "Patrimoine et enregistrement",
+    5: "Revenus particuliers",
+    6: "Revenus patrimoniaux",
+    7: "Taxes sur la consommation"
+}
+```
+This breakdown is debatable, but it reduces the diversity of categories to a manageable set and serves as a starting point for further work.
 This model is a fine-tuned version of [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base).
 It achieves the following results on the evaluation set:
 - Loss: 0.4096
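
The label maps added in this commit can be used to turn per-class scores into a category name. The following is a minimal sketch, not the author's inference code: the `route` helper and the example scores are hypothetical, and it assumes the classifier yields one score per category in the id order given by `label2id`.

```python
# Label map copied from the README change in this commit.
label2id = {
    "Bénéfices professionnels": 0,
    "Contrôle et contentieux": 1,
    "Dispositifs transversaux": 2,
    "Fiscalité des entreprises": 3,
    "Patrimoine et enregistrement": 4,
    "Revenus particuliers": 5,
    "Revenus patrimoniaux": 6,
    "Taxes sur la consommation": 7,
}

# Derive the inverse map so the two dictionaries cannot drift apart.
id2label = {idx: label for label, idx in label2id.items()}


def route(scores):
    """Return the category whose score is highest.

    `scores` is a hypothetical sequence of 8 per-class scores
    (for example logits), ordered by class id.
    """
    if len(scores) != len(id2label):
        raise ValueError(
            f"expected {len(id2label)} scores, got {len(scores)}"
        )
    # Index of the largest score; ties resolve to the lowest id.
    best_id = max(range(len(scores)), key=scores.__getitem__)
    return id2label[best_id]


# Made-up scores for illustration only, not model output.
example_scores = [0.1, 0.2, 0.05, 2.3, 0.0, -1.0, 0.4, 0.3]
print(route(example_scores))  # Fiscalité des entreprises
```

Deriving `id2label` from `label2id` is a design choice of this sketch; the README lists both dictionaries explicitly, which is equivalent as long as they are kept in sync.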