Commit c899018 · Parent(s): 97c31f1
Update README.md
README.md CHANGED
@@ -57,20 +57,18 @@ print(output)

Before:

 57   ### Details on data and training
 58   The code for preparing the data and training & evaluating the model is fully open-source here: https://github.com/MoritzLaurer/zeroshot-classifier/tree/main
 59
 60 - ## Limitations and bias
 61 - The model can only do text classification tasks.
 62
 63 - Please consult the original DeBERTa paper and the papers for the different datasets for potential biases.
 64
 65   ## Metrics
 66
 67 - Balanced accuracy
 68   `deberta-v3-large-zeroshot-v1.1-all-33` was trained on all datasets, with only maximum 500 texts per class to avoid overfitting.
 69 - The metrics on these datasets are therefore not strictly zeroshot, as the model has seen some data for each task.
 70   `deberta-v3-large-zeroshot-v1.1-heldout` indicates zeroshot performance on the respective dataset.
 71   To calculate these zeroshot metrics, the pipeline was run 28 times, each time with one dataset held out from training to simulate a zeroshot setup.
 72
 73 - ![figure_large_v1.1](https://
 74
 75
 76   | | deberta-v3-large-mnli-fever-anli-ling-wanli-binary | deberta-v3-large-zeroshot-v1.1-heldout | deberta-v3-large-zeroshot-v1.1-all-33 |

@@ -115,6 +113,12 @@ To calculate these zeroshot metrics, the pipeline was run 28 times, each time wi

115
116
117
118   ## License
119   The base model (DeBERTa-v3) is published under the MIT license.
120   The datasets the model was fine-tuned on are published under a diverse set of licenses.
After:

 57   ### Details on data and training
 58   The code for preparing the data and training & evaluating the model is fully open-source here: https://github.com/MoritzLaurer/zeroshot-classifier/tree/main
 59
 60 + Hyperparameters and other details are available in this Weights & Biases repo: https://wandb.ai/moritzlaurer/deberta-v3-large-zeroshot-v1-1-all-33/table?workspace=user-
 61
 62
 63   ## Metrics
 64
 65 + Balanced accuracy is reported for all datasets.
 66   `deberta-v3-large-zeroshot-v1.1-all-33` was trained on all datasets, with only maximum 500 texts per class to avoid overfitting.
 67 + The metrics on these datasets are therefore not strictly zeroshot, as the model has seen some data for each task during training.
 68   `deberta-v3-large-zeroshot-v1.1-heldout` indicates zeroshot performance on the respective dataset.
 69   To calculate these zeroshot metrics, the pipeline was run 28 times, each time with one dataset held out from training to simulate a zeroshot setup.
 70
 71 + ![figure_large_v1.1](https://raw.githubusercontent.com/MoritzLaurer/zeroshot-classifier/main/results/fig_large_v1.1.png)
 72
 73
 74   | | deberta-v3-large-mnli-fever-anli-ling-wanli-binary | deberta-v3-large-zeroshot-v1.1-heldout | deberta-v3-large-zeroshot-v1.1-all-33 |

113
114
115
116 + ## Limitations and bias
117 + The model can only do text classification tasks.
118 +
119 + Please consult the original DeBERTa paper and the papers for the different datasets for potential biases.
120 +
121 +
122   ## License
123   The base model (DeBERTa-v3) is published under the MIT license.
124   The datasets the model was fine-tuned on are published under a diverse set of licenses.
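For reference, the balanced accuracy reported in the Metrics section is the unweighted mean of per-class recall, which keeps majority classes from dominating the score on imbalanced datasets. A minimal pure-Python sketch (the toy labels below are illustrative, not from the card's benchmarks):

```python
from collections import defaultdict

def balanced_accuracy(y_true, y_pred):
    """Balanced accuracy: the unweighted mean of per-class recall."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)

# Imbalanced toy example: predicting "a" everywhere gets 4/5 = 0.8 plain
# accuracy, but balanced accuracy averages recall(a)=1.0 and recall(b)=0.0.
y_true = ["a", "a", "a", "a", "b"]
y_pred = ["a", "a", "a", "a", "a"]
print(balanced_accuracy(y_true, y_pred))  # 0.5
```

This matches `sklearn.metrics.balanced_accuracy_score` on the same inputs; the averaging is what makes it a fairer summary across the card's 28 differently sized datasets.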
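The heldout metrics described in the changed section come from a leave-one-dataset-out loop: the pipeline is run 28 times, each time training on all datasets except one and evaluating zero-shot on the one held out. A schematic sketch with placeholder dataset names (the actual 28 dataset names are not listed in this section):

```python
def leave_one_dataset_out(dataset_names):
    """For each dataset, yield (train_sets, held_out): train on all the
    others and evaluate zero-shot on the one held out of training."""
    for held_out in dataset_names:
        train_sets = [d for d in dataset_names if d != held_out]
        yield train_sets, held_out

# Placeholder names; the card's pipeline uses 28 datasets, so it
# produces 28 (train, held-out) pairs, one evaluation run per pair.
names = ["dataset_a", "dataset_b", "dataset_c"]
splits = list(leave_one_dataset_out(names))
print(len(splits))  # 3
```

Because the held-out dataset never appears in `train_sets`, the resulting score is a genuine zeroshot estimate for that task, unlike the `all-33` numbers.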