What libraries can I use for Zero-Shot Classification?

The transformersand transformers.js libraries are compatible with Zero-Shot Classification.

What models can I use for Zero-Shot Classification?

The facebook/bart-large-mnli, MoritzLaurer/ModernBERT-large-zeroshot-v2.0, and knowledgator/gliclass-modern-base-v2.0-init models can be used for Zero-Shot Classification.

What datasets can I use for Zero-Shot Classification?

The nyu-mll/glue, nyu-mll/multi_nli, and fever/fever datasets can be used for Zero-Shot Classification.

Tasks

Zero-Shot Classification

Zero-shot text classification is a task in natural language processing where a model is trained on a set of labeled examples but is then able to classify new examples from previously unseen classes.

Inputs

Text Input

Dune is the best movie ever.

Candidate Labels

CINEMA, ART, MUSIC

Zero-Shot Classification Model

Output

CINEMA

0.900

ART

0.100

MUSIC

0.000

About Zero-Shot Classification

About the Task

Zero Shot Classification is the task of predicting a class that wasn't seen by the model during training. This method, which leverages a pre-trained language model, can be thought of as an instance of transfer learning which generally refers to using a model trained for one task in a different application than what it was originally trained for. This is particularly useful for situations where the amount of labeled data is small.

In zero shot classification, we provide the model with a prompt and a sequence of text that describes what we want our model to do, in natural language. Zero-shot classification excludes any examples of the desired task being completed. This differs from single or few-shot classification, as these tasks include a single or a few examples of the selected task.

Zero, single and few-shot classification seem to be an emergent feature of large language models. This feature seems to come about around model sizes of +100M parameters. The effectiveness of a model at a zero, single or few-shot task seems to scale with model size, meaning that larger models (models with more trainable parameters or layers) generally do better at this task.

Here is an example of a zero-shot prompt for classifying the sentiment of a sequence of text:

Classify the following input text into one of the following three categories: [positive, negative, neutral]

Input Text: Hugging Face is awesome for making all of these
state of the art models available!
Sentiment: positive

One great example of this task with a nice off-the-shelf model is available at the widget of this page, where the user can input a sequence of text and candidate labels to the model. This is a word level example of zero shot classification, more elaborate and lengthy generations are available with larger models. Testing these models out and getting a feel for prompt engineering is the best way to learn how to use them.

Inference

You can use the 🤗 Transformers library zero-shot-classification pipeline to infer with zero shot text classification models.

from transformers import pipeline

pipe = pipeline(model="facebook/bart-large-mnli")
pipe("I have a problem with my iphone that needs to be resolved asap!",
    candidate_labels=["urgent", "not urgent", "phone", "tablet", "computer"],
)
# output
>>> {'sequence': 'I have a problem with my iphone that needs to be resolved asap!!', 'labels': ['urgent', 'phone', 'computer', 'not urgent', 'tablet'], 'scores': [0.504, 0.479, 0.013, 0.003, 0.002]}

Useful Resources

Deploy on Inference Endpoints

Compatible libraries

Transformers

Transformers.js

using facebook/bart-large-mnli

Models for Zero-Shot Classification

Browse Models (385)

facebook/bart-large-mnli

Zero-Shot Classification • Updated Sep 5, 2023 • 3.2M • • 1.35k

Note Powerful zero-shot text classification model.

MoritzLaurer/ModernBERT-large-zeroshot-v2.0

Text Classification • Updated Jan 16 • 71.8k • 46

Note Cutting-edge zero-shot multilingual text classification model.

knowledgator/gliclass-modern-base-v2.0-init

Zero-Shot Classification • Updated 17 days ago • 10.8k • 22

Note Zero-shot text classification model that can be used for topic and sentiment classification.

Datasets for Zero-Shot Classification

Browse Datasets (501)

nyu-mll/glue

Viewer • Updated Jan 30, 2024 • 1.49M • 228k • 407

Note A widely used dataset used to benchmark multiple variants of text classification.

nyu-mll/multi_nli

Viewer • Updated Jan 4, 2024 • 412k • 3.9k • 100

Note The Multi-Genre Natural Language Inference (MultiNLI) corpus is a crowd-sourced collection of 433k sentence pairs annotated with textual entailment information.

fever/fever

Updated Jan 18, 2024 • 1.03k • 30

Note FEVER is a publicly available dataset for fact extraction and verification against textual sources.

Spaces using Zero-Shot Classification

No example Space is defined for this task.

Note Contribute by proposing a Space for this task !

Metrics for Zero-Shot Classification

No example metric is defined for this task.

Note Contribute by proposing a metric for this task !