--- license: apache-2.0 datasets: - AyoubChLin/CNN_News_Articles_2011-2022 language: - en metrics: - accuracy pipeline_tag: zero-shot-classification --- # DistilBERT for Zero Shot Classification This repository contains a DistilBERT model trained for zero-shot classification on CNN articles. The model has been evaluated on CNN articles and achieved an accuracy of 0.956 and an F1 score of 0.955. ## Model Details - Architecture: DistilBERT - Training Data: CNN articles - Accuracy: 0.956 - F1 Score: 0.955 ## Usage To use this model for zero-shot classification, you can follow the steps below: 1. Load the trained model: ```python from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("AyoubChLin/DistilBERT_ZeroShot") model = AutoModelForSequenceClassification.from_pretrained("AyoubChLin/DistilBERT_ZeroShot") ``` 4. Classify text using zero-shot classification: ```python from transformers import pipeline # Create a zero-shot classification pipeline classifier = pipeline("zero-shot-classification", model=model, tokenizer=tokenizer) # Classify a sentence sentence = "The latest scientific breakthroughs in medicine" candidate_labels = ["politics", "sports", "technology", "business"] result = classifier(sentence, candidate_labels) print(result) ``` The output will be a dictionary containing the classified label and the corresponding classification score. ## About the Author This work was created by Ayoub Cherguelaine. If you have any questions or suggestions regarding this repository or the trained model, feel free to reach out to Ayoub Cherguelaine.