|
--- |
|
license: mit |
|
library_name: transformers |
|
pipeline_tag: text-classification |
|
tags: |
|
- URL |
|
- BinaryCalssification |
|
- URL Classification |
|
- BERT URL calssifier |
|
|
|
widget: |
|
- text: "torgi.kz/help/id2.txt" |
|
|
|
- text: "vvps.ws/44/mothersdarlingcross.php?" |
|
|
|
- text: "torrentreactor.net/torrents/897644/ANGELCORPSE-4-(All)-Studio-Albums%5EWaPo" |
|
|
|
--- |
|
|
|
|
|
|
|
ππ€ **Binary URL Classifier with BERT** |
|
|
|
Welcome to our Binary URL Classifier! π |
|
|
|
### What it Does? |
|
|
|
This classifier categorizes URLs into two categories: Good (Non-Malicious) and Bad (Malicious). It uses BERT, a powerful language model, to understand the content and context of URLs for accurate predictions. |
|
|
|
### How it Works? |
|
|
|
1. **Data**: Trained on a curated dataset of good and bad URLs from this repo https://github.com/faizann24/Using-machine-learning-to-detect-malicious-URLs/tree/master?tab=readme-ov-file. |
|
|
|
2. **Training**: Fine-tuned a pre-trained BERT model using Hugging Face's Transformers library. |
|
|
|
3. **Evaluation**: Validation(15%) and Test(15%) set kept Aside. |
|
|
|
4. **Inference**: Provides probability scores for new URLs, indicating their likelihood of being good or bad. |
|
|
|
### Usage π |
|
|
|
1. **Input**: Provide a URL to classify. |
|
|
|
2. **Output**: Receive a probability score for its safety. |
|
|
|
### Why Trust Us? π |
|
|
|
- **Robust Training**: Trained on diverse and representative datasets. |
|
|
|
- **Evaluation**: Rigorously tested for accuracy and reliability. |
|
|
|
- **Simple & Effective**: Empowers users to make informed decisions about URL safety. |
|
|
|
### Let's Make the Web Safer! π |
|
|
|
Our Binary URL Classifier with BERT is here to enhance online safety. Whether you're a security professional, developer, or concerned user, it helps identify malicious URLs for proactive protection. |
|
|
|
Try it now and experience the power of BERT in URL classification! We're open to feedback and suggestions for continuous improvement. Stay safe online! ππ‘οΈ |