saiful9379
commited on
Commit
•
37342ba
1
Parent(s):
ff3308c
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,34 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
4 |
+
|
5 |
+
# BanglaNER
|
6 |
+
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1uN1WP7MjaBYXKABfhkHGn7EBWm9kd9k9?usp=sharing)
|
7 |
+
Bangla Name Entity Recognition (NER) is extracting human names from input Bangla string or text. To solve this problem select Spacy pipeline and try 5 experimental approaches.
|
8 |
+
|
9 |
+
The experiment is done only using one entity name (person) labeled as PER. After completing the experiment we got the best performance from the spacy transformer-based model.
|
10 |
+
|
11 |
+
For more detail please check the experimental details and Best model F1 score is ~.81.05.
|
12 |
+
# Dataset
|
13 |
+
|
14 |
+
Bangla NER data is collected from,
|
15 |
+
|
16 |
+
1. banglakit Bangla NER Dataset [Link](https://raw.githubusercontent.com/banglakit/bengali-ner-data/master/main.jsonl)
|
17 |
+
|
18 |
+
2. Rifat1493 Bangla NER Dataset [Link](https://github.com/Rifat1493/Bengali-NER/tree/master/Input)
|
19 |
+
|
20 |
+
|
21 |
+
3. SemEval2022 Bangla NER Dataaset [Link](https://competitions.codalab.org/competitions/36425#learn_the_details)
|
22 |
+
|
23 |
+
More Detail about the model check github.
|
24 |
+
|
25 |
+
# Reference
|
26 |
+
|
27 |
+
1. [Spacy Training Pipelines & Models](https://spacy.io/usage/training)
|
28 |
+
2. [NER data annotation](https://doccano.github.io/doccano/tutorial/)
|
29 |
+
|
30 |
+
3. [BERT Pretrin model ](https://github.com/csebuetnlp/banglabert)
|
31 |
+
4. [BILOU data formats meaning](https://stackoverflow.com/questions/17116446/what-do-the-bilou-tags-mean-in-named-entity-recognition)
|
32 |
+
5. [SpaCy 3.1 data format](https://zachlim98.github.io/me/2021-03/spacy3-ner-tutorial)
|
33 |
+
6. [Tranformer infornation](https://jalammar.github.io/illustrated-transformer/)
|
34 |
+
7. [Load Gensim WordVectors into spacy pipeline](https://stackoverflow.com/questions/75521069/load-gensim-wordvectors-into-spacy-pipeline)
|