sagorsarker
commited on
Commit
•
12aeda2
1
Parent(s):
cd8c8f0
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,24 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
4 |
+
## Bangla Glove Vectors
|
5 |
+
This is a collection of pre-trained glove vectors for the Bengali language. You can find details training in [this](https://github.com/sagorbrur/GloVe-Bengali) repository.
|
6 |
+
|
7 |
+
This model is build for [bnlp](https://github.com/sagorbrur/bnlp) package.
|
8 |
+
|
9 |
+
|
10 |
+
## Datasets
|
11 |
+
- [Wikipedia dump datasets](https://dumps.wikimedia.org/bnwiki/latest/)
|
12 |
+
|
13 |
+
## Model Details
|
14 |
+
- wikipedia+crawl_news_articles (39M(39055685) tokens, 0.18M(178152) vocab size, 195.4MB download):
|
15 |
+
bn_glove.39M.300d.zip
|
16 |
+
|
17 |
+
- wikipeida+crawl_news_articles (39M(39055685) tokens, 0.18M(178152) vocab size, 100d vectors, 65MB download):
|
18 |
+
bn_glove.39M.100d.zip
|
19 |
+
|
20 |
+
- Wikipedia (20M(19965328) tokens, 0.13M(134255) vocab, 300d vectors, 145.9MB download):
|
21 |
+
bn_glove.300d.zip
|
22 |
+
|
23 |
+
- Wikipedia (20M(19965328) tokens, 0.13M(134255), 100d vectors, 51.2MB download):
|
24 |
+
bn_glove.100d.zip
|