---
license: mit
---
## Bangla GloVe Vectors
This is a collection of pre-trained GloVe vectors for the Bengali language. Training details are available in [this](https://github.com/sagorbrur/GloVe-Bengali) repository.
This model is built for the [bnlp](https://github.com/sagorbrur/bnlp) package.
## Datasets
- [Wikipedia dump datasets](https://dumps.wikimedia.org/bnwiki/latest/)
## Model Details
- wikipedia+crawl_news_articles (39M (39055685) tokens, 0.18M (178152) vocab size, 300d vectors, 195.4MB download):
bn_glove.39M.300d.zip
- wikipedia+crawl_news_articles (39M (39055685) tokens, 0.18M (178152) vocab size, 100d vectors, 65MB download):
bn_glove.39M.100d.zip
- Wikipedia (20M (19965328) tokens, 0.13M (134255) vocab size, 300d vectors, 145.9MB download):
bn_glove.300d.zip
- Wikipedia (20M (19965328) tokens, 0.13M (134255) vocab size, 100d vectors, 51.2MB download):
bn_glove.100d.zip
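
The downloads listed above are ZIP archives, while the usage example expects a plain-text vectors file. The following is a minimal sketch of unpacking an archive with Python's standard library, assuming the archive has already been downloaded to the working directory. The helper name `extract_vectors` and the output directory `glove_vectors` are hypothetical, and the names of the files inside each archive are not stated here, so the function simply returns whatever it extracts.

```python
import zipfile
from pathlib import Path


def extract_vectors(zip_path, out_dir="glove_vectors"):
    """Extract a downloaded archive and return the paths of the extracted files.

    `extract_vectors` and `out_dir` are illustrative names, not part of bnlp.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(out)
        names = archive.namelist()
    return [out / name for name in names]


# Example (assumes the archive sits in the current directory):
# files = extract_vectors("bn_glove.39M.100d.zip")
# print(files)
```

The returned paths can then be inspected to find the vectors file to pass as `glove_path`.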
## Usage
```py
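from bnlp import BengaliGlove

# Path to the vectors file in text format
glove_path = "bn_glove.39M.100d.txt"
word = "গ্রাম"

bng = BengaliGlove()

# Look up the words closest to the query word
res = bng.closest_word(glove_path, word)
print(res)

# Look up the embedding vector for the query word
vec = bng.word2vec(glove_path, word)
print(vec)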
```
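
For readers who want to inspect the vectors without bnlp, here is a rough sketch in plain Python. It assumes the vectors file follows the usual GloVe text layout (one token per line, followed by space-separated float values), which is not confirmed here for these particular files. The function names `load_glove`, `cosine`, and `nearest` are hypothetical helpers and are not part of bnlp; this is not how bnlp itself implements `closest_word`.

```python
import math


def load_glove(path):
    """Read a GloVe-style text file into a dict of word -> list of floats.

    Assumes each line is: token value1 value2 ... (space separated).
    """
    vectors = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split(" ")
            if len(parts) < 2:
                continue  # skip blank or malformed lines
            vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest(vectors, word, k=5):
    """Return the k (word, similarity) pairs most similar to `word`, best first."""
    target = vectors[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Example (assumes the extracted vectors file is in the current directory):
# vectors = load_glove("bn_glove.39M.100d.txt")
# print(nearest(vectors, "গ্রাম"))
```

This brute-force search compares the query against every word in the vocabulary, which is fine for exploration but slow for repeated lookups; a vectorized or indexed approach would be a better fit for heavier use.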