|
--- |
|
language: |
|
- rw |
|
dataset: |
|
- kinyabert by antoine nzeyimana |
|
metric: |
|
- loss |
|
pipeline_tag: text-generation |
|
--- |
|
|
|
Kinyarwanda GPT-2 model based on Andrej Karpathy's nanoGPT. It was trained on a mixture of news data and diverse computer-generated datasets in Kinyarwanda. |
|
|
|
## Model configuration |
|
|
|
- number of layers = 6 |
|
- number of heads = 6 |
|
- embeddings = 384 |
|
- block size = 256 |
|
|
|
## Model dependencies |
|
|
|
``` |
|
pip install transformers datasets tiktoken wandb tqdm |
|
|
|
``` |
|
|
|
## Usage |
|
|
|
To use this model to generate text, download all the files in the repo and put them in the same directory, then run the following command: |
|
|
|
``` |
|
python sample.py --out_dir=. |
|
``` |
|
|