---
datasets:
- JeanKaddour/minipile
language:
- en
license: mpl-2.0
---
# RWKV Minipile
## Model Specifications
- **Architecture**: RWKV
- **Vocabulary Size**: 65,536
- **Embedding Size**: 768
- **Number of Layers**: 12
- **Context Length**: 512
- **Data Type**: bfloat16
- **Dataset**: Minipile
- **Tokens**: 20,643,840 (about 20.6 million)

The model was trained for 30 epochs on this dataset.
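
As a rough sanity check on the figures above, the following sketch derives a few quantities from the listed specifications. It assumes the token stream is split into non-overlapping windows of the full context length and that bfloat16 values take 2 bytes each; neither the windowing scheme nor the variable names come from the training code, so treat the results as estimates.

```python
# Figures copied from the specification list in this README.
VOCAB_SIZE = 65_536
EMBEDDING_SIZE = 768
CONTEXT_LENGTH = 512
DATASET_TOKENS = 20_643_840
EPOCHS = 30

# Assumption: the dataset is cut into non-overlapping windows of CONTEXT_LENGTH tokens.
sequences_per_epoch = DATASET_TOKENS // CONTEXT_LENGTH

# Total tokens processed if every epoch covers the whole token set once.
tokens_seen = DATASET_TOKENS * EPOCHS

# Size of the token embedding matrix alone (vocabulary x embedding size).
embedding_params = VOCAB_SIZE * EMBEDDING_SIZE

# bfloat16 stores each value in 2 bytes.
embedding_mib = embedding_params * 2 / 2**20

print(f"Sequences per epoch: {sequences_per_epoch:,}")
print(f"Tokens seen in training: {tokens_seen:,}")
print(f"Embedding parameters: {embedding_params:,}")
print(f"Embedding size in bfloat16: {embedding_mib:.0f} MiB")
```

Under these assumptions the token count divides evenly into 512-token windows, and the embedding matrix accounts for roughly 50 million parameters on its own.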
## Inference
```bash
pip install torch numpy
python inference.py
```