---
language: vi
datasets:
- oscar
---

# NlpHUST/roberta-base-vn

## Model description

This is a Vietnamese RoBERTa base model pretrained on the Vietnamese portion of the OSCAR dataset.

## How to use

You can use this model for masked language modeling as follows:

```python
from transformers import AutoTokenizer, AutoModelForMaskedLM

# Load the tokenizer and the model with its masked-language-modeling head from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained("NlpHUST/roberta-base-vn")
model = AutoModelForMaskedLM.from_pretrained("NlpHUST/roberta-base-vn")
```

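To see the model actually predict a masked token, a minimal sketch using the `fill-mask` pipeline from `transformers` might look like the following. The helper names `predict_masked` and `format_predictions`, the `___` placeholder, and the example sentence are illustrative assumptions, not part of the original model card.

```python
from typing import Dict, List

MODEL_NAME = "NlpHUST/roberta-base-vn"


def format_predictions(predictions: List[Dict], top_k: int = 3) -> List[str]:
    """Turn fill-mask pipeline output into 'token (score)' strings, best first."""
    ranked = sorted(predictions, key=lambda p: p["score"], reverse=True)
    return [f'{p["token_str"].strip()} ({p["score"]:.3f})' for p in ranked[:top_k]]


def predict_masked(text_with_blank: str, blank: str = "___") -> List[Dict]:
    """Replace the (hypothetical) blank marker with the mask token and run the model."""
    # Imported lazily so format_predictions stays usable without transformers installed.
    from transformers import pipeline

    fill_mask = pipeline("fill-mask", model=MODEL_NAME)
    # Use the tokenizer's own mask token instead of hard-coding "<mask>".
    text = text_with_blank.replace(blank, fill_mask.tokenizer.mask_token)
    # For a single masked position the pipeline returns a list of dicts
    # with "score", "token", "token_str" and "sequence" keys.
    return fill_mask(text)
```

For example, `format_predictions(predict_masked("Hà Nội là ___ của Việt Nam."))` would return the three highest-scoring candidates for the blank. The first call downloads the model weights, so it needs network access.
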
You can fine-tune this model on downstream tasks.

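As one possible starting point for fine-tuning, the sketch below loads the pretrained encoder with a new sequence-classification head. Text classification is only an assumed example task, and `build_label_maps` and `load_classifier` are hypothetical helper names; the model card does not prescribe a particular task or training recipe.

```python
from typing import Dict, List, Tuple

MODEL_NAME = "NlpHUST/roberta-base-vn"


def build_label_maps(labels: List[str]) -> Tuple[Dict[str, int], Dict[int, str]]:
    """Map label names to integer ids and back, in sorted order for reproducibility."""
    unique = sorted(set(labels))
    label2id = {name: i for i, name in enumerate(unique)}
    id2label = {i: name for name, i in label2id.items()}
    return label2id, id2label


def load_classifier(labels: List[str]):
    """Load the pretrained encoder with a freshly initialised classification head."""
    # Imported lazily so build_label_maps works without transformers installed.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    label2id, id2label = build_label_maps(labels)
    tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
    model = AutoModelForSequenceClassification.from_pretrained(
        MODEL_NAME,
        num_labels=len(label2id),
        label2id=label2id,
        id2label=id2label,
    )
    return tokenizer, model
```

The returned model can then be trained with the `Trainer` API or a plain PyTorch loop. The classification head starts from random weights, so predictions are not meaningful until the model has been trained on labelled data.
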