
language: vi
datasets:
- oscar

NlpHUST/roberta-base-vn

Model description

This is a Vietnamese RoBERTa base model pretrained on the Vietnamese portion of the OSCAR dataset.

How to use

You can use this model for masked language modeling as follows:

from transformers import AutoTokenizer, AutoModelForMaskedLM

# Load the tokenizer and the model with its masked-language-modeling head from the Hub
tokenizer = AutoTokenizer.from_pretrained("NlpHUST/roberta-base-vn")
model = AutoModelForMaskedLM.from_pretrained("NlpHUST/roberta-base-vn")
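
To actually get predictions for a masked position, one option is the `fill-mask` pipeline from `transformers`. The following is a minimal sketch, not the authors' own recipe: the helper names `load_fill_mask` and `top_predictions` and the `[MASK]` placeholder convention are assumptions made for illustration. The placeholder is swapped for whatever mask token the model's tokenizer defines, so the sketch does not hard-code it.

```python
MODEL_NAME = "NlpHUST/roberta-base-vn"


def load_fill_mask(model_name=MODEL_NAME):
    """Build a fill-mask pipeline (hypothetical helper; downloads the model on first use)."""
    # Imported lazily so the helper below can be used without loading the library.
    from transformers import pipeline

    return pipeline("fill-mask", model=model_name)


def top_predictions(fill_mask, template, top_k=5):
    """Return (token, score) pairs for a template containing exactly one [MASK]."""
    if template.count("[MASK]") != 1:
        raise ValueError("template must contain exactly one [MASK] placeholder")
    # Use the tokenizer's own mask token rather than assuming its spelling.
    text = template.replace("[MASK]", fill_mask.tokenizer.mask_token)
    results = fill_mask(text, top_k=top_k)
    # With a single mask, the pipeline returns a list of dicts.
    return [(r["token_str"].strip(), r["score"]) for r in results]
```

For example, `top_predictions(load_fill_mask(), "Hà Nội là thủ đô của [MASK] Nam.")` would return the five highest-scoring candidates for the masked position. The scores depend on the model weights, so no output is shown here.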

You can fine-tune this model on downstream tasks.
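
As one possible starting point for fine-tuning, the checkpoint can be loaded with a task-specific head. The sketch below assumes a text classification task; the helper names `build_label_maps` and `load_for_classification` and the label set are hypothetical and not part of the original model card. The classification head is newly initialized and must be trained on labeled data before its predictions are meaningful.

```python
MODEL_NAME = "NlpHUST/roberta-base-vn"


def build_label_maps(labels):
    """Map label names to integer ids and back, in the order given."""
    label2id = {name: idx for idx, name in enumerate(labels)}
    id2label = {idx: name for name, idx in label2id.items()}
    return label2id, id2label


def load_for_classification(labels, model_name=MODEL_NAME):
    """Load the tokenizer and the encoder with an untrained classification head."""
    # Imported lazily so build_label_maps can be used without loading the library.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    label2id, id2label = build_label_maps(labels)
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_name,
        num_labels=len(labels),
        label2id=label2id,
        id2label=id2label,
    )
    return tokenizer, model
```

Calling `load_for_classification(["negative", "positive"])` (an illustrative label set) returns a tokenizer and model that can then be passed to a training loop or to the `Trainer` class from `transformers`.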