princeton-nlp/efficient_mlm_m0.40


princeton-nlp committed on Apr 27, 2022

Commit 08e4470 • 1 Parent(s): dc3d269

Update README.md

Files changed (1)

README.md +6 -1

README.md CHANGED

@@ -1,3 +1,8 @@
 ---
 inference: false
----
+---
+This is a model checkpoint for ["Should You Mask 15% in Masked Language Modeling?"](https://arxiv.org/abs/2202.08005) [(code)](https://github.com/princeton-nlp/DinkyTrain.git). The model uses pre-layer normalization, which is not supported by HuggingFace. To use the model, download the code from our [GitHub repo](https://github.com/princeton-nlp/DinkyTrain.git) and import the RoBERTa classes from `huggingface/modeling_roberta_prelayernorm.py`. For example:
+```python
+from huggingface.modeling_roberta_prelayernorm import RobertaForMaskedLM, RobertaForSequenceClassification
+```
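
Note on the README change: the import above only works when a local clone of DinkyTrain is on the Python path. Below is a minimal sketch of one way to load this checkpoint after cloning. The helper name `load_masked_lm` and the `repo_dir` argument are hypothetical. The sketch also assumes that the repo's `RobertaForMaskedLM` inherits `from_pretrained` from the `transformers` `PreTrainedModel` base class, which the README does not confirm.

```python
import sys
from pathlib import Path

# Hub id of this repository, taken from the page header.
MODEL_ID = "princeton-nlp/efficient_mlm_m0.40"


def load_masked_lm(repo_dir, model_id=MODEL_ID):
    """Load the checkpoint using the pre-layer-norm RoBERTa class from a local DinkyTrain clone.

    `repo_dir` (hypothetical parameter) is the path to the cloned repository.
    """
    repo_dir = Path(repo_dir).expanduser().resolve()
    module_file = repo_dir / "huggingface" / "modeling_roberta_prelayernorm.py"
    if not module_file.is_file():
        raise FileNotFoundError(f"{module_file} not found; clone DinkyTrain first")

    # Make `huggingface.modeling_roberta_prelayernorm` importable from the clone.
    if str(repo_dir) not in sys.path:
        sys.path.insert(0, str(repo_dir))
    from huggingface.modeling_roberta_prelayernorm import RobertaForMaskedLM

    # Assumption: the class exposes the standard `from_pretrained` loader.
    return RobertaForMaskedLM.from_pretrained(model_id)
```

Calling `load_masked_lm("path/to/DinkyTrain")` would then return the model, provided `transformers` and its dependencies are installed. The path check fails early with a clear error when the clone is missing, instead of surfacing as an opaque import error.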