Add link to paper
This PR ensures the model can be viewed at https://huggingface.co/papers/2410.17215.
Feel free to update the other model cards, and add the paper to the collection :)
README.md
CHANGED
````diff
@@ -37,4 +37,14 @@ MiniPLM models achieves better performance given the same computation and scales
 
 ## Citation
 
-
+```bibtex
+@misc{gu2024miniplmknowledgedistillationpretraining,
+      title={MiniPLM: Knowledge Distillation for Pre-Training Language Models},
+      author={Yuxian Gu and Hao Zhou and Fandong Meng and Jie Zhou and Minlie Huang},
+      year={2024},
+      eprint={2410.17215},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2410.17215},
+}
+```
````