ZiyiYe
/

Con-J-Qwen2-7B

Model card Files Files and versions Community

ZiyiYe commited on Oct 8, 2024

Commit

a4b4071

·

verified ·

1 Parent(s): 39387d1

Update README.md

Files changed (1) hide show

README.md +11 -2

README.md CHANGED Viewed

@@ -154,6 +154,15 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ## Reference
-Coming soon.

 ## Reference
+```
+@misc{ye2024scalarrewardmodellearning,
+      title={Beyond Scalar Reward Model: Learning Generative Judge from Preference Data},
+      author={Ziyi Ye and Xiangsheng Li and Qiuchi Li and Qingyao Ai and Yujia Zhou and Wei Shen and Dong Yan and Yiqun Liu},
+      year={2024},
+      eprint={2410.03742},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2410.03742},
+}
+```