Update README_zh.md
README_zh.md CHANGED (+2 -2)
@@ -3,7 +3,7 @@
 Read this in [English](README.md)
 
 <p align="center">
-🤗 <a href="https://huggingface.co/datasets/THUDM/LongReward-10k" target="_blank">[LongReward Dataset] </a> • 💻 <a href="https://github.com/THUDM/LongReward" target="_blank">[Github Repo]</a> • 📃 <a href="https://arxiv.org/abs/" target="_blank">[LongReward Paper]</a>
+🤗 <a href="https://huggingface.co/datasets/THUDM/LongReward-10k" target="_blank">[LongReward Dataset] </a> • 💻 <a href="https://github.com/THUDM/LongReward" target="_blank">[Github Repo]</a> • 📃 <a href="https://arxiv.org/abs/2410.21252" target="_blank">[LongReward Paper]</a>
 </p>
 
 LongReward-glm4-9b-DPO 是 [LongReward-glm4-9b-SFT](https://huggingface.co/THUDM/LongReward-glm4-9b-SFT) 的 DPO 版本,支持最多
@@ -66,7 +66,7 @@ print(tokenizer.decode(out[0][input_len:], skip_special_tokens=True))
 title = {LongReward: Improving Long-context Large Language Models
 with AI Feedback}
 author={Jiajie Zhang and Zhongni Hou and Xin Lv and Shulin Cao and Zhenyu Hou and Yilin Niu and Lei Hou and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
-journal={arXiv preprint arXiv:},
+journal={arXiv preprint arXiv:2410.21252},
 year={2024}
 }
 ```