NeoZ123 commited on
Commit
5cd4f53
·
verified ·
1 Parent(s): 96f64b0

Update README_zh.md

Browse files
Files changed (1) hide show
  1. README_zh.md +2 -2
README_zh.md CHANGED
@@ -3,7 +3,7 @@
3
  Read this in [English](README.md)
4
 
5
  <p align="center">
6
- 🤗 <a href="https://huggingface.co/datasets/THUDM/LongReward-10k" target="_blank">[LongReward Dataset] </a> • 💻 <a href="https://github.com/THUDM/LongReward" target="_blank">[Github Repo]</a> • 📃 <a href="https://arxiv.org/abs/" target="_blank">[LongReward Paper]</a>
7
  </p>
8
 
9
  LongReward-glm4-9b-DPO 是 [LongReward-glm4-9b-SFT](https://huggingface.co/THUDM/LongReward-glm4-9b-SFT) 的 DPO 版本,支持最多
@@ -66,7 +66,7 @@ print(tokenizer.decode(out[0][input_len:], skip_special_tokens=True))
66
  title = {LongReward: Improving Long-context Large Language Models
67
  with AI Feedback}
68
  author={Jiajie Zhang and Zhongni Hou and Xin Lv and Shulin Cao and Zhenyu Hou and Yilin Niu and Lei Hou and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
69
- journal={arXiv preprint arXiv:},
70
  year={2024}
71
  }
72
  ```
 
3
  Read this in [English](README.md)
4
 
5
  <p align="center">
6
+ 🤗 <a href="https://huggingface.co/datasets/THUDM/LongReward-10k" target="_blank">[LongReward Dataset] </a> • 💻 <a href="https://github.com/THUDM/LongReward" target="_blank">[Github Repo]</a> • 📃 <a href="https://arxiv.org/abs/2410.21252" target="_blank">[LongReward Paper]</a>
7
  </p>
8
 
9
  LongReward-glm4-9b-DPO 是 [LongReward-glm4-9b-SFT](https://huggingface.co/THUDM/LongReward-glm4-9b-SFT) 的 DPO 版本,支持最多
 
66
  title = {LongReward: Improving Long-context Large Language Models
67
  with AI Feedback}
68
  author={Jiajie Zhang and Zhongni Hou and Xin Lv and Shulin Cao and Zhenyu Hou and Yilin Niu and Lei Hou and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
69
+ journal={arXiv preprint arXiv:2410.21252},
70
  year={2024}
71
  }
72
  ```