wangkevin02 commited on
Commit
48c56aa
·
verified ·
1 Parent(s): f3c1b5f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -15,7 +15,7 @@ base_model:
15
 
16
  > **GitHub repository** for exploring the source code and additional resources: https://github.com/wangkevin02/USP
17
 
18
- The **AI Detect Model** is a binary classification model designed to determine whether a given text is AI-generated (label=1) or written by a human (label=0). This model plays a crucial role in providing AI detection rewards, helping to prevent reward hacking during Reinforcement Learning with Cycle Consistency (RLCC). For more details, please refer to [our paper](https://tongyi.aliyun.com/qianwen/?sessionId=ea3bbcf36a2346a0a7819b06fcb36a1c#).
19
 
20
  This model is built upon the [Longformer](https://huggingface.co/allenai/longformer-base-4096) architecture and trained using our proprietary [LMSYS-USP](https://huggingface.co/datasets/wangkevin02/LMSYS-USP) dataset. Specifically, in a dialogue context, texts generated by the assistant are labeled as AI-generated (label=1), while user-generated texts are assigned the opposite label (label=0).
21
 
@@ -78,5 +78,13 @@ if __name__ == "__main__":
78
  If you find this model useful, please cite:
79
 
80
  ```plaintext
81
- [Authors], "[Paper Title]," [Venue], [Year], [URL or DOI].
 
 
 
 
 
 
 
 
82
  ```
 
15
 
16
  > **GitHub repository** for exploring the source code and additional resources: https://github.com/wangkevin02/USP
17
 
18
+ The **AI Detect Model** is a binary classification model designed to determine whether a given text is AI-generated (label=1) or written by a human (label=0). This model plays a crucial role in providing AI detection rewards, helping to prevent reward hacking during Reinforcement Learning with Cycle Consistency (RLCC). For more details, please refer to [our paper](https://arxiv.org/pdf/2502.18968).
19
 
20
  This model is built upon the [Longformer](https://huggingface.co/allenai/longformer-base-4096) architecture and trained using our proprietary [LMSYS-USP](https://huggingface.co/datasets/wangkevin02/LMSYS-USP) dataset. Specifically, in a dialogue context, texts generated by the assistant are labeled as AI-generated (label=1), while user-generated texts are assigned the opposite label (label=0).
21
 
 
78
  If you find this model useful, please cite:
79
 
80
  ```plaintext
81
+ @misc{wang2025knowbettermodelinghumanlike,
82
+ title={Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles},
83
+ author={Kuang Wang and Xianfei Li and Shenghao Yang and Li Zhou and Feng Jiang and Haizhou Li},
84
+ year={2025},
85
+ eprint={2502.18968},
86
+ archivePrefix={arXiv},
87
+ primaryClass={cs.CL},
88
+ url={https://arxiv.org/abs/2502.18968},
89
+ }
90
  ```