jiazhengli commited on
Commit
41b0cbc
1 Parent(s): 8a74685

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -19,5 +19,22 @@ widget:
19
 
20
  # Model Card for deberta-v3-large-Rationale-to-Score
21
 
22
- This repository provides a fine-tuned version of microsoft/deberta-v3-large,
23
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
 
20
  # Model Card for deberta-v3-large-Rationale-to-Score
21
 
22
+ This repository hosts a version of `microsoft/deberta-v3-large` that has been fine-tuned to assess text-based rationales and generate corresponding scores. As shown in the examples, the model processes a given free-text rationale and outputs a numerical score.
23
 
24
+ For a comprehensive understanding of the training process and methodologies employed, please refer to our detailed research paper: [Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring](https://arxiv.org/abs/2406.19949).
25
+
26
+ If you utilize this model in your research, please acknowledge it by citing our work:
27
+
28
+ ## Citation Information
29
+
30
+ ```bibtex
31
+ @misc{li2024calibratingllmspreferenceoptimization,
32
+ title={Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring},
33
+ author={Jiazheng Li and Hainiu Xu and Zhaoyue Sun and Yuxiang Zhou and David West and Cesare Aloisi and Yulan He},
34
+ year={2024},
35
+ eprint={2406.19949},
36
+ archivePrefix={arXiv},
37
+ primaryClass={cs.CL},
38
+ url={https://arxiv.org/abs/2406.19949},
39
+ }
40
+ ```