Tags: Transformers · Safetensors · English · deberta-v2 · reward_model · reward-model · RLHF · evaluation · llm · instruction · reranking · Inference Endpoints
maywell committed on
Commit 1b901ae
1 Parent(s): 7fc0cce

Update README.md

Files changed (1)
  1. README.md +38 -0
README.md CHANGED
@@ -1,3 +1,41 @@
 ---
 license: apache-2.0
 ---
+ # Better Implementation for [*PairRM*](https://huggingface.co/llm-blender/PairRM)
+
+ # Introduction
+
+ This version of PairRM has some fixes to the training process, which improve the model's performance significantly.
+
+ ## **Minor Fixes**
+
+ ### Longer Context Length (2048 -> 3380)
+
+ Thanks to DeBERTa's tokenizer, the original PairRM model already had enough context length.
+
+ But the longer, the better :>
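+
+ A minimal usage sketch, assuming the Hugging Face `transformers` tokenizer API; the repository id below is a placeholder for this model:
+
+ ```python
+ from transformers import AutoTokenizer
+
+ # Placeholder repository id; substitute the id of this model.
+ tokenizer = AutoTokenizer.from_pretrained("maywell/Better-PairRM")
+
+ # Truncate inputs at the longer context length (3380 tokens) used here.
+ encoded = tokenizer(
+     "How do I sort a list in Python?",
+     truncation=True,
+     max_length=3380,
+     return_tensors="pt",
+ )
+ ```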
+
+ ---
+
+ ## **Major Fixes**
+
+ ### Change Prompt Format
+
+ Why use something like the following?
+ ```
+ <Response i + 1> {response}
+ ```
+
+ So, I changed it to a prompt format based on Vicuna 1.1.
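+
+ For reference, a minimal sketch of the Vicuna 1.1 conversation style this format is based on; the exact template used for training is not spelled out here, so treat this rendering as an illustration:
+
+ ```python
+ def render_vicuna_v1_1(instruction: str, response: str) -> str:
+     """Render a single turn in the Vicuna 1.1 style (system prompt, then
+     USER/ASSISTANT roles separated by single spaces)."""
+     system = (
+         "A chat between a curious user and an artificial intelligence assistant. "
+         "The assistant gives helpful, detailed, and polite answers to the user's questions."
+     )
+     return f"{system} USER: {instruction} ASSISTANT: {response}"
+
+ print(render_vicuna_v1_1("Name three primary colors.", "Red, blue, and yellow."))
+ ```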
+
+ ---
+
+ ### Change Truncation Side
+
+ The original process used right-side truncation even on the input. This can cause serious problems when the input exceeds the model's sequence length.
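+
+ A minimal sketch, assuming the `transformers` tokenizer API, of switching the truncation side so over-long text is cut from the left instead of the right; which side this model actually uses for the input is an assumption here:
+
+ ```python
+ from transformers import AutoTokenizer
+
+ # Placeholder repository id; substitute the id of this model.
+ tokenizer = AutoTokenizer.from_pretrained("maywell/Better-PairRM")
+
+ # Cut tokens from the left so the tail of a too-long input survives,
+ # instead of silently losing everything past the sequence length.
+ tokenizer.truncation_side = "left"
+
+ long_input = "A very long instruction ... " * 500
+ encoded = tokenizer(long_input, truncation=True, max_length=3380)
+ ```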
+
+ ---
+
+ ### Dataset Filter
+
+ There was a decent amount of empty assistant responses in the original dataset, so I dropped them.
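+
+ A minimal sketch of that kind of filter, assuming the Hugging Face `datasets` library; the dataset path and the `output` column name are placeholders, not the actual training data layout:
+
+ ```python
+ from datasets import load_dataset
+
+ # Placeholder dataset path and column name, for illustration only.
+ dataset = load_dataset("path/to/pairwise-training-data", split="train")
+
+ # Drop examples whose assistant response is empty after stripping whitespace.
+ dataset = dataset.filter(lambda example: len(example["output"].strip()) > 0)
+ ```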