---
license: apache-2.0
---
# Better Implementation for [*PairRM*](https://huggingface.co/llm-blender/PairRM)

# Introduction

This version of PairRM has some fixes to the training process, which improve the model's performance significantly.

## **Minor Fixes**

### Longer Context Length (2048 -> 3380)

Thanks to DeBERTa's tokenizer, the original PairRM model already had enough context length.

But the longer, the better :>
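
A minimal sketch of where the new limit applies at tokenization time, assuming the DeBERTa-v3-Large backbone and the Hugging Face `transformers` tokenizer (this is not the exact training code):

```python
# Sketch only: the backbone name is an assumption; the only change shown is
# the larger max_length used when tokenizing.
from transformers import AutoTokenizer

MAX_LENGTH = 3380  # raised from the original 2048

tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-large")

encoded = tokenizer(
    "the formatted prompt plus both candidate responses goes here",
    truncation=True,
    max_length=MAX_LENGTH,
    return_tensors="pt",
)
print(encoded["input_ids"].shape)
```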

---

## **Major Fixes**

### Change Prompt Format

Why use an awkward ad-hoc format like this?

```
<Response i + 1> {response}
```

So, I changed it to a format based on Vicuna 1.1.
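
The exact template is not spelled out in this card, so the snippet below is only an illustrative Vicuna-1.1-style formatter; the system prompt text and the `format_candidate` helper are placeholders, not the model's definitive format:

```python
# Illustrative Vicuna-1.1-style formatting; the exact template used for this
# model is an assumption, and format_candidate is a hypothetical helper.
SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the "
    "user's questions."
)

def format_candidate(instruction: str, response: str) -> str:
    """Render one (instruction, candidate response) pair in Vicuna 1.1 style."""
    return f"{SYSTEM} USER: {instruction} ASSISTANT: {response}"

# Each candidate response is rendered the same way before the pair is compared.
print(format_candidate(
    "Summarize DeBERTa in one sentence.",
    "DeBERTa is a transformer encoder with disentangled attention.",
))
```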

---

### Change Truncate side

The original preprocessing used right-side truncation, even on the input. This can cause serious problems when the input exceeds the model's sequence length.
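
A minimal sketch of the fix, assuming a Hugging Face `transformers` tokenizer is used for preprocessing (the backbone name below is an assumption):

```python
# Sketch: switch to left-side truncation so that, when the input is too long,
# tokens are dropped from the beginning rather than from the end of the input.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-large")  # assumed backbone
tokenizer.truncation_side = "left"  # the original pipeline effectively used "right"

encoded = tokenizer(
    "a very long multi-turn conversation ... " * 500,
    truncation=True,
    max_length=3380,
)
```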

---

### Dataset Filter

There was a decent amount of empty assistant responses in the original dataset, so I dropped them.
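
A hedged sketch of the filtering step using the `datasets` library; the dataset name and column names below are placeholders, since the card does not specify them:

```python
# Dataset name and "response" column are assumptions, shown only to
# illustrate dropping empty assistant responses.
from datasets import load_dataset

ds = load_dataset("your/preference-dataset", split="train")  # placeholder name

def has_nonempty_response(example: dict) -> bool:
    """Keep only examples whose assistant response is non-empty."""
    response = example.get("response") or ""
    return response.strip() != ""

ds = ds.filter(has_nonempty_response)
```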