Update README.md
Browse files
README.md
CHANGED
@@ -52,7 +52,7 @@ For details on training and evaluation, read [our paper](https://link.todo)!
|
|
52 |
| **Tulu V2.5 PPO 13B (this model)** | 13B | PPO with 70B RM | 58.0 | **26.7** | 62.8 |
|
53 |
| **Tulu V2 DPO 13B** | 13B | DPO | 50.5 | 16.0 | 61.0 |
|
54 |
| **Tulu V2 SFT 13B** | 13B | - | 46.0 | 10.4 | 62.8 |
|
55 |
-
| **Tulu V2 DPO 70B** |
|
56 |
|
57 |
## Input Format
|
58 |
|
|
|
52 |
| **Tulu V2.5 PPO 13B (this model)** | 13B | PPO with 70B RM | 58.0 | **26.7** | 62.8 |
|
53 |
| **Tulu V2 DPO 13B** | 13B | DPO | 50.5 | 16.0 | 61.0 |
|
54 |
| **Tulu V2 SFT 13B** | 13B | - | 46.0 | 10.4 | 62.8 |
|
55 |
+
| **Tulu V2 DPO 70B** | 70B | DPO | **71.5** | 21.2 | **69.4** |
|
56 |
|
57 |
## Input Format
|
58 |
|