angelahzyuan
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -32,7 +32,7 @@ This model was developed using [Self-Play Preference Optimization](https://arxiv
|
|
32 |
| Model | LC. Win Rate | Win Rate | Avg. Length |
|
33 |
|-------------------------------------------|:------------:|:--------:|:-----------:|
|
34 |
|[Llama-3-8B-SPPO Iter1](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1) |48.70 |40.76 | 1669
|
35 |
-
|[Llama-3-8B-SPPO Iter2](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2)
|
36 |
|[Llama-3-8B-SPPO Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3) |**53.27** |**47.74** | 1803
|
37 |
|
38 |
|
|
|
32 |
| Model | LC. Win Rate | Win Rate | Avg. Length |
|
33 |
|-------------------------------------------|:------------:|:--------:|:-----------:|
|
34 |
|[Llama-3-8B-SPPO Iter1](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1) |48.70 |40.76 | 1669
|
35 |
+
|[Llama-3-8B-SPPO Iter2](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2) |50.93 | 44.64 | 1759
|
36 |
|[Llama-3-8B-SPPO Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3) |**53.27** |**47.74** | 1803
|
37 |
|
38 |
|