Update README.md
Browse files
README.md
CHANGED
@@ -18,5 +18,13 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
|
|
18 |
| 10 | Qwen/Qwen2-72B-Instruct | 8.00 | 8.14 | 9.07 | 9.85 | 9.78 | 7.28 | 8.61 | 8.76 | 8.69 | 72B |
|
19 |
| 11 | google/gemini-1.5-pro-001 | 7.00 | 8.00 | 9.57 | 8.85 | 9.35 | 8.64 | 8.61 | 8.52 | 8.57 | ? |
|
20 |
|
21 |
-
### KMMLU Benchmark
|
22 |
-
* [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark score.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
| 10 | Qwen/Qwen2-72B-Instruct | 8.00 | 8.14 | 9.07 | 9.85 | 9.78 | 7.28 | 8.61 | 8.76 | 8.69 | 72B |
|
19 |
| 11 | google/gemini-1.5-pro-001 | 7.00 | 8.00 | 9.57 | 8.85 | 9.35 | 8.64 | 8.61 | 8.52 | 8.57 | ? |
|
20 |
|
21 |
+
### KMMLU Benchmark
|
22 |
+
* [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark accuracy score.
|
23 |
+
|
24 |
+
| Category |Qwen2-72B kor-dpo| Qwen2-72B | Questions |
|
25 |
+
|-----------------|-----------------|------------|------------|
|
26 |
+
| HUMSS | 0.63 | 0.63 | 5130 |
|
27 |
+
| STEM | 0.59 | 0.59 | 9900 |
|
28 |
+
| Applied Science | 0.56 | 0.56 | 11600 |
|
29 |
+
| Other | 0.58 | 0.58 | 8400 |
|
30 |
+
| Overall Accuracy| 0.58 | 0.58 | 35030 |
|