Taka008 committed on
Commit
1f5a6d0
1 Parent(s): f67409a

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -148,6 +148,8 @@ We evaluated the models using 100 examples from the dev split.
 
 ### Japanese MT Bench
 
+We evaluated the models using `gpt-4-0613`. Please see the [codes](https://github.com/llm-jp/llm-leaderboard/tree/main) for details.
+
 | Model name | average | coding | extraction | humanities | math | reasoning | roleplay | stem | writing |
 | :--- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: |
 | [llm-jp-3-1.8b-instruct](https://huggingface.co/llm-jp/llm-jp-3-1.8b-instruct) | 4.93 | 1.50 | 4.70 | 7.80 | 1.55 | 2.60 | 7.80 | 6.10 | 7.40 |
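
For readers checking the table row in this hunk, the `average` column appears to be the unweighted mean of the eight category scores. The README does not say how it is computed, so treat that as an assumption. The sketch below recomputes it from the numbers in the row; the names `category_scores`, `reported_average`, and `mean_score` are illustrative and do not come from the llm-jp evaluation code.

```python
# Category scores copied from the llm-jp-3-1.8b-instruct row of the table.
category_scores = {
    "coding": 1.50,
    "extraction": 4.70,
    "humanities": 7.80,
    "math": 1.55,
    "reasoning": 2.60,
    "roleplay": 7.80,
    "stem": 6.10,
    "writing": 7.40,
}

# Value shown in the "average" column of the same row.
reported_average = 4.93


def mean_score(scores):
    """Return the unweighted mean of the per-category scores."""
    return sum(scores.values()) / len(scores)


computed = mean_score(category_scores)

# Assumption: the table rounds the mean to two decimal places.
print(f"computed mean: {computed:.2f}, reported: {reported_average:.2f}")
```

If the two printed values agree, the row is consistent with a simple mean over categories. A mismatch would suggest a different aggregation, such as averaging over individual questions or turns.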