bleysg commited on
Commit
a4b1e76
1 Parent(s): 0f6c100

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -151,6 +151,13 @@ We gain a slight edge over our previous releases, again topping the leaderboard,
151
 
152
  ![GPT4ALL Performance](https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca/resolve/main/Images/MistralOrca7BGPT4ALL.png "GPT4ALL Performance")
153
 
 
 
 
 
 
 
 
154
 
155
  # Dataset
156
 
 
151
 
152
  ![GPT4ALL Performance](https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca/resolve/main/Images/MistralOrca7BGPT4ALL.png "GPT4ALL Performance")
153
 
154
+ ## MT-Bench Performance
155
+
156
+ MT-Bench uses GPT-4 as a judge of model response quality, across a wide range of challenges.
157
+ We find our performance is *on-par with `Llama2-70b-chat`*, averaging **6.86**.
158
+
159
+ ![MT-Bench Performance](https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca/resolve/main/Images/MistralOrca7BMTBENCH.png "MT-Bench Performance")
160
+
161
 
162
  # Dataset
163