teknium committed on
Commit 56cbbc6
1 Parent(s): 8eacbab

Update README.md

Files changed (1)
  1. README.md +6 -9
README.md CHANGED
@@ -84,19 +84,16 @@ You are to roleplay as Edward Elric from fullmetal alchemist. You are in the wor

  ## Benchmark Results

- Hermes 2 on Mistral-7B outperforms all Nous & Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.
+ Hermes 2.5 on Mistral-7B outperforms all Nous-Hermes & Open-Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.

- ### GPT4All:
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/VGTeKBp4v9ptXjeNZUClz.png)
+ ### GPT4All, Bigbench, TruthfulQA, and AGIEval Model Comparisons:

- ### AGIEval:
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Suf6uQC-PgaUYFuxfgFvY.png)
-
- ### BigBench:
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/UdYJA5dGuWQ5OMXD7fMU1.png)
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Kxq4BFEc-d1kSSiCIExua.png)

  ### Averages Compared:
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/rRYdGsMhFiszX7UVcllaB.png)
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Q9uexgcbTLcywlYBvORTs.png)
+

  GPT-4All Benchmark Set
  ```