Update README.md
README.md
CHANGED
@@ -144,18 +144,18 @@ Below are comparisons of this model with other models in the 7B regime.
 | Model | Params | Tokens | Open dataset? | CORE | MMLU | EXTENDED |
 |---------------|--------|--------|---------------|----------|----------|----------|
 | **Open weights, closed datasets** | | | | | | |
-| Llama2 | 7B | 2T |
-| DeepSeek | 7B | 2T |
-| Mistral-0.3 | 7B | ? |
-| QWEN-2 | 7B | ? |
-| Llama3 | 8B | 15T |
-| Gemma | 8B | 6T |
-| Phi-3 | 7B | ? |
+| Llama2 | 7B | 2T | ✗ | 49.2 | 45.8 | 34.1 |
+| DeepSeek | 7B | 2T | ✗ | 50.7 | 48.5 | 35.3 |
+| Mistral-0.3 | 7B | ? | ✗ | 57.0 | 62.7 | 45.1 |
+| QWEN-2 | 7B | ? | ✗ | 57.5 | **71.9** | 50.5 |
+| Llama3 | 8B | 15T | ✗ | 57.6 | 66.2 | 46.3 |
+| Gemma | 8B | 6T | ✗ | 57.8 | 64.3 | 44.6 |
+| Phi-3 | 7B | ? | ✗ | **61.0** | 69.9 | **57.9** |
 | **Open weights, open datasets** | | | | | | |
-| Falcon | 7B | 1T |
-| OLMo-1.7 | 7B | 2.1T |
-| MAP-Neo | 7B | 4.5T |
-| **DCLM-7B-8k** | 7B | 2.5T |
+| Falcon | 7B | 1T | ✓ | 44.1 | 27.4 | 25.1 |
+| OLMo-1.7 | 7B | 2.1T | ✓ | 47.0 | 54.0 | 34.2 |
+| MAP-Neo | 7B | 4.5T | ✓ | **50.2** | **57.1** | **40.4** |
+| **DCLM-7B-8k** | 7B | 2.5T | ✓ | **57.1** | **63.7** | **45.4** |