victormiller
commited on
Commit
•
42657ea
1
Parent(s):
35fd7c6
Update README.md
Browse files
README.md
CHANGED
@@ -32,13 +32,12 @@ The LLM360 Performance and Evaluation Collection is a robust evaluations set con
|
|
32 |
Evaluations include standard best practice benchmarks, medical, math, and coding knowledge. More about the evaluations can be found [here](llm360.ai/evaluations).
|
33 |
|
34 |
|
35 |
-
|
36 |
|
37 |
|
38 |
Detailed analysis can be found on the K2 Weights and Biases project [here](wandb.ai)
|
39 |
|
40 |
|
41 |
-
|
42 |
view the prompt gallery here - Detailed analysis can be found on the K2 Weights and Biases project [here](wandb.ai)
|
43 |
|
44 |
|
|
|
32 |
Evaluations include standard best practice benchmarks, medical, math, and coding knowledge. More about the evaluations can be found [here](llm360.ai/evaluations).
|
33 |
|
34 |
|
35 |
+
<center><img src="k2_table_of_tables.png" alt="k2 big eval table"/></center>
|
36 |
|
37 |
|
38 |
Detailed analysis can be found on the K2 Weights and Biases project [here](wandb.ai)
|
39 |
|
40 |
|
|
|
41 |
view the prompt gallery here - Detailed analysis can be found on the K2 Weights and Biases project [here](wandb.ai)
|
42 |
|
43 |
|