nicholasKluge committed
Commit 2d6191a
1 Parent(s): 12ae98e
Update README.md
README.md CHANGED
@@ -115,10 +115,11 @@ The model will output something like:
 
 ## Evaluation
 
-| Model|Average|[ARC](https://arxiv.org/abs/1803.05457)|[TruthfulQA](https://arxiv.org/abs/2109.07958)|[ToxiGen](https://arxiv.org/abs/2203.09509)|
-
-| [Aira-2-1B1](https://huggingface.co/nicholasKluge/Aira-2-1B1)
-| TinyLlama-1.1B-intermediate-step-480k-1T
+| Model (TinyLlama) | Average | [ARC](https://arxiv.org/abs/1803.05457) | [TruthfulQA](https://arxiv.org/abs/2109.07958) | [ToxiGen](https://arxiv.org/abs/2203.09509) |
+|---------------------------------------------------------------|-----------|-----------------------------------------|------------------------------------------------|---------------------------------------------|
+| [Aira-2-1B1](https://huggingface.co/nicholasKluge/Aira-2-1B1) | **42.55** | 25.26 | **50.81** | **51.59** |
+| TinyLlama-1.1B-intermediate-step-480k-1T | 37.52 | **30.89** | 39.55 | 42.13 |
+
 
 * Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)). The notebook used to make these evaluations is available in the [this repo](lm_evaluation_harness.ipynb).
 
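
Reviewer note: the README does not say how the Average column is computed. The short Python sketch below checks one plausible reading, namely that it is the unweighted mean of the ARC, TruthfulQA, and ToxiGen scores, rounded to two decimals. That reading is an assumption inferred from the numbers in the added table, and the names `scores`, `reported_average`, and `mean_score` are hypothetical helpers for this check only.

```python
# Scores copied from the table rows added in this commit.
scores = {
    "Aira-2-1B1": {"ARC": 25.26, "TruthfulQA": 50.81, "ToxiGen": 51.59},
    "TinyLlama-1.1B-intermediate-step-480k-1T": {
        "ARC": 30.89,
        "TruthfulQA": 39.55,
        "ToxiGen": 42.13,
    },
}

# Values of the "Average" column as reported in the same table.
reported_average = {
    "Aira-2-1B1": 42.55,
    "TinyLlama-1.1B-intermediate-step-480k-1T": 37.52,
}


def mean_score(benchmarks: dict) -> float:
    """Unweighted mean of the benchmark scores, rounded to two decimals.

    Assumption: this is how the Average column was derived; the README
    itself does not state the formula.
    """
    return round(sum(benchmarks.values()) / len(benchmarks), 2)


for model, benchmarks in scores.items():
    computed = mean_score(benchmarks)
    matches = abs(computed - reported_average[model]) < 1e-9
    print(f"{model}: computed={computed:.2f} "
          f"reported={reported_average[model]:.2f} match={matches}")
```

If both rows report a match, the Average column is consistent with a simple mean over the three benchmarks.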