nicholasKluge
/

Aira-2-1B1

Text Generation

instruction tuned

text generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nicholasKluge commited on Oct 30, 2023

Commit

2d6191a

•

1 Parent(s): 12ae98e

Update README.md

Files changed (1) hide show

README.md +5 -4

README.md CHANGED Viewed

@@ -115,10 +115,11 @@ The model will output something like:
 ## Evaluation
-| Model|Average|[ARC](https://arxiv.org/abs/1803.05457)|[TruthfulQA](https://arxiv.org/abs/2109.07958)|[ToxiGen](https://arxiv.org/abs/2203.09509)|
-|---|---|---|---|---|
-| [Aira-2-1B1](https://huggingface.co/nicholasKluge/Aira-2-1B1) |**42.55**|25.26|**50.81**|**51.59**|
-| TinyLlama-1.1B-intermediate-step-480k-1T | 37.52 | **30.89** | 39.55 | 42.13 |
 * Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)). The notebook used to make these evaluations is available in the [this repo](lm_evaluation_harness.ipynb).

 ## Evaluation
+| Model (TinyLlama)                                             | Average   | [ARC](https://arxiv.org/abs/1803.05457) | [TruthfulQA](https://arxiv.org/abs/2109.07958) | [ToxiGen](https://arxiv.org/abs/2203.09509) |
+|---------------------------------------------------------------|-----------|-----------------------------------------|------------------------------------------------|---------------------------------------------|
+| [Aira-2-1B1](https://huggingface.co/nicholasKluge/Aira-2-1B1) | **42.55** | 25.26                                   | **50.81**                                      | **51.59**                                   |
+| TinyLlama-1.1B-intermediate-step-480k-1T                      | 37.52     | **30.89**                               | 39.55                                          | 42.13                                       |
 * Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)). The notebook used to make these evaluations is available in the [this repo](lm_evaluation_harness.ipynb).