nicholasKluge
committed on
Commit b2032d2
1 Parent(s): 74aca8c
Update README.md
README.md
CHANGED
@@ -63,8 +63,7 @@ Check our gradio-demo in [Spaces](https://huggingface.co/spaces/nicholasKluge/Ai
 - **Emissions:** 0.25 KgCO2 (Singapore)
 - **Total Energy Consumption:** 0.52 kWh
 
-This repository has the [
-
+This repository has the [source code](https://github.com/Nkluge-correa/Aira) used to train this model.
 ## Usage
 
 Three special tokens are used to mark the user side of the interaction and the model's response:
@@ -131,7 +130,7 @@ The model will output something like:
 | [Aira-OPT-1B3](https://huggingface.co/nicholasKluge/Aira-OPT-1B3) | **43.90** | 28.41 | **46.59** | **56.70** | | |
 | OPT-1.3b | 40.91 | **29.69** | 38.68 | 54.36 | | |
 
-* Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)).
+* Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)).
 
 ## Cite as 🤗
 