ringoreality
commited on
Commit
•
b792fa5
1
Parent(s):
6f66d33
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,49 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Model Card for climategpt/climategpt-70b
|
2 |
+
- This model is the 70B parameter variant of the ClimateGPT model release.
|
3 |
+
- Starting from Llama2 70B weights, the model undergoes continued pretraining and instruction finetuning on climate data.
|
4 |
+
- The model is capable of answering questions and following instructions, especially tailored for the climate domain.
|
5 |
+
|
6 |
+
## Overview
|
7 |
+
- **Developed by:** AppTek, Eqtylab, Erasmus AI
|
8 |
+
- **Model type:** decoder-only Transformer
|
9 |
+
- **Language(s) (NLP):** natively supported: English; supported via cascaded MT on web interface: Arabic, Bangla, Chinese (simplified), Dutch, Finnougoric, French, Germanic, Greek, Hebrew, Indonesian, Japenese, Korean, Lithuanian, Pashto, Persian, Portuguese, Russian, Spanish, Thai, Turkish, Vietnamese,
|
10 |
+
- **License:** TO BE ADDED
|
11 |
+
- **Finetuned from model:** Llama2 70B
|
12 |
+
- **Repository:** https://huggingface.co/climategpt/climategpt-70b
|
13 |
+
- **Paper:** TO BE ADDED
|
14 |
+
- **Demo:** TO BE ADDED
|
15 |
+
|
16 |
+
## Uses
|
17 |
+
- This model is intended to be directly used as a question answering model that is specialized in the climate domain.
|
18 |
+
- The model is aimed at providing useful feedback for decision makers, scientists and jounalists involved in climate discussions.
|
19 |
+
- The model can also be used as a starting point for interested developers for further finetuning.
|
20 |
+
- The model is NOT intended to be a general-purpose chatbot (although it has chat capabilities).
|
21 |
+
- For the full system including cascaded MT, RAG, etc., we recommend the user to go to our demo website: TO BE ADDED.
|
22 |
+
- For hands-on finetuning deployment and inference, we recommend the user to directly use the Huggingface helpers.
|
23 |
+
- For in-depth model conversion and finetuning, we recommend the user to use https://github.com/epfLLM/Megatron-LLM/.
|
24 |
+
- **Despite the efforts from the development team to elimite them, as every other chat-capable LLMs, this model may generate biased, offensive, inaccurate responses.**
|
25 |
+
|
26 |
+
## How to Get Started with the Model
|
27 |
+
After downloading the HF formatted model, the HF helpers should work out-of-the-box.
|
28 |
+
For example, it is possible to evaluate the model with https://github.com/EleutherAI/lm-evaluation-harness by plugging in the model identifier ```--model_args pretrained=climategpt/climategpt-70b```.
|
29 |
+
|
30 |
+
## Training
|
31 |
+
- For the Llama2 training data, we refer the user to https://huggingface.co/meta-llama/Llama-2-70b-chat-hf.
|
32 |
+
- For continued pretraining, 4.2B climate domain tokens (tokenized by the Llama tokenizer) are used.
|
33 |
+
- For instruction finetuning, about 579K instruction-completion pairs (both in the climate domain but also general domain) are used.
|
34 |
+
|
35 |
+
## Evaluation
|
36 |
+
Automatic evaluation is done via https://github.com/EleutherAI/lm-evaluation-harness, into which we also implemented custom evaluation tasks.
|
37 |
+
TO BE ADDED
|
38 |
+
We also perform human evaluation with experts in the climate domain.
|
39 |
+
TO BE ADDED
|
40 |
+
|
41 |
+
## Environmental Impact
|
42 |
+
- **Hardware Type:** H100
|
43 |
+
- **Hours used:** 2300 hrs
|
44 |
+
- **Cloud Provider:** TO BE ADDED
|
45 |
+
- **Compute Region:** TO BE ADDED
|
46 |
+
- **Carbon Emitted:** TO BE ADDED
|
47 |
+
|
48 |
+
## Citation
|
49 |
+
**BibTeX:** TO BE ADDED
|