Update README.md
Browse files
README.md
CHANGED
@@ -7,10 +7,10 @@ tags:
|
|
7 |
- climate
|
8 |
---
|
9 |
|
10 |
-
# CatastroBERT a model for Extreme weather events detection in
|
11 |
|
12 |
This model aims to facilitate the detection of paragraphs or articles relevant to extreme weather events
|
13 |
-
in
|
14 |
|
15 |
<div align=center>
|
16 |
<img src="images/bert_illustration.png" width="500" height="500" />
|
@@ -22,7 +22,7 @@ in french text. It is based on the [camembert-base](https://huggingface.co/camem
|
|
22 |
- **Language(s) (NLP):** French
|
23 |
- **Finetuned from model :** [camembert-base](https://huggingface.co/camembert-base) (RoBERTa Checkpoint)
|
24 |
|
25 |
-
- **Repository:** Check the [CatastroBERT](https://github.com/dh-epfl-students/dhlab-CatastroBERT)
|
26 |
|
27 |
## Usage
|
28 |
|
@@ -71,7 +71,7 @@ print(f"Prediction: {predict(text)}")
|
|
71 |
|
72 |
### Training Data
|
73 |
|
74 |
-
This model was trained on manually a manually
|
75 |
|
76 |
## Environmental Impact
|
77 |
|
|
|
7 |
- climate
|
8 |
---
|
9 |
|
10 |
+
# CatastroBERT a model for Extreme weather events detection in French text
|
11 |
|
12 |
This model aims to facilitate the detection of paragraphs or articles relevant to extreme weather events
|
13 |
+
in French text. It is based on the [camembert-base](https://huggingface.co/camembert-base) model and was trained on manually annotated data (articles summaries) from the Gazette de Lausanne archives collected by [impresso](https://impresso-project.ch/)
|
14 |
|
15 |
<div align=center>
|
16 |
<img src="images/bert_illustration.png" width="500" height="500" />
|
|
|
22 |
- **Language(s) (NLP):** French
|
23 |
- **Finetuned from model :** [camembert-base](https://huggingface.co/camembert-base) (RoBERTa Checkpoint)
|
24 |
|
25 |
+
- **Repository:** Check the [CatastroBERT](https://github.com/dh-epfl-students/dhlab-CatastroBERT) GitHub page for more usage examples and information.
|
26 |
|
27 |
## Usage
|
28 |
|
|
|
71 |
|
72 |
### Training Data
|
73 |
|
74 |
+
This model was trained on manually a manually annotated dataset (articles summaries) curated from the Gazette de Lausanne archives collected by the [impresso](https://impresso-project.ch/) project. The dataset is composed of 4500 articles summaries of which 3500 were used for training and 1000 for validation.
|
75 |
|
76 |
## Environmental Impact
|
77 |
|