---
language: fr
license: mit
datasets:
- oscar
---

# Based on the Original Model:
## almanach/camembert-base:
### "A Tasty French Language Model"
## Link: https://huggingface.co/almanach/camembert-base




## Pre-trained models by almanach/camembert-base
### https://huggingface.co/almanach/camembert-base/blob/main/README.md#pre-trained-models

| Model                                    | #params | Arch. | Training data                     |
|------------------------------------------|---------|-------|-----------------------------------|
| `camembert-base`                         | 110M    | Base  | OSCAR (138 GB of text)            |
| `camembert/camembert-large`              | 335M    | Large | CCNet (135 GB of text)            |
| `camembert/camembert-base-ccnet`         | 110M    | Base  | CCNet (135 GB of text)            |
| `camembert/camembert-base-wikipedia-4gb` | 110M    | Base  | Wikipedia (4 GB of text)          |
| `camembert/camembert-base-oscar-4gb`     | 110M    | Base  | Subsample of OSCAR (4 GB of text) |
| `camembert/camembert-base-ccnet-4gb`     | 110M    | Base  | Subsample of CCNet (4 GB of text) |
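
As a rough illustration of how one of the checkpoints listed above might be loaded, here is a minimal sketch using the `transformers` Auto classes. The dictionary `CAMEMBERT_CHECKPOINTS` and the helper `load_camembert` are hypothetical names introduced only for this example. The sketch assumes that `transformers`, `torch`, and `sentencepiece` are installed and that the identifiers in the table still resolve on the Hugging Face Hub (the base model is linked above under `almanach/camembert-base`).

```python
# Checkpoint identifiers, parameter counts and training data, copied from the table above.
CAMEMBERT_CHECKPOINTS = {
    "camembert-base": ("110M", "OSCAR (138 GB of text)"),
    "camembert/camembert-large": ("335M", "CCNet (135 GB of text)"),
    "camembert/camembert-base-ccnet": ("110M", "CCNet (135 GB of text)"),
    "camembert/camembert-base-wikipedia-4gb": ("110M", "Wikipedia (4 GB of text)"),
    "camembert/camembert-base-oscar-4gb": ("110M", "Subsample of OSCAR (4 GB of text)"),
    "camembert/camembert-base-ccnet-4gb": ("110M", "Subsample of CCNet (4 GB of text)"),
}


def load_camembert(model_id="camembert-base"):
    """Hypothetical helper: return (tokenizer, masked-LM model) for a listed checkpoint."""
    if model_id not in CAMEMBERT_CHECKPOINTS:
        raise ValueError(f"Unknown checkpoint: {model_id!r}")
    # Imported lazily so the table can be inspected without transformers installed.
    from transformers import AutoModelForMaskedLM, AutoTokenizer

    # Both calls download files from the Hub on first use (assumes network access).
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMaskedLM.from_pretrained(model_id)
    return tokenizer, model
```

Calling `load_camembert("camembert-base")` should return a tokenizer and a masked-language model that can be used with `<mask>` tokens in French text. Whether the same loading code suits a fine-tuned derivative depends on the head that was saved with it.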


## Fine-Tuning by MisterAI/ALMANACH_CamemBERT_Agent001
This is only a test of training/fine-tuning for now >:)

| Model                                                              | #params | Arch. | Training data                                |
|--------------------------------------------------------------------|---------|-------|----------------------------------------------|
| `MisterAI/ALMANACH_CamemBERT_Agent001` (based on `camembert-base`) | 110M    | Base  | MisterAI/SimpleSmallFrenchQA (50 KB of text) |




************
If you use CamemBERT, please cite the original paper:

@inproceedings{martin2020camembert,
  title={CamemBERT: a Tasty French Language Model},
  author={Martin, Louis and Muller, Benjamin and Su{\'a}rez, Pedro Javier Ortiz and Dupont, Yoann and Romary, Laurent and de la Clergerie, {\'E}ric Villemonte and Seddah, Djam{\'e} and Sagot, Beno{\^\i}t},
  booktitle={Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics},
  year={2020}
}

*************