nicholasKluge commited on
Commit
936c943
1 Parent(s): 99bdcf7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -39,6 +39,8 @@ co2_eq_emissions:
39
 
40
  Large language models (LLMs) have significantly advanced natural language processing, but their progress has yet to be equal across languages. While most LLMs are trained in high-resource languages like English, multilingual models generally underperform monolingual ones. Additionally, aspects of their multilingual foundation sometimes restrict the byproducts they produce, like computational demands and licensing regimes. Hence, we developed the _TeenyTinyLlama_ pair: two compact models for Brazilian Portuguese text generation.
41
 
 
 
42
  ## Details
43
 
44
  - **Architecture:** a Transformer-based model pre-trained via causal language modeling
@@ -190,7 +192,7 @@ All the shown results are the higher accuracy scores achieved on the respective
190
  ```latex
191
 
192
  @misc{correa24ttllama,
193
- title = {TeenyTinyLlama: a pair of open-source tiny language models trained in Brazilian Portuguese},
194
  author = {Corr{\^e}a, Nicholas Kluge and Falk, Sophia and Fatimah, Shiza and Sen, Aniket and De Oliveira, Nythamar},
195
  journal={arXiv},
196
  year = {2024},
 
39
 
40
  Large language models (LLMs) have significantly advanced natural language processing, but their progress has yet to be equal across languages. While most LLMs are trained in high-resource languages like English, multilingual models generally underperform monolingual ones. Additionally, aspects of their multilingual foundation sometimes restrict the byproducts they produce, like computational demands and licensing regimes. Hence, we developed the _TeenyTinyLlama_ pair: two compact models for Brazilian Portuguese text generation.
41
 
42
+ Read our preprint on [ArXiv](xxx).
43
+
44
  ## Details
45
 
46
  - **Architecture:** a Transformer-based model pre-trained via causal language modeling
 
192
  ```latex
193
 
194
  @misc{correa24ttllama,
195
+ title = {TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese},
196
  author = {Corr{\^e}a, Nicholas Kluge and Falk, Sophia and Fatimah, Shiza and Sen, Aniket and De Oliveira, Nythamar},
197
  journal={arXiv},
198
  year = {2024},