nicholasKluge committed
Commit 55eaa24
1 Parent(s): c66cfee

Update README.md

Files changed (1): README.md (+12 -4)
README.md CHANGED
@@ -33,7 +33,7 @@ co2_eq_emissions:
 ---
 # TeenyTinyLlama-460m-awq
 
-<img src="./logo.png" alt="A curious llama exploring a mushroom forest." height="200">
+<img src="../../img/460m-llama.png" alt="A curious llama exploring a mushroom forest." height="200">
 
 ## Model Summary
 
@@ -41,7 +41,7 @@ co2_eq_emissions:
 
 Large language models (LLMs) have significantly advanced natural language processing, but their progress has not been equal across languages. While most LLMs are trained in high-resource languages like English, multilingual models generally underperform monolingual ones. Additionally, their multilingual foundations often impose practical constraints, such as higher computational demands and restrictive licensing regimes. Hence, we developed the _TeenyTinyLlama_ pair: two compact models for Brazilian Portuguese text generation.
 
-Read our preprint on [ArXiv](https://arxiv.org/abs/2401.16640).
+Read our article on [ScienceDirect](https://www.sciencedirect.com/science/article/pii/S2666827024000343).
 
 ## Details
 
@@ -223,7 +223,6 @@ All the shown results are the highest accuracy scores achieved on the respective
 ## Cite as 🤗
 
 ```latex
-
 @misc{correa24ttllama,
   title = {TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese},
   author = {Corr{\^e}a, Nicholas Kluge and Falk, Sophia and Fatimah, Shiza and Sen, Aniket and De Oliveira, Nythamar},
@@ -231,6 +230,15 @@ All the shown results are the highest accuracy scores achieved on the respective
   year = {2024}
 }
 
+@misc{correa24ttllama,
+  doi = {10.1016/j.mlwa.2024.100558},
+  url = {https://www.sciencedirect.com/science/article/pii/S2666827024000343},
+  title = {TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese},
+  author = {Corr{\^e}a, Nicholas Kluge and Falk, Sophia and Fatimah, Shiza and Sen, Aniket and De Oliveira, Nythamar},
+  journal = {Machine Learning With Applications},
+  publisher = {Elsevier},
+  year = {2024}
+}
 ```
 
 ## Funding
@@ -239,4 +247,4 @@ This repository was built as part of the RAIES ([Rede de Inteligência Artificial
 
 ## License
 
-TeenyTinyLlama-460m is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
+TeenyTinyLlama-460m is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
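The hunks above only touch the card's prose and citation, so a short usage sketch may help readers try the AWQ-quantized checkpoint this card describes. This is a minimal sketch, not part of the commit: the repo id `nicholasKluge/TeenyTinyLlama-460m-awq` is inferred from the author and model name, and it assumes `transformers`, `accelerate`, and `autoawq` are installed on a CUDA machine.

```python
# Minimal sketch (assumptions noted above): load the AWQ-quantized checkpoint
# through 🤗 Transformers, which hands dequantization to the installed autoawq
# backend, and generate a short Brazilian Portuguese continuation.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "nicholasKluge/TeenyTinyLlama-460m-awq"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")

inputs = tokenizer("A capital do Brasil é", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=40, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Since AWQ stores 4-bit weights and dequantizes on the fly, a 460m-parameter model should fit in well under a gigabyte of GPU memory; if loading through Transformers fails, the standalone `awq` package's `AutoAWQForCausalLM.from_quantized` is an alternative entry point.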