---
language:
- es
tags:
- text-generation
- bloom
license: "cc-by-sa-4.0"
---

# Bloom 1b1 for Spanish text generation

This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on Spanish datasets.

It achieves the following results on the evaluation set:

- Loss: 2.340

This model is under development. Use it with caution.

### Dataset Summary

The model was trained on the [Large Spanish Corpus](https://huggingface.co/datasets/large_spanish_corpus) and on a corpus of Spanish books crawled from the web and from torrents.

### Preprocessing

Preprocessing was performed with [spanish_nlp](https://github.com/jorgeortizfuentes/spanish_nlp).

### Licensing Information

The dataset is available under the [Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0)](https://creativecommons.org/licenses/by-sa/4.0/).

Some books may be subject to copyright. Use for academic purposes only.

### Citation Information

```
@misc {jorge_ortiz_fuentes_2023,
    author    = { {Jorge Ortiz Fuentes} },
    title     = { Bloom 1b1 for Spanish text generation },
    year      = 2023,
    url       = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish },
    doi       = { 10.57967/hf/0247 },
    publisher = { Hugging Face }
}
```
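
Because this card describes a fine-tuned causal language model, a short loading-and-generation sketch may be useful. The snippet below is an illustration, not the author's documented usage. It assumes the model is published on the Hub under the ID that appears in the citation URL (`jorgeortizfuentes/bloom-1b1-spanish`), that `transformers` and PyTorch are installed, and that the checkpoint loads with the generic `AutoTokenizer` and `AutoModelForCausalLM` classes. The helper name `generate` and the sampling settings (`top_p`, `max_new_tokens`) are hypothetical choices made for the example.

```python
# Hub ID taken from the citation URL in this card (assumed to be the model repo).
MODEL_ID = "jorgeortizfuentes/bloom-1b1-spanish"


def generate(prompt: str, max_new_tokens: int = 50) -> str:
    """Return a sampled continuation of `prompt`.

    Hypothetical helper for illustration; it downloads and loads the
    model on every call, so cache the objects for repeated use.
    """
    # Imported lazily so that defining this function does not load transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")

    # Sampling settings are illustrative assumptions, not values from this card.
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        top_p=0.95,
    )

    # Decode the first (and only) sequence, prompt included.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Había una vez")` should return the prompt followed by a Spanish continuation. Because sampling is enabled, the text differs between runs, and given the development status noted above, outputs should be reviewed before use.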