bloom-1b1-spanish / README.md
jorgeortizfuentes's picture
Update README.md
f5a6577
---
language:
- es
tags:
- text-generation
- bloom
license: "cc-by-sa-4.0"
---
# Bloom 1b1 for Spanish text generation
This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on Spanish datasets.
It achieves the following results on the evaluation set:
- Loss: 2.340
Model under development. Use with caution.
### Dataset Summary
Model trained with [Large Spanish Corpus](https://huggingface.co/datasets/large_spanish_corpus) and a Spanish books corpus crawled from web and torrents.
### Preprocessing
Preprocessing performed by [spanish_nlp](https://github.com/jorgeortizfuentes/spanish_nlp).
### Licensing Information
The dataset is available under the [Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0)](https://creativecommons.org/licenses/by-sa/4.0/).
Some books may be subject to copyright. Use for academic purposes only.
### Citation Information
```
@misc {jorge_ortiz_fuentes_2023,
author = { {Jorge Ortiz Fuentes} },
title = { Bloom 1b1 for Spanish text generation },
year = 2023,
url = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish },
doi = { 10.57967/hf/0247 },
publisher = { Hugging Face }
}
```