File size: 1,253 Bytes
6ec428f 3701452 6ec428f f5a6577 6ec428f 3701452 6ec428f 3701452 f5a6577 3701452 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 |
---
language:
- es
tags:
- text-generation
- bloom
license: "cc-by-sa-4.0"
---
# Bloom 1b1 for Spanish text generation
This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on Spanish datasets.
It achieves the following results on the evaluation set:
- Loss: 2.340
Model under development. Use with caution.
### Dataset Summary
Model trained with [Large Spanish Corpus](https://huggingface.co/datasets/large_spanish_corpus) and a Spanish books corpus crawled from web and torrents.
### Preprocessing
Preprocessing performed by [spanish_nlp](https://github.com/jorgeortizfuentes/spanish_nlp).
### Licensing Information
The dataset is available under the [Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0)](https://creativecommons.org/licenses/by-sa/4.0/).
Some books may be subject to copyright. Use for academic purposes only.
### Citation Information
```
@misc {jorge_ortiz_fuentes_2023,
author = { {Jorge Ortiz Fuentes} },
title = { Bloom 1b1 for Spanish text generation },
year = 2023,
url = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish },
doi = { 10.57967/hf/0247 },
publisher = { Hugging Face }
}
``` |