|
--- |
|
language: |
|
- es |
|
tags: |
|
- text-generation |
|
- bloom |
|
license: "cc-by-sa-4.0" |
|
--- |
|
|
|
# Bloom 1b1 for Spanish text generation |
|
|
|
This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on Spanish datasets. |
|
It achieves the following results on the evaluation set: |
|
- Loss: 2.340 |
|
|
|
Model under development. Use with caution. |
|
|
|
|
|
### Dataset Summary |
|
|
|
Model trained with [Large Spanish Corpus](https://huggingface.co/datasets/large_spanish_corpus) and a Spanish books corpus crawled from web and torrents. |
|
|
|
### Preprocessing |
|
|
|
Preprocessing performed by [spanish_nlp](https://github.com/jorgeortizfuentes/spanish_nlp). |
|
|
|
### Licensing Information |
|
|
|
The dataset is available under the [Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0)](https://creativecommons.org/licenses/by-sa/4.0/). |
|
|
|
Some books may be subject to copyright. Use for academic purposes only. |
|
|
|
### Citation Information |
|
|
|
``` |
|
@misc {jorge_ortiz_fuentes_2023, |
|
author = { {Jorge Ortiz Fuentes} }, |
|
title = { Bloom 1b1 for Spanish text generation }, |
|
year = 2023, |
|
url = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish }, |
|
doi = { 10.57967/hf/0247 }, |
|
publisher = { Hugging Face } |
|
} |
|
``` |