cosimoiaia committed
Commit 23d920e • 1 Parent: ce8dac3
Update README.md
README.md CHANGED
@@ -20,6 +20,9 @@ Model Card for Loquace-7B
 
 An exclusively Italian-speaking, instruction-finetuned Large Language Model. 🇮🇹
 
+The Loquace family of Italian LLM models was created as a proof of concept to evaluate how
+different model sizes can be fine-tuned using QLoRa on an instruction dataset in a specific language.
+
 ## Model Description
 
 Loquace-7B is the first 7B Italian Large Language Model trained using QLoRa on a large dataset of 102k question/answer pairs
@@ -57,7 +60,7 @@ model = LLaMAForCausalLM.from_pretrained(
 
 Loquace-7B was trained on a conversational dataset comprising 102k question/answer pairs in the Italian language.
 The training data was assembled from translations of the original Alpaca dataset and other sources such as the OpenAssistant dataset.
-The model was trained for only 3000 iterations and took
+The model was trained for only 3000 iterations and took 16 hours on a single RTX 3090, kindly provided by Genesis Cloud (https://gnsiscld.co/26qhlf).
 
 ## Limitations
 
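The change documents a QLoRa run: the base model's weights are loaded in 4-bit precision and kept frozen while small LoRA adapter matrices are trained, which is what makes fine-tuning a 7B model on a single RTX 3090 feasible. Below is a minimal sketch of that kind of setup using the Hugging Face `transformers`, `peft`, and `bitsandbytes` stack; the base checkpoint name and hyperparameters are illustrative assumptions, not the exact Loquace-7B configuration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Assumption: any LLaMA-7B checkpoint; the actual Loquace base model may differ.
base_model = "huggyllama/llama-7b"

# Quantize the frozen base weights to 4-bit NF4 (the "Q" in QLoRa).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Attach small trainable LoRA adapters on the attention projections;
# only these adapter weights receive gradients during fine-tuning.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of the 7B weights
```

Because only the adapters are trained against 4-bit frozen weights, peak memory stays within a 24 GB card, which is consistent with the 16-hour single-RTX 3090 run described in the commit.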