swap-uniba
/

LLaMAntino-3-ANITA-8B-Inst-DPO-ITA

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

m-polignano-uniba commited on Apr 29, 2024

Commit

cb0ed1d

·

verified ·

1 Parent(s): 85e6c39

Update README.md

Files changed (1) hide show

README.md +14 -1

README.md CHANGED Viewed

@@ -239,4 +239,17 @@ The model is an instruction-tuned version of [**Meta-Llama-3-8b-instruct**](http
 This model aims to be the **multilingual base-model** to further fine-tune in the Italian environment.
-The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for the **ITA**lian language)* wants to provide Italian NLP researchers with an improved model the for Italian Language 🇮🇹 use cases.

 This model aims to be the **multilingual base-model** to further fine-tune in the Italian environment.
+The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for the **ITA**lian language)* wants to provide Italian NLP researchers with an improved model the for Italian Language 🇮🇹 use cases.
+<hr>
+**Model developers** Marco Polignano - University of Bari Aldo Moro, Italy
+**Variations** The model release has been **supervised fine-tuning (SFT)** using **QLoRA** in the 4bit version, on a long list of instruction-based datasets. **ORPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
+**Input** Models input text only.
+**Output** Models generate text and code only.
+**Model Architecture** *Llama 3 architecture*.