m-polignano-uniba committed: Update README.md
<hr>

<!--<img src="https://i.ibb.co/6mHSRm3/llamantino53.jpg" width="200"/>-->

<p style="text-align:justify;">**LLaMAntino-3-ANITA-8B-Instr-DPO-ITA** is a model of the [**LLaMAntino**](https://huggingface.co/swap-uniba) *Large Language Models* family.
The model is an instruction-tuned version of [**Meta-Llama-3-8b-instruct**](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) (a fine-tuned **LLaMA 3 model**).
This model version aims to be a **Multilingual Model** 🌍 -- EN 🇺🇸 + ITA 🇮🇹 -- suitable for further fine-tuning on specific Italian tasks.</p>

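Because the model is an instruction-tuned version of Meta-Llama-3-8B-Instruct, it inherits the standard Llama 3 instruct chat format. The sketch below only illustrates that prompt layout; in real use, prefer the tokenizer's `apply_chat_template` from the `transformers` library, and note that the system/user strings here are made-up examples, not from this card.

```python
# Illustration of the Llama 3 instruct prompt layout inherited from
# Meta-Llama-3-8B-Instruct. For real inference, use the tokenizer's
# apply_chat_template() instead of assembling strings by hand.

def build_llama3_prompt(system: str, user: str) -> str:
    """Assemble a single-turn Llama 3 chat prompt as one string."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

# Example with an Italian system prompt ("You are a helpful assistant
# that answers in Italian." / "What is the capital of Italy?").
prompt = build_llama3_prompt(
    "Sei un assistente utile che risponde in italiano.",
    "Qual e' la capitale d'Italia?",
)
```

The generation loop then decodes tokens after the final `assistant` header until the model emits `<|eot_id|>`.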
The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for the **ITA**lian language)* wants to provide Italian NLP researchers with an improved model for the Italian language.

- **Model developers**: Ph.D. Marco Polignano - University of Bari Aldo Moro, Italy - SWAP Research Group
- **Variations**: The model has been **supervised fine-tuned (SFT)** using **QLoRA** (4-bit) on two instruction-based datasets. A **DPO** step over the *jondurbin/truthy-dpo-v0.1* dataset aligns it with human preferences for helpfulness and safety.
- **Input**: The model takes text input only.
- **Language**: Multilingual 🌍 + Italian 🇮🇹
- **Output**: The model generates text and code only.
- **Model Architecture**: *Llama 3 architecture*.
- **Context length**: 8K tokens (8,192).
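The DPO step mentioned above optimizes a preference loss over chosen/rejected answer pairs, such as those in *truthy-dpo-v0.1*. The following is a minimal numeric sketch of the standard DPO objective with made-up log-probability values, not the project's actual training code (which in practice would use a trainer such as TRL's `DPOTrainer`):

```python
import math

# Sketch of the standard DPO (Direct Preference Optimization) loss:
#   -log(sigmoid(beta * ((logp_w - ref_w) - (logp_l - ref_l))))
# where logp_* come from the policy and ref_* from the frozen reference
# model, for the chosen (w) and rejected (l) answers of a preference pair.
# The log-probabilities below are illustrative values only.

def dpo_loss(logp_chosen: float, ref_chosen: float,
             logp_rejected: float, ref_rejected: float,
             beta: float = 0.1) -> float:
    margin = beta * ((logp_chosen - ref_chosen)
                     - (logp_rejected - ref_rejected))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# A policy that favors the chosen answer (relative to the reference)
# incurs a lower loss than one that favors the rejected answer.
good = dpo_loss(-10.0, -12.0, -15.0, -13.0)  # chosen preferred
bad = dpo_loss(-15.0, -13.0, -10.0, -12.0)   # rejected preferred
```

Minimizing this loss pushes the policy to increase the chosen answer's likelihood relative to the rejected one, while `beta` controls how far it may drift from the reference model.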