swap-uniba
/

LLaMAntino-3-ANITA-8B-Inst-DPO-ITA

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

m-polignano-uniba commited on Apr 29, 2024

Commit

7c2db1a

·

verified ·

1 Parent(s): cb0ed1d

Update README.md

Files changed (1) hide show

README.md +12 -7

README.md CHANGED Viewed

@@ -243,13 +243,18 @@ The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for t
 <hr>
-**Model developers** Marco Polignano - University of Bari Aldo Moro, Italy
-**Variations** The model release has been **supervised fine-tuning (SFT)** using **QLoRA** in the 4bit version, on a long list of instruction-based datasets. **ORPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
-**Input** Models input text only.
-**Output** Models generate text and code only.
-**Model Architecture** *Llama 3 architecture*.

 <hr>
+## Specifications
+- **Model developers** Marco Polignano - University of Bari Aldo Moro, Italy
+- **Variations** The model release has been **supervised fine-tuning (SFT)** using **QLoRA** in the 4bit version, on a long list of instruction-based datasets. **ORPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
+- **Input** Models input text only.
+- **Output** Models generate text and code only.
+- **Model Architecture** *Llama 3 architecture*.
+- **Context length**: 8K, 8192.
+<hr>
+#### Unsloth
+<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/made with unsloth.png" width="200px" align="center" />
+[Unsloth](https://unsloth.ai), a great tool that helps us easily develop products, at a lower cost than expected.