swap-uniba
/

LLaMAntino-3-ANITA-8B-Inst-DPO-ITA

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

m-polignano-uniba commited on Apr 29, 2024

Commit

6cab6c2

·

verified ·

1 Parent(s): bb8a7f9

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -245,11 +245,11 @@ The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for t
 ## Specifications
-- **Model developers** Marco Polignano - University of Bari Aldo Moro, Italy
-- **Variations** The model release has been **supervised fine-tuning (SFT)** using **QLoRA** in the 4bit version, on a long list of instruction-based datasets. **ORPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
-- **Input** Models input text only.
-- **Output** Models generate text and code only.
-- **Model Architecture** *Llama 3 architecture*.
 - **Context length**: 8K, 8192.
 <hr>

 ## Specifications
+- **Model developers**: Ph.D. Marco Polignano - University of Bari Aldo Moro, Italy
+- **Variations**: The model release has been **supervised fine-tuning (SFT)** using **QLoRA** in the 4bit version, on a long list of instruction-based datasets. **ORPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
+- **Input**: Models input text only.
+- **Output**: Models generate text and code only.
+- **Model Architecture**: *Llama 3 architecture*.
 - **Context length**: 8K, 8192.
 <hr>