m-polignano-uniba
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -243,13 +243,18 @@ The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for t
|
|
243 |
|
244 |
<hr>
|
245 |
|
246 |
-
|
247 |
-
|
248 |
-
|
249 |
-
|
250 |
-
**Input** Models input text only.
|
|
|
|
|
|
|
|
|
251 |
|
252 |
-
|
253 |
|
254 |
-
|
255 |
|
|
|
|
243 |
|
244 |
<hr>
|
245 |
|
246 |
+
## Specifications
|
247 |
+
|
248 |
+
- **Model developers** Marco Polignano - University of Bari Aldo Moro, Italy
|
249 |
+
- **Variations** The model release has been **supervised fine-tuning (SFT)** using **QLoRA** in the 4bit version, on a long list of instruction-based datasets. **ORPO** approach over the *mlabonne/orpo-dpo-mix-40k* dataset is used to align with human preferences for helpfulness and safety.
|
250 |
+
- **Input** Models input text only.
|
251 |
+
- **Output** Models generate text and code only.
|
252 |
+
- **Model Architecture** *Llama 3 architecture*.
|
253 |
+
- **Context length**: 8K, 8192.
|
254 |
+
<hr>
|
255 |
|
256 |
+
#### Unsloth
|
257 |
|
258 |
+
<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/made with unsloth.png" width="200px" align="center" />
|
259 |
|
260 |
+
[Unsloth](https://unsloth.ai), a great tool that helps us easily develop products, at a lower cost than expected.
|