nbeerbower's picture
Update README.md
5689f92 verified
|
raw
history blame
557 Bytes
---
license: apache-2.0
library_name: transformers
base_model:
- nbeerbower/mistral-nemo-bophades-12B
datasets:
- jondurbin/gutenberg-dpo-v0.1
---
# mistral-nemo-gutades-12B
[nbeerbower/mistral-nemo-bophades-12B](https://huggingface.co/nbeerbower/mistral-nemo-bophades-12B) finetuned on [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1).
### Method
ORPO finetuned using an RTX 3090 for 3 epochs.
[Fine-tune Llama 3 with ORPO](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html)