Uploaded model
- Developed by: Daemontatox
- License: apache-2.0
- Finetuned from model: NousResearch/Nous-Hermes-2-Mistral-7B-DPO
This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.
Model tree for Daemontatox/sentientdpo
Base model
mistralai/Mistral-7B-v0.1
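The card does not include a usage snippet, so the following is a minimal sketch of how the model might be loaded with the Hugging Face `transformers` library. It assumes that the `Daemontatox/sentientdpo` repository hosts full (merged) weights and a tokenizer loadable through `AutoModelForCausalLM` and `AutoTokenizer`; the card does not confirm this. The `generate` helper and its `max_new_tokens` default are hypothetical names chosen for illustration, and no prompt template is applied because the card does not specify one.

```python
# Repository name as listed in the model tree above.
MODEL_ID = "Daemontatox/sentientdpo"


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Load the model and return its completion for `prompt`.

    Assumption: the repository contains merged weights that `transformers`
    can load directly. If it only contains adapters, this will not work as is.
    """
    # Imported lazily so the heavy dependencies load only when generating.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")

    # Tokenize the raw prompt; no chat template is assumed here.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` downloads the weights on first use, which for a 7B-parameter model is a multi-gigabyte download, and runs on CPU unless the model and inputs are moved to another device.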