metadata
language:
- en
- es
license: mit
tags:
- text-generation-inference
- transformers
- unsloth
- mistral
- trl
base_model: BarraHome/zephyr-dpo-4bit
datasets:
- jondurbin/truthy-dpo-v0.1
- BarraHome/ultrafeedback_binarized
library_name: transformers
pipeline_tag: text-classification
Uploaded model
- Developed by: BarraHome
- License: apache-2.0
- Finetuned from model : BarraHome/zephyr-dpo-4bit
This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.