Edit model card

Solarized-18B-truthy

Solarized-18B-dpo fine-tuned to improve truthfulness.

It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.

image/png

Downloads last month
432
Safetensors
Model size
17.9B params
Tensor type
F32
·
FP16
·
I8
·
Inference API
Input a message to start chatting with vicgalle/solarized-18B-truthy.
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Dataset used to train vicgalle/solarized-18B-truthy