A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k

Downloads last month: 1

Inference Examples

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for eren23/DPOMixLLama-3-8B-lora

Base model

meta-llama/Meta-Llama-3-8B-Instruct

Adapter

(667)

this model

eren23
/

DPOMixLLama-3-8B-lora

Model tree for eren23/DPOMixLLama-3-8B-lora

Dataset used to train eren23/DPOMixLLama-3-8B-lora