QuantFactory/ArliAI-Llama-3-8B-Instruct-ORPO-v0.1-GGUF

This is a quantized version of OwenArli/ArliAI-Llama-3-8B-Instruct-ORPO-v0.1, created using llama.cpp.

Model Description

Based on Meta-Llama-3-8B-Instruct and governed by the Meta Llama 3 License agreement: https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct

Fine-tuned with the ORPO method using the following datasets:

Although toxic datasets were included to reduce refusals, this model is still relatively safe; it refuses less often than the original Meta model.

As of now, ORPO fine-tuning seems to improve some metrics while reducing others by a lot:

OpenLLM Leaderboard

Instruct format:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
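The template above can be assembled programmatically. The following is a minimal sketch, assuming each header is followed by a blank line (two newline characters) as in the standard Llama 3 chat format; the helper names `build_prompt` and `_header` are hypothetical and not part of this model or of llama.cpp.

```python
from typing import Dict, List

# Special tokens copied from the instruct format shown above.
BOS = "<|begin_of_text|>"
EOT = "<|eot_id|>"


def _header(role: str) -> str:
    """Return the role header followed by a blank line (assumed separator)."""
    return f"<|start_header_id|>{role}<|end_header_id|>\n\n"


def build_prompt(messages: List[Dict[str, str]]) -> str:
    """Render messages in the instruct format and leave an open assistant turn.

    Each message is a dict with "role" (system, user, or assistant)
    and "content" keys.
    """
    parts = [BOS]
    for msg in messages:
        parts.append(_header(msg["role"]) + msg["content"] + EOT)
    # End with an assistant header so the model generates the next reply.
    parts.append(_header("assistant"))
    return "".join(parts)


example = build_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(example)
```

Depending on the runtime, the `<|begin_of_text|>` token may already be added during tokenization; in that case it would likely need to be dropped from the string built here to avoid a duplicate.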

Quants:

GGUF

Model size: 8.03B params
Architecture: llama
Quantization levels: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
