Qwen2-1.5B-Instruct-Abliterated-GGUF

Model: Qwen2-1.5B-Instruct-Abliterated
Made by: trollek

Based on original model: Qwen2-1.5B-Instruct
Created by: Qwen

Quantization notes

Made with llama.cpp-b3154 with imatrix file based on Exllamav2 default dataset.
01.09.2024: Added Q4_0_4_4 (low end ARM CPUs), Q4_0_4_8 and Q4_0_8_8 (high end ARM CPUs).
On my PC with i7-3770 CPU these are significantly slower than Q4_K_M.
On my phone Q4_0_4_4 is marginally faster than Q4_K_M.
17.12.2024: Readme update. It seems Q4_0_4_4, Q4_0_4_8 and Q4_0_8_8 support was removed in recent llama.cpp. I'll keep them but they might be no longer useful.

Original model card

This is an abliterated version of Qwen2-1.5B-Instruct using the same procedure as augmxnt/Qwen2-7B-Instruct-deccp with their code on Github with some added lines from mlabonne/harmful_behaviors to the harmful.txt file.

I have not done anything else to the model. Yet.

cgus
/

Qwen2-1.5B-Instruct-Abliterated-iMat-GGUF

Qwen2-1.5B-Instruct-Abliterated-GGUF

Quantization notes

Original model card

Model tree for cgus/Qwen2-1.5B-Instruct-Abliterated-iMat-GGUF