What tensors have you modified?

#1
by Orion-zhen - opened

Thank you for your excellent work.

I followed the deccp code and applied the refusal directions to self_attn.o_proj and mlp.down_proj. However, it only worked with Llama models; other models like Qwen, Mistral, etc. were outputting garbage endlessly. But your model performs well. So which tensors did you modify? Or did you use FailSpy/abliterator instead?
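For reference, the core weight edit in abliteration-style code is a rank-1 projection: subtract the refusal direction's component from the output space of a weight matrix such as o_proj or down_proj. A minimal NumPy sketch (function name and shapes are illustrative; the real scripts operate on per-layer torch tensors):

```python
import numpy as np

def ablate_direction(W, v):
    """Remove the component along refusal direction v from W's output space.

    W: weight matrix, shape (d_out, d_in) -- e.g. o_proj or down_proj weights.
    v: refusal direction in the residual stream, shape (d_out,).
    Returns W' = W - v v^T W, so that v . (W' x) == 0 for any input x.
    """
    v = v / np.linalg.norm(v)        # normalize to a unit vector
    return W - np.outer(v, v @ W)    # subtract the rank-1 projection

# toy check: after ablation, outputs have no component along v
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 4))
v = rng.standard_normal(8)
W_abl = ablate_direction(W, v)
x = rng.standard_normal(4)
leak = abs((v / np.linalg.norm(v)) @ (W_abl @ x))
print(leak)  # effectively zero, up to float rounding
```

If a model emits endless garbage after this edit, a common culprit is applying the direction in the wrong basis (e.g. to a matrix whose output is not the residual stream) or with the wrong dtype, rather than the projection itself.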

The key is how you determine the candidate layer.

So how do you determine the appropriate layer? The deccp code modifies a continuous range of layers: should I adjust that range, or pick out specific layers instead?

Also, I noticed that your abliterated model's max_window_layers is not the same as qwen2.5-coder-3b's. Is there any extra work that needs to be done?
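One quick sanity check for this kind of mismatch is to diff the two config.json files field by field. (As far as I know, max_window_layers only takes effect when use_sliding_window is enabled, but any unexplained difference is worth flagging.) A small sketch; the file paths in the comment are placeholders:

```python
import json

def diff_configs(a, b):
    """Return {key: (value_in_a, value_in_b)} for every field that differs."""
    keys = sorted(set(a) | set(b))
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}

# usage with two downloaded config files (placeholder paths):
# with open("qwen2.5-coder-3b/config.json") as f: base = json.load(f)
# with open("abliterated-model/config.json") as f: abl = json.load(f)
# print(diff_configs(base, abl))

# toy example with made-up values
base = {"max_window_layers": 36, "use_sliding_window": False}
abl = {"max_window_layers": 21, "use_sliding_window": False}
print(diff_configs(base, abl))
```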

In 01-compute_refusal_dir.py, line 28: "layer_idx = int(len(model.model.layers) * 0.6)". This 0.6 may not be correct and needs continuous testing; the right index could be anywhere from 1 up to the total number of model layers.
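Rather than hard-coding 0.6, one option is to sweep every layer and score how cleanly a difference-of-means direction separates harmful from harmless activations at that layer, then take the best-scoring layer. A sketch on synthetic activations; the separation score here is my own heuristic, not the repo's method:

```python
import numpy as np

def refusal_dir(harmful, harmless):
    """Difference-of-means refusal direction, normalized."""
    d = harmful.mean(axis=0) - harmless.mean(axis=0)
    return d / np.linalg.norm(d)

def score_layer(harmful, harmless):
    """How cleanly the direction splits the two activation sets.

    Higher means the refusal feature is more linearly separable here."""
    v = refusal_dir(harmful, harmless)
    ph, pl = harmful @ v, harmless @ v
    return (ph.mean() - pl.mean()) / (ph.std() + pl.std() + 1e-8)

# toy sweep: fake per-layer activations where separation grows with depth,
# then pick argmax instead of a fixed 0.6 fraction
rng = np.random.default_rng(0)
layers = []
for i in range(12):
    sep = i / 11
    harmless = rng.standard_normal((64, 16))
    harmful = rng.standard_normal((64, 16)) + sep
    layers.append((harmful, harmless))
best = max(range(12), key=lambda i: score_layer(*layers[i]))
print(best)
```

On real activations you would cache hidden states from harmful and harmless prompt sets at every layer, run the same sweep, and still verify the top candidates by actually ablating and checking outputs, since the highest-scoring layer is not guaranteed to give the cleanest generations.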

Wow, I can't thank you enough!

Orion-zhen changed discussion status to closed
