Rombos-LLM-V2.5-Qwen-32b

image/jpeg

Rombos-LLM-V2.5-Qwen-32b is a continues finetuned version of Qwen2.5-32B. I noticed recently that the Qwen team did not learn from my methods of continuous finetuning, the great benefits, and no downsides of it. So I took it upon myself to merge the instruct model with the base model myself using the Ties merge method

This version of the model shows higher performance than the original instruct and base models.

Quants: (Coming soon)

GGUF: https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-32b-GGUF

EXL2:

(8-bit)

(5-bit)

(4.25-bit)

Downloads last month
20
Safetensors
Model size
32.8B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for Rombo-Org/Rombo-LLM-V2.5-Qwen-32b

Base model

Qwen/Qwen2.5-32B
Finetuned
(119)
this model
Finetunes
1 model
Merges
4 models
Quantizations
2 models

Collection including Rombo-Org/Rombo-LLM-V2.5-Qwen-32b