mevatron/Diffsense-0.5B

text-generation-inference

Inference Endpoints

Uploaded model

Developed by: mevatron
License: apache-2.0
Finetuned from model: unsloth/Qwen2.5-Coder-0.5B-Instruct

This qwen2 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Downloads last month: 181

GGUF

Model size: 494M params
Architecture: qwen2
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
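
The card lists 494M parameters and GGUF quantizations from 2-bit to 8-bit. As a rough illustration of what those bit widths mean for weight storage, the sketch below multiplies the parameter count by the bits per weight. This is a lower-bound estimate under a simplifying assumption: real GGUF files are typically larger, because quantization schemes store per-block scales, may keep some tensors at higher precision, and include metadata. The helper name `approx_weight_bytes` is hypothetical and is not part of any library.

```python
# Rough lower-bound estimate of weight storage per quantization level.
# Assumption: every parameter is stored at exactly `bits` bits, which
# ignores per-block scales, mixed-precision tensors, and file metadata.

PARAMS = 494_000_000            # "494M params" as listed on the card
BIT_WIDTHS = (2, 3, 4, 5, 6, 8)  # quantization levels listed on the card


def approx_weight_bytes(n_params: int, bits: int) -> float:
    """Return n_params * bits / 8, i.e. the idealized weight size in bytes."""
    return n_params * bits / 8


if __name__ == "__main__":
    for bits in BIT_WIDTHS:
        megabytes = approx_weight_bytes(PARAMS, bits) / 1e6
        print(f"{bits}-bit: at least ~{megabytes:.0f} MB of weights")
```

The estimate is useful only for comparing levels against each other; the actual file sizes are shown on the repository's file listing.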

Model tree for mevatron/Diffsense-0.5B

Base model: Qwen/Qwen2.5-0.5B
Finetuned: Qwen/Qwen2.5-Coder-0.5B
Finetuned: Qwen/Qwen2.5-Coder-0.5B-Instruct
Finetuned: unsloth/Qwen2.5-Coder-0.5B-Instruct
Quantized (1): this model