Qwen
/

Qwen2-1.5B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (1)

evaluation pipeline

#8 opened about 2 months ago by

Hello, is this 1.5B model trained from scratch, or is it distilled like LLaMA 3.2?

#7 opened 5 months ago by

recommended context length for SFT?

#6 opened 7 months ago by

Why is there no model.safetensors.index.json file?

#5 opened 7 months ago by

[AUTOMATED] Model Memory Requirements

#3 opened 8 months ago by

model-sizer-bot

lm_eval results is weird

#2 opened 9 months ago by

Upload ONNX weights

#1 opened 9 months ago by