Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192 • Text Generation • Updated 9 days ago • 169
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken • Text Generation • Updated 8 days ago • 19
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken-new • Text Generation • Updated 8 days ago • 24