File size: 774 Bytes
7e86226
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---

{}
---

static quants of https://www.modelscope.cn/models/swift/MS-LongWriter-Qwen2.5-7B-Instruct

MS-LongWriter-Qwen2.5-7B-Instruct is trained based on https://modelscope.cn/models/qwen/Qwen2.5-7B-Instruct, and is capable of generating 10,000+ words at once.

MS-LongWriter-Qwen2.5-7B-Instruct begins training directly from the Qwen2.5-7B-Instruct, while performing significant distillation on the LongWriter-6k to obtain 666 high-quality samples, which is LongWriter-6k-filtered

Datasets
LongWriter-6k-filtered, based on the LongWriter-6k
Magpie-Qwen2-Pro-200K-Chinese , random sampling 6k examples.
Magpie-Qwen2-Pro-200K-English , random sampling 6k examples.


想测试体验一下这个模型的效果,但没看到有人量化,只能自己动手做一个。