Create README.md
Browse files
README.md
CHANGED
@@ -0,0 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
{}
|
3 |
+
---
|
4 |
+
static quants of https://www.modelscope.cn/models/swift/MS-LongWriter-Qwen2.5-7B-Instruct
|
5 |
+
|
6 |
+
MS-LongWriter-Qwen2.5-7B-Instruct is trained based on https://modelscope.cn/models/qwen/Qwen2.5-7B-Instruct, and is capable of generating 10,000+ words at once.
|
7 |
+
|
8 |
+
MS-LongWriter-Qwen2.5-7B-Instruct begins training directly from the Qwen2.5-7B-Instruct, while performing significant distillation on the LongWriter-6k to obtain 666 high-quality samples, which is LongWriter-6k-filtered
|
9 |
+
|
10 |
+
Datasets
|
11 |
+
LongWriter-6k-filtered, based on the LongWriter-6k
|
12 |
+
Magpie-Qwen2-Pro-200K-Chinese , random sampling 6k examples.
|
13 |
+
Magpie-Qwen2-Pro-200K-English , random sampling 6k examples.
|
14 |
+
|
15 |
+
|
16 |
+
想测试体验一下这个模型的效果,但没看到有人量化,只能自己动手做一个。
|