Update README.md
Browse files
README.md
CHANGED
@@ -12,6 +12,8 @@ tags:
|
|
12 |
- 5M-Logits
|
13 |
- trl
|
14 |
---
|
|
|
|
|
15 |
# **Megatron-Corpus-14B-Exp**
|
16 |
|
17 |
Megatron-Corpus-14B-Exp is based on the Qwen 2.5 14B modality architecture, designed to enhance the reasoning capabilities of 14B-parameter models. It has been fine-tuned on a synthetic dataset based on math corpus, further optimizing its chain-of-thought (CoT) reasoning and logical problem-solving abilities. The model demonstrates significant improvements in context understanding, structured data processing, and long-context comprehension, making it ideal for complex reasoning tasks, instruction-following, and text generation.
|
|
|
12 |
- 5M-Logits
|
13 |
- trl
|
14 |
---
|
15 |
+

|
16 |
+
|
17 |
# **Megatron-Corpus-14B-Exp**
|
18 |
|
19 |
Megatron-Corpus-14B-Exp is based on the Qwen 2.5 14B modality architecture, designed to enhance the reasoning capabilities of 14B-parameter models. It has been fine-tuned on a synthetic dataset based on math corpus, further optimizing its chain-of-thought (CoT) reasoning and logical problem-solving abilities. The model demonstrates significant improvements in context understanding, structured data processing, and long-context comprehension, making it ideal for complex reasoning tasks, instruction-following, and text generation.
|