Ponimash commited on
Commit
24882be
1 Parent(s): 44c57a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -3
README.md CHANGED
@@ -1,3 +1,31 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - ru
5
+ - en
6
+ base_model:
7
+ - Qwen/Qwen2.5-3B-Instruct
8
+ pipeline_tag: text-generation
9
+ library_name: transformers
10
+ ---
11
+
12
+ ---
13
+
14
+ ## FractalGPT/RuQwen2.5-3b-instruct
15
+
16
+ ---
17
+
18
+ ### Model Overview
19
+
20
+ - **RuQwen2.5-3b-instruct** by FractalGPT is a language model tailored to deliver high-quality Russian language output. Building upon the Qwen2.5 series, it is optimized for Russian-language tasks while retaining broad multilingual support.
21
+
22
+ - **Improved Russian Language Quality**: Adaptations have significantly enhanced the fluency, accuracy, and coherence of Russian text generation, making it an excellent choice for Russian-language applications.
23
+
24
+ ### Model Specifications
25
+
26
+ - **Type**: Instruction-tuned Causal Language Model
27
+ - **Training Stages**: Pretraining & Instruction Tuning
28
+ - **Architecture**: Transformer with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
29
+ - **Layers**: 36
30
+ - **Attention Heads (GQA)**: 24 for Q, 4 for KV
31
+ - **Context Length**: Supports a full context of 131,072 tokens and generation of up to 8,192 tokens