
OpenCSG
The name OpenCSG stands for Converged resources, Software refined, and Generative LM. 'C' (Converged resources) refers to integrating and fully utilizing hybrid resources. 'S' (Software refined) refers to software that is refined by large models. 'G' (Generative LM) refers to widespread, inclusive, and democratized generative large models.
The vision of OpenCSG is to empower every industry, every company, and every individual to own their models. We adhere to the principles of openness and open source, and we make the OpenCSG large model software stack available to the community. We welcome everyone to use it, give feedback, and contribute.
Collections (6)
SLM pretrained from scratch
A suite of high-quality Chinese datasets for pretraining, fine-tuning, or preference alignment, along with the models trained on these datasets.
- opencsg/Fineweb-Edu-Chinese-V2.1 • 958M • 52.8k • 22
- OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training • Paper • 2501.08197 • 8
- opencsg/chinese-fineweb-edu-v2 • 188M • 4.21k • 62
- opencsg/chinese-fineweb-edu • 84.6M • 21.1k • 91
Models (34)
opencsg/OpenCSG-R1-Qwen2.5-Code-3B-V1 • Text Generation • 5
opencsg/OpenCSG-Qwen2.5-3B-GUI • 78 • 1
opencsg/OpenCSG-Qwen2.5-7B-GUI • 32 • 1
opencsg/OpenCSG-R1-Qwen2.5-Math-7B-V1 • 102 • 4
opencsg/OpenCSG-R1-Qwen2.5-Math-3B-V1 • 49 • 3
opencsg/csg-wukong-2b-ultrafeedback-chinese-binarized-lowest • 20 • 1
opencsg/csg-wukong-2b-ultrafeedback-chinese-binarized • 14
opencsg/csg-wukong-2b-smoltalk-chinese • 11 • 1
opencsg/opencsg-starcoder2-15b-v0.1 • Text Generation • 14 • 2
opencsg/opencsg-CodeLlama-34b-v0.2 • Text Generation • 22 • 2
Datasets (10)
opencsg/autohub-benchmark • 99 • 1.91k • 1
opencsg/Fineweb-Edu-Chinese-V2.1 • 958M • 52.8k • 22
opencsg/chinese-fineweb-v2-scorer-train-data • 318
opencsg/chinese-fineweb-edu • 84.6M • 21.1k • 91
opencsg/chinese-fineweb-edu-v2 • 188M • 4.21k • 62
opencsg/smoltalk-chinese • 738 • 26
opencsg/chinese-cosmopedia • 1.07k • 60
opencsg/UltraFeedback-chinese • 493 • 8
opencsg/PR_review_deepseek • 24.8k • 79 • 3
opencsg/csg-robomaster • 643 • 2