Zhangchen Xu
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -1,10 +1,19 @@
|
|
1 |
---
|
2 |
title: README
|
3 |
-
emoji:
|
4 |
colorFrom: green
|
5 |
colorTo: red
|
6 |
sdk: static
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
title: README
|
3 |
+
emoji: π±
|
4 |
colorFrom: green
|
5 |
colorTo: red
|
6 |
sdk: static
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
+
# π± KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
|
11 |
+
|
12 |
+
KodCode is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. It contains 12 distinct subsets spanning various domains (from algorithmic to package-specific knowledge) and difficulty levels (from basic coding exercises to interview and competitive programming challenges). KodCode is designed for both supervised fine-tuning (SFT) and RL tuning.
|
13 |
+
|
14 |
+
|
15 |
+
<div align="center">
|
16 |
+
|
17 |
+
πΈοΈ [Project Website](https://kodcode-ai.github.io/) | π [Technical Report](https://arxiv.org/abs/2503.02951) | πΎ [Github Repo](https://github.com/KodCode-AI/kodcode) | π€ [KodCode-V1 (For RL)](https://huggingface.co/datasets/KodCode/KodCode-V1) | π€ [KodCode-V1-SFT-R1 (for SFT)](https://huggingface.co/datasets/KodCode/KodCode-V1-SFT-R1)
|
18 |
+
|
19 |
+
</div>
|