Spaces:
Running
Running
Commit
·
b27f422
1
Parent(s):
ca86990
Update README.md
Browse files
README.md
CHANGED
@@ -19,15 +19,16 @@ Hello World! This is codefuse-ai!
|
|
19 |
|
20 |
<a id="english"></a>
|
21 |
|
22 |
-
|
23 |
|
|
|
|
|
|
|
|
|
24 |
|
25 |
-
|
26 |
-
|
27 |
|
28 |
We believe that our solution can significantly enhance the performance of pretrained LLMs across multiple related tasks simultaneously. We are committed to further exploring this direction and providing more open-source contributions. We also encourage engineers and researchers within this community to join us in co-constructing CodeFuse.
|
29 |
-
|
30 |
-
|
31 |
</div>
|
32 |
|
33 |
<a id="chinese"></a>
|
|
|
19 |
|
20 |
<a id="english"></a>
|
21 |
|
22 |
+
**The mission of CodeFuse is to develop Code Large Language Models (Code LLMs) specifically designed to support the entire software development lifecycle, covering crucial stages such as design, requirements, coding, testing, deployment, operations, and maintenance.** We are passionate about creating innovative solutions that empower developers throughout the software development process.
|
23 |
|
24 |
+
In this release, we are open sourcing
|
25 |
+
1. [**The MFT (Multi-Task Fine-Tuning) framework, known as MFTcoder**](https://github.com/codefuse-ai/MFTCoder);
|
26 |
+
2. **Two datasets for enhancing the coding capabilities of LLMs**, that is, [Code Exercise](https://huggingface.co/datasets/codefuse/CodeExercise-Python-27k) and [Evol-Instruction](https://huggingface.co/datasets/codefuse/Evol-instruction-66k);
|
27 |
+
3. [**A faster and more reliable deployment framework based on FasterTransformer**](https://github.com/codefuse-ai/FasterTransformer4CodeFuse);
|
28 |
|
29 |
+
The resulting model ensemble, which includes [CodeFuse-13B](https://huggingface.co/codefuse/CodeFuse-13B) and [CodeFuse-CodeLlama-34B](https://huggingface.co/codefuse/CodeFuse-CodeLlama-34B), supports various code-related tasks such as code completion, text-to-code conversion, and unit test generation. In particular, [CodeFuse-CodeLlama-34B](https://huggingface.co/codefuse/CodeFuse-CodeLlama-34B), built upon CodeLlama as the base model and fine-tuned using the proposed MFT framework, achieves an impressive score of **74.4% (greedy decoding)** in the HumanEval Python pass@1 evaluation, **even surpassing the performance of GPT-4 (71%)**. We have plans to incorporate additional base LLMs into the ensemble in the near future.
|
|
|
30 |
|
31 |
We believe that our solution can significantly enhance the performance of pretrained LLMs across multiple related tasks simultaneously. We are committed to further exploring this direction and providing more open-source contributions. We also encourage engineers and researchers within this community to join us in co-constructing CodeFuse.
|
|
|
|
|
32 |
</div>
|
33 |
|
34 |
<a id="chinese"></a>
|