Commit d0102f5 (1 parent: 13f37cd) by Terry Zhuo (Monash University)
add astraios

README.md CHANGED
@@ -17,7 +17,7 @@ pinned: false
 
 # BigCode
 
-BigCode is an open scientific collaboration working on responsible training of large language models for coding applications. You can find more information on the main [website](https://www.bigcode-project.org/) or follow Big Code on [Twitter](https://twitter.com/BigCodeProject). In this organization you can find the artefacts of this collaboration: **StarCoder**, a state-of-the-art language model for code, **OctoPack**, artifacts for instruction tuning large code models, **The Stack**, the largest available pretraining dataset with permissive code, and **SantaCoder**, a 1.1B parameter model for code.
+BigCode is an open scientific collaboration working on responsible training of large language models for coding applications. You can find more information on the main [website](https://www.bigcode-project.org/) or follow Big Code on [Twitter](https://twitter.com/BigCodeProject). In this organization you can find the artefacts of this collaboration: **StarCoder 2**, a state-of-the-art language model for code, **StarCoder**, a previous state-of-the-art language model for code, **Astraios**, scaling instruction-tuned language models for code via diverse fine-tuning methods, **OctoPack**, artifacts for instruction tuning large code models, **The Stack**, the largest available pretraining dataset with permissive code, and **SantaCoder**, a 1.1B parameter model for code.
 
 ---
 <details>
@@ -82,6 +82,7 @@ BigCode is an open scientific collaboration working on responsible training of l
 - [StarCoder Search](https://huggingface.co/spaces/bigcode/search): Full-text search code in the pretraining dataset.
 - [StarCoder Membership Test](https://stack.dataportraits.org/): Blazing fast test if code was present in pretraining dataset.
 </details>
+
 ---
 <details>
 <summary>
@@ -101,6 +102,21 @@ BigCode is an open scientific collaboration working on responsible training of l
 - [OctoGeeX](https://huggingface.co/bigcode/octogeex): Instruction tuned model of CodeGeeX2 by training on CommitPackFT.
 </details>
 ---
+<details>
+<summary>
+<b><font size="+1">✨Astraios</font></b>
+</summary>
+
+Astraios is a suite of 28 instruction-tuned language models for code, spanning 7 fine-tuning methods and 4 model sizes.
+
+- [Paper](https://arxiv.org/abs/2401.00788): Research paper with details about all components of Astraios.
+- [GitHub](https://github.com/bigcode-project/astraios): All code used for the creation of Astraios.
+- [Astraios-1B](https://huggingface.co/collections/bigcode/astraios-1b-6576ff1b8e449026ae327c1c): Collection of StarCoderBase-1B models instruction tuned on CommitPackFT + OASST with 7 methods.
+- [Astraios-3B](https://huggingface.co/collections/bigcode/astraios-3b-6577127317ee44ff547252d3): Collection of StarCoderBase-3B models instruction tuned on CommitPackFT + OASST with 7 methods.
+- [Astraios-7B](https://huggingface.co/collections/bigcode/astraios-7b-65788b509c5c26f96c08d576): Collection of StarCoderBase-7B models instruction tuned on CommitPackFT + OASST with 7 methods.
+- [Astraios-15B](https://huggingface.co/collections/bigcode/astraios-15b-65788b7476b6de79781054cc): Collection of StarCoderBase-15B models instruction tuned on CommitPackFT + OASST with 7 methods.
+</details>
+---
 <details>
 <summary>
 <b><font size="+1">📑The Stack</font></b>