Big Code is an open scientific collaboration working on the responsible training of large language models for coding applications.
You can find more information on the main website at https://www.bigcode-project.org. You can also follow Big Code on Twitter at https://twitter.com/BigCodeProject.
In this organization, you can find The Stack, a 3.1 TB dataset of source code in 30 programming languages, along with its near-deduplicated version and a small subset.
If you want to access the models trained on these datasets, please send a request to contact@bigcode-project.org.