To be written, for now please see: https://github.com/bigscience-workshop/bigscience/tree/master/train/tr11-176B-ml