Naijaweb datasets ๐ณ๐ฌ Collection A recreation of the fineweb collection for Nigerians โข 3 items โข Updated Oct 24 โข 5
OpenCulture Collection A multilingual dataset of public domain books and newspapers. โข 27 items โข Updated Nov 6 โข 120
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper โข 2311.00430 โข Published Nov 1, 2023 โข 57