Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models Paper β’ 2411.06402 β’ Published Nov 10, 2024 β’ 2
view article Article ArabicWeb24: Creating a High Quality Arabic Web-only Pre-training Dataset By MayFarhat β’ Aug 8, 2024 β’ 11
LLM360: Towards Fully Transparent Open-Source LLMs Paper β’ 2312.06550 β’ Published Dec 11, 2023 β’ 57