AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinctβ’ MI250 GPUs based on OLMo. β’ 4 items β’ Updated 19 days ago β’ 16
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 β’ 8 items β’ Updated 13 days ago β’ 95
view article Article Optimum-NVIDIA - Unlock blazingly fast LLM inference in just 1 line of code Dec 5, 2023 β’ 4
view article Article π₯ Argilla 2.0: the data-centric tool for AI makers π€ By dvilasuero β’ Jul 30 β’ 37
NIM Serverless Inference API Collection Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. β’ 8 items β’ Updated Oct 14 β’ 21