mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC Text Generation • Updated Nov 6, 2024 • 130 • 4
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 61