Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
's Collections
NeMo Audio Codecs
Hymba
Optimized ONNX models for NVIDIA RTX GPUs
Cosmos Tokenizer
Llama-3.1-Nemotron-70B
NVLM 1.0
OpenMath-2
Nemotron 4 340B
SteerLM
Parakeet
Canary
InstructRetro
OpenMath
RLHF
NV-Embed
Llama3-ChatQA-1.5
SSMs
Nemotron 3 8B
BigVGAN
MambaVision
Minitron
RADIO
NIM Serverless Inference API
Model Optimizer
Llama3-ChatQA-2
NeMo Curator - Classifier Models
Model Optimizer
updated
Oct 24
A collection of generative models quantized and optimized with TensorRT Model Optimizer.
Upvote
3
nvidia/Llama-3.1-8B-Instruct-FP8
Updated
Oct 24
•
903
•
7
nvidia/Llama-3.1-405B-Instruct-FP8
Updated
Oct 24
•
449
•
5
nvidia/Llama-3.1-70B-Instruct-FP8
Updated
Oct 24
•
3.69k
•
6
Upvote
3
Share collection
View history
Collection guide
Browse collections