bartowski/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation


Compatible small models for speculative decoding?

#9 opened 2 days ago by

How much GPU RAM is needed?

#8 opened about 1 month ago by

q8 with 8 part

#7 opened about 2 months ago by

Q6_K vs. Q5_K_L

#6 opened 2 months ago by

IQ2_S

#4 opened 2 months ago by

Unable to pull in from Ollama

#3 opened 2 months ago by

Observation: 4-bit quantization can't answer the Strawberry prompt

#2 opened 2 months ago by

Nemotron 51B too please

#1 opened 3 months ago by