This is the 8-bit quantized version of NousResearch/Hermes-3-Llama-3.1-8B by following the example from the AutoGPTQ repository.

Downloads last month
10
Inference API
Unable to determine this model's library. Check the docs .

Model tree for ktoprakucar/Hermes-3-Llama-3.1-8B-Q8-GPTQ

Quantized
(170)
this model