Edit model card

Model Card for twhoool02/Llama-2-7b-chat-hf-AutoGPTQ

Model Details

This model is a GPTQ quantized version of the meta-llama/Llama-2-7b-chat-hf model.

  • Developed by: Ted Whooley
  • Library: Transformers, GPTQ
  • Model type: llama
  • Model name: Llama-2-7b-chat-hf-AutoGPTQ
  • Pipeline tag: text-generation
  • Qunatized by: twhoool02
  • Language(s) (NLP): en
  • License: other
Downloads last month
0
Safetensors
Model size
1.13B params
Tensor type
I32
·
FP16
·
Inference API
Input a message to start chatting with twhoool02/Llama-2-7b-hf-AutoGPTQ.
This model can be loaded on Inference API (serverless).

Quantized from