Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen1.5-14B-Chat-GPTQ-Int4
like
19
Follow
Qwen
3,383
Text Generation
Transformers
Safetensors
English
qwen2
chat
conversational
text-generation-inference
Inference Endpoints
4-bit precision
gptq
arxiv:
2309.16609
License:
tongyi-qianwen
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
Is fast attention supported?
#2
by
ericzzz
- opened
Mar 3
Discussion
ericzzz
Mar 3
I got an error message when loading the model with flash attention
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment