Qwen/Qwen1.5-14B-Chat-GPTQ-Int4
Text Generation
Transformers
Safetensors
English
qwen2
chat
conversational
text-generation-inference
Inference Endpoints
4-bit precision
gptq
arxiv: 2309.16609
License: tongyi-qianwen
Community
[AUTOMATED] Model Memory Requirements
#4 opened 8 months ago by model-sizer-bot
Is fast attention supported?
#2 opened 9 months ago by ericzzz
can't run with fastchat cuda 12.1
#1 opened 9 months ago by jaywanghz (2 comments)