TheBloke/Qwen-7B-Chat-AWQ
Tags: Text Generation, Transformers, Safetensors, Chinese, English, qwen, custom_code, 4-bit precision, awq, arxiv (5 papers)
Revision: 9be754f
1 contributor
History: 5 commits
Latest commit: TheBloke, "Fix config.json, disabling Flash Attention" (9be754f), 12 months ago
File | Scan status | Size | Last commit | Updated
.gitattributes | Safe | 1.52 kB | initial commit | 12 months ago
LICENSE | Safe | 6.9 kB | AWQ model commit | 12 months ago
NOTICE | Safe | 2.7 kB | AWQ model commit | 12 months ago
README.md | Safe | 48.4 kB | Upload README.md | 12 months ago
config.json | Safe | 1.28 kB | Fix config.json, disabling Flash Attention | 12 months ago
configuration_qwen.py | Safe | 2.35 kB | AWQ model commit | 12 months ago
cpp_kernels.py | Safe | 1.92 kB | AWQ model commit | 12 months ago
generation_config.json | Safe | 249 Bytes | AWQ model commit | 12 months ago
model.safetensors | Safe | 5.86 GB (LFS) | AWQ model commit | 12 months ago
modeling_qwen.py | Safe | 55.8 kB | AWQ model commit | 12 months ago
quant_config.json | Safe | 90 Bytes | AWQ model commit | 12 months ago
qwen.tiktoken | Safe | 2.56 MB | AWQ model commit | 12 months ago
qwen_generation_utils.py | Safe | 14.6 kB | AWQ model commit | 12 months ago
special_tokens_map.json | Safe | 3 Bytes | AWQ model commit | 12 months ago
tokenization_qwen.py | Safe | 9.62 kB | AWQ model commit | 12 months ago
tokenizer_config.json | Safe | 173 Bytes | AWQ model commit | 12 months ago