Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-14B-Chat-Int4
like
101
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
4-bit precision
gptq
5 papers
Model card
Files
Files and versions
Community
9
Train
Use this model
83cdd36
Qwen-14B-Chat-Int4
2 contributors
History:
18 commits
JustinLin610
Update README.md
83cdd36
11 months ago
assets
update modeling_qwen.py
12 months ago
.gitattributes
1.7 kB
Upload 3 files
12 months ago
LICENSE
6.9 kB
update batch infer
12 months ago
NOTICE
2.7 kB
update batch infer
12 months ago
README.md
28.9 kB
Update README.md
11 months ago
cache_autogptq_cuda_256.cpp
8.4 kB
update kernels
12 months ago
cache_autogptq_cuda_kernel_256.cu
52 kB
update kernels
12 months ago
config.json
1.2 kB
upload model
12 months ago
configuration_qwen.py
2.35 kB
softmax_in_fp32
12 months ago
cpp_kernels.py
1.92 kB
update kernels
12 months ago
generation_config.json
238 Bytes
upload model
12 months ago
model-00001-of-00005.safetensors
2.05 GB
LFS
upload model
12 months ago
model-00002-of-00005.safetensors
2.02 GB
LFS
upload model
12 months ago
model-00003-of-00005.safetensors
2.04 GB
LFS
upload model
12 months ago
model-00004-of-00005.safetensors
2 GB
LFS
upload model
12 months ago
model-00005-of-00005.safetensors
1.56 GB
LFS
upload model
12 months ago
model.safetensors.index.json
82.2 kB
upload model
12 months ago
modeling_qwen.py
56.9 kB
softmax_in_fp32
12 months ago
quantize_config.json
211 Bytes
upload model
12 months ago
qwen.tiktoken
2.56 MB
upload model
12 months ago
qwen_generation_utils.py
14.6 kB
upload model
12 months ago
tokenization_qwen.py
8.44 kB
upload model
12 months ago
tokenizer_config.json
193 Bytes
upload model
12 months ago