Qwen/Qwen-14B
Tags: Text Generation, Transformers, Safetensors, Chinese, English, qwen, custom_code, arxiv:2309.16609
Qwen-14B/configuration_qwen.py (revision 5eda948)
Commit History
5e88027 add softmax_in_fp32 (yangapku, committed on Sep 28, 2023)
319ed0f update kvcache (yangapku, committed on Sep 25, 2023)
5cde1bb upload model (yangapku, committed on Sep 24, 2023)