Qwen/Qwen2-57B-A14B-Instruct
Text Generation
Transformers
Safetensors
English
qwen2_moe
chat
conversational
Inference Endpoints
arxiv: 2309.00071
License: apache-2.0
Community (5)
Why is there no Qwen2-57B-A14B-Instruct-GPTQ-Int8?
#5 opened 10 days ago by Vaccummer
Using vLLM raises an error: CUDA out of memory
#3 opened 17 days ago by zhaoyang0618 (1 comment)