Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
bird-of-paradise
/
deepseek-moe
like
1
Text Generation
Transformers
PyTorch
English
deepseek-moe
mixture-of-experts
Mixture of Experts
efficient-transformer
arxiv:
2101.03961
arxiv:
2006.16668
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
deepseek-moe
/
src
1 contributor
History:
1 commit
bird-of-paradise
Initial commit
354a706
19 days ago
__pycache__
Initial commit
19 days ago
tests
Initial commit
19 days ago
.DS_Store
6.15 kB
Initial commit
19 days ago
__init__.py
256 Bytes
Initial commit
19 days ago
moe.py
5.61 kB
Initial commit
19 days ago