nvidia/mamba2-hybrid-8b-3t-128k
Tags: Text Generation, English, Megatron-LM, nvidia, Mamba, Mamba-2, SSM, 8B
arxiv: 2406.07887
arxiv: 2405.21060
License: apache-2.0
Branch: main
1 contributor
History: 2 commits
Latest commit: rwaleffe, "Upload model", 9083d98, 23 days ago
Name                                                           Size      Last commit      Updated
release                                                        -         Upload model     23 days ago
.gitattributes                                                 1.52 kB   initial commit   23 days ago
README.md                                                      2.23 kB   Upload model     23 days ago
latest_checkpointed_iteration.txt                              8 Bytes   Upload model     23 days ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model   4.57 MB (LFS)   Upload model   23 days ago