nvidia/mamba2-hybrid-8b-3t-32k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:2406.07887
arxiv:2405.21060
License: apache-2.0
main
1 contributor
History: 2 commits
rwaleffe  Upload model  425c863  18 days ago
Name                                                          Size           Last commit     Updated
release                                                       -              Upload model    18 days ago
.gitattributes                                                1.52 kB        initial commit  19 days ago
README.md                                                     2.23 kB        Upload model    18 days ago
latest_checkpointed_iteration.txt                             8 Bytes        Upload model    18 days ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model  4.57 MB (LFS)  Upload model    18 days ago