Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
TRI-ML
/
mamba-7b-rw
like
54
Follow
Toyota Research Institute
102
Text Generation
PyTorch
Safetensors
tiiuae/falcon-refinedweb
English
openlm
mamba
linear
Eval Results
arxiv:
2312.00752
arxiv:
2405.06640
License:
apache-2.0
Model card
Files
Files and versions
Community
9
3c58287
mamba-7b-rw
/
config.json
sedrick-keh-tri
push jsons
2501def
11 months ago
raw
Copy download link
history
blame
Safe
80 Bytes
{
"d_model"
:
4096
,
"n_layer"
:
64
,
"vocab_size"
:
50432
,
"seq_len"
:
2048
}