jonathanjordan21/mos-mamba-18x130m-trainer-dgx-lora-sft-merged
Text Generation
Transformers
Safetensors
MoSMamba
conversational
custom_code
arxiv:1910.09700
main
Commit History
Upload tokenizer
fd8844d
verified
jonathanjordan21
committed on
Aug 23, 2024
Upload tokenizer
fbde525
verified
jonathanjordan21
committed on
Aug 23, 2024
Upload MoSMambaForCausalLM
fa7a04c
verified
jonathanjordan21
committed on
Aug 23, 2024
initial commit
bd6e855
verified
jonathanjordan21
committed on
Aug 23, 2024