fla-hub/gsa-1.3B-100B
Text Generation | Safetensors | cerebras/SlimPajama-627B | English | fla | gsa | arxiv:2409.07146 | License: mit
Commit History (branch: main)
Update README.md | 39d60a1 | verified | yzhangcs committed on Sep 30
Update README.md | 781baa4 | verified | yzhangcs committed on Sep 30
Link model to paper (#1) | 63d0c7d | verified | yzhangcs and nielsr (HF staff) committed on Sep 22
Update tokenizer_config.json | 2c8b93b | verified | yzhangcs committed on Sep 2
Upload GSAForCausalLM | 023f4e2 | verified | yzhangcs committed on Jun 7
initial commit | f060f9a | verified | yzhangcs committed on Jun 7