Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fla-hub
/
gsa-2.7B-100B
like
0
Follow
fla-hub
32
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv:
2409.07146
License:
mit
Model card
Files
Files and versions
Community
1
main
gsa-2.7B-100B
Commit History
Upload GSAForCausalLM
48378f1
verified
yzhangcs
commited on
7 days ago
Remove the `norm_first` option
9aefe91
yzhangcs
commited on
10 days ago
Update README.md
9d48ee5
verified
yzhangcs
commited on
Sep 30, 2024
Link model to paper (
#1
)
a250277
verified
yzhangcs
nielsr
HF staff
commited on
Sep 22, 2024
Upload GSAForCausalLM
671ae86
verified
yzhangcs
commited on
Jun 9, 2024
initial commit
fbd04f2
verified
yzhangcs
commited on
Jun 9, 2024