---
language:
- en
tags:
- text-generation
- gsa
license: mit
datasets:
- cerebras/SlimPajama-627B
library_name: fla
---
This repository hosts `gsa-2.7B-100B`, the model from the paper [Gated Slot Attention for Efficient Linear-Time Sequence Modeling](https://huggingface.co/papers/2409.07146).
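
Since the metadata lists `fla` as the library, loading the checkpoint might look roughly like the sketch below. It is not an official usage recipe. It assumes the `fla` (flash-linear-attention) and `transformers` packages are installed, that a CUDA GPU is available, and that the checkpoint lives under the Hub path in `REPO_ID`. That path is a guess based on the repository name, and the `generate` helper is hypothetical, so adjust both as needed.

```python
# Assumed Hub path, inferred from the repository name; replace it if the
# checkpoint is hosted under a different organization or name.
REPO_ID = "fla-hub/gsa-2.7B-100B"


def generate(prompt: str, repo_id: str = REPO_ID, max_new_tokens: int = 32) -> str:
    """Hypothetical helper: load the checkpoint and continue `prompt`."""
    # Imported lazily so this module can be imported without the heavy
    # dependencies installed. Importing `fla` registers its model classes
    # with the transformers Auto* factories.
    import fla  # noqa: F401
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # Assumption: a CUDA device is present for the model and its kernels.
    model = AutoModelForCausalLM.from_pretrained(repo_id).cuda()

    input_ids = tokenizer(prompt, return_tensors="pt").input_ids.cuda()
    output_ids = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the weights on first use):
# print(generate("Gated slot attention is"))
```

The first call downloads the weights, so expect it to take a while; the decoding settings are left at the library defaults apart from `max_new_tokens`.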