my-awesome-model / README.md
simarora's picture
Create README.md
5fb3686 verified
metadata
datasets:
  - EleutherAI/pile
language:
  - en

Based model but uses layernorm instead of QK.sum(-1) for the normalization, for better hardware efficiency.