These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes.
HazyResearch
community
AI & ML interests
None defined yet.
Collections
3
models
19
hazyresearch/mamba-360M-30B
Updated
•
57
hazyresearch/based-360M-30B
Updated
•
230
hazyresearch/attn-360M-30B
Updated
•
137
hazyresearch/M2-BERT-8k-Retrieval-Encoder-V1
Fill-Mask
•
Updated
•
7
•
1
hazyresearch/M2-BERT-2k-Retrieval-Encoder-V1
Fill-Mask
•
Updated
•
27
•
1
hazyresearch/M2-BERT-32K-Retrieval-Encoder-V1
Fill-Mask
•
Updated
•
81
•
1
hazyresearch/M2-BERT-128-Retrieval-Encoder-V1
Fill-Mask
•
Updated
•
10
•
1
hazyresearch/based-1b-50b
Updated
•
226
•
1
hazyresearch/attn-1b-50bn
Updated
•
123
hazyresearch/mamba-360m
Updated
•
1
datasets
14
hazyresearch/based_nq_1024
Viewer
•
Updated
•
3.16k
•
145
hazyresearch/based_nq_512
Viewer
•
Updated
•
3.16k
•
66
hazyresearch/based_nq_2048
Viewer
•
Updated
•
3.16k
•
74
hazyresearch/based_triviaqa
Viewer
•
Updated
•
1.69k
•
385
hazyresearch/based_drop
Viewer
•
Updated
•
2.09k
•
89
hazyresearch/based-squad
Viewer
•
Updated
•
2.98k
•
883
hazyresearch/based-swde
Viewer
•
Updated
•
1.11k
•
191
•
2
hazyresearch/based-fda
Viewer
•
Updated
•
1.1k
•
696
•
3
hazyresearch/LoCoV1-Queries
Viewer
•
Updated
•
7.73k
•
350
hazyresearch/LoCoV1-Documents
Viewer
•
Updated
•
14.8k
•
382
•
1