simarora commited on
Commit
e5f50a6
1 Parent(s): 68d9094

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -8,9 +8,9 @@ language:
8
 
9
  This model is pretrained Based model.
10
 
11
- As a quality reference, we include a pretrained Mamba model provided here: https://huggingface.co/hazyresearch/mamba-1b-50b
12
 
13
- Both checkpoints are pretrained on **50Bn tokens** of the Pile in the exact same data order using next token prediction.
14
 
15
  A WandB report for training is here: https://api.wandb.ai/links/hazy-research/ggo9rst2
16
 
 
8
 
9
  This model is pretrained Based model.
10
 
11
+ As a quality reference, we include a pretrained Mamba model provided here: https://huggingface.co/hazyresearch/mamba-1b-50b and a pretrained attention (Llama architecture) model provided here: https://huggingface.co/hazyresearch/attn-1b-50bn
12
 
13
+ All three checkpoints are pretrained on **50Bn tokens** of the Pile in the exact same data order using next token prediction.
14
 
15
  A WandB report for training is here: https://api.wandb.ai/links/hazy-research/ggo9rst2
16