---
license: apache-2.0
---

Pretrained toy model. Made with Andrej Karpathy's NanoGPT, ~2023.

Parameters:

- `batch_size = 64`
- `block_size = 256`
- `n_layer = 8`
- `n_head = 8`
- `n_embd = 768`

Everything else is left as is.
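
For a rough sense of scale, the listed hyperparameters can be turned into a few derived numbers. This is only a sketch: `ToyConfig` and the helper functions are hypothetical names made up for illustration and are not part of NanoGPT. The `12 * n_layer * n_embd**2` figure is the usual rule-of-thumb estimate for the transformer blocks of a GPT-style model. It ignores token and position embeddings, biases, and layer norms, since the vocabulary size is not stated here. The tokens-per-batch figure assumes no gradient accumulation, which is also not stated.

```python
from dataclasses import dataclass


@dataclass
class ToyConfig:
    # Values copied from the parameter list above; the class itself is hypothetical.
    batch_size: int = 64
    block_size: int = 256
    n_layer: int = 8
    n_head: int = 8
    n_embd: int = 768


def head_dim(cfg: ToyConfig) -> int:
    # The embedding width must split evenly across the attention heads.
    if cfg.n_embd % cfg.n_head != 0:
        raise ValueError("n_embd must be divisible by n_head")
    return cfg.n_embd // cfg.n_head


def approx_block_params(cfg: ToyConfig) -> int:
    # Rule of thumb: about 4*d^2 weights for attention plus 8*d^2 for the MLP
    # in each layer. Embeddings, biases, and layer norms are not counted.
    return 12 * cfg.n_layer * cfg.n_embd ** 2


def tokens_per_batch(cfg: ToyConfig) -> int:
    # Tokens seen in one batch, assuming no gradient accumulation.
    return cfg.batch_size * cfg.block_size


if __name__ == "__main__":
    cfg = ToyConfig()
    print("head dimension:", head_dim(cfg))
    print("approx. block parameters:", approx_block_params(cfg))
    print("tokens per batch:", tokens_per_batch(cfg))
```

Under these assumptions the model has a head dimension of 96 and roughly 57M parameters in its transformer blocks, before embeddings are counted.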