SimpleStories-125M / config.json
lennart-finke's picture
Push model using huggingface_hub.
2f63422 verified
raw
history blame contribute delete
170 Bytes
{
"block_size": 1024,
"flash_attention": true,
"n_embd": 768,
"n_head": 12,
"n_key_value_heads": 3,
"n_layer": 12,
"rotary_dim": 64,
"vocab_size": 50257
}