tinystories-1M-SAES / 1_cfg.json
ckkissane's picture
First model version
d8bc519
raw
history blame
152 Bytes
{"seed": 49, "batch_size": 4096, "buffer_mult": 384, "lr": 0.001, "num_tokens": 200000000, "l1_coeff": 0.068, "beta1": 0.9, "beta2": 0.99, "dict_mult":