InterpBench / README.md
cybershiptrooper's picture
change to cc by
e5afb2d
|
raw
history blame
769 Bytes
metadata
license: cc-by-4.0

Each directory corresponds to a model/datapoint in the InterpBench dataset. It is structured as:

- task // directory name
-- ll_model_{weight}.pth // the low level transformer model
-- meta.json //ll_model_cfg_{weight}.pkl // a config for the transformer model
-- meta_{weight}.json // training hyperparams
-- edges.pkl // label for the circuit, i.e., list of all the edges that are a part of the ground truth circuit 

This repository of models is complimentary to CircuitsBenchmark, and should be used to load the models. Alternatively, TransformerLens can also be used to load it using the ll_config.json