Pythia-410M models pre-trained by MATES.

The training step is the iteration divided by 4, i.e., iter-040000-ckpt.pth corresponds to the model checkpoint in step 10000.

Paper: MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models

Official codebase: https://github.com/cxcscmu/MATES

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Dataset used to train yuzc19/pythia-410m-mates