*_api_key.txt __pycache__ build dist /.venv/ .env .vscode .mypy_cache .pytest_cache *.pyc flagged wandb logs temp tests/logs tests/wandb # Outputs generated by the cli demo cached_generated_dataset/ generated_dataset/ huggingface_data/huggingface_datasets/dataset_index.json huggingface_data/huggingface_datasets/huggingface_datasets_datafinder_index huggingface_data/huggingface_datasets/reranking_dataset_index.json huggingface_data/huggingface_models/ retrieved_dataset_dict/ result/ checkpoint/ status.yaml # Outputs generated by the colab demo trained_model/ trained_tokenizer/ # data babylm_data checkpoint-*/ # hf models info huggingface_models/