*.csv *.pdf *.url *.jpg *.png *.ipynb examples/* processing/* output/* tools/__pycache__/* old_code/* tesseract/* poppler/* build/* dist/* build_deps/*