document_redaction / requirements.txt
seanpedrickcase's picture
Unspecifying gradio and spacy in requirements, then reinstalling latest gradio afterwards in Dockerfile. All to try to avoid typer conflict
619a281
raw
history blame
816 Bytes
pdfminer.six==20231228
pdf2image==1.17.0
opencv-python==4.9.0.80
presidio_analyzer==2.2.351
presidio_anonymizer==2.2.351
presidio-image-redactor==0.0.52
pikepdf==8.15.1
pandas==2.2.2
spacy # Not specified as latest versions create a conflict with latest versions of gradio
en_core_web_lg @ https://github.com/explosion/spacy-models/releases/download/en_core_web_lg-3.7.1/en_core_web_lg-3.7.1.tar.gz
gradio # Not specified as latest versions create a conflict with latest versions of spacy
boto3==1.34.103
# Following are not currently necessary for the app, may be added for improved pdf parsing in future
#unstructured
#unstructured_inference # This is big! Only necessary if you want to use the high res strategy in pdf_partition
#unstructured_pytesseract
#pillow-heif
#python-docx
#python-pptx