Author: seanpedrickcase
Commit: 44647fa
Parent(s): 504104c
Updated gradio to the latest version (4.31.0) in the hope of addressing AWS server timeout issues. The other tested package versions are now pinned in requirements.txt.
requirements.txt CHANGED (+5 -4)
@@ -1,15 +1,16 @@
 pdfminer.six==20231228
 pdf2image==1.17.0
-opencv-python
+opencv-python==4.9.0.80
 presidio_analyzer==2.2.351
 presidio_anonymizer==2.2.351
 presidio-image-redactor==0.0.52
 pikepdf==8.15.1
 pandas==2.2.2
-spacy
+spacy==3.7.4
 en_core_web_lg @ https://github.com/explosion/spacy-models/releases/download/en_core_web_lg-3.7.1/en_core_web_lg-3.7.1.tar.gz
-gradio
-boto3
+gradio==4.31.0
+boto3==0.12.1
+# Following are not currently necessary for the app, may be added for improved pdf parsing in future
 #unstructured
 #unstructured_inference # This is big! Only necessary if you want to use the high res strategy in pdf_partition
 #unstructured_pytesseract
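Since this change pins exact versions, it may help to confirm that a deployed environment actually matches the pins. The following is a minimal sketch, not part of the commit: `parse_pins` and `check_pins` are hypothetical helper names, the comment handling is deliberately simplistic (everything after a `#` is dropped), and only `name==version` lines are checked, so the `en_core_web_lg @ URL` entry is skipped.

```python
from importlib import metadata


def parse_pins(text):
    """Return {package_name: pinned_version} for lines of the form name==version."""
    pins = {}
    for raw in text.splitlines():
        # Simplistic comment stripping: assumes '#' never appears inside a requirement.
        line = raw.split("#", 1)[0].strip()
        if "==" not in line:
            continue  # skips blanks, comments, unpinned names, and 'name @ URL' entries
        name, version = line.split("==", 1)
        pins[name.strip()] = version.strip()
    return pins


def check_pins(pins):
    """Return {name: (pinned, installed_or_None)} for every package that differs."""
    mismatches = {}
    for name, pinned in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != pinned:
            mismatches[name] = (pinned, installed)
    return mismatches


# A few lines taken from the new requirements.txt, for illustration only.
EXAMPLE = """\
pdfminer.six==20231228
opencv-python==4.9.0.80
en_core_web_lg @ https://github.com/explosion/spacy-models/releases/download/en_core_web_lg-3.7.1/en_core_web_lg-3.7.1.tar.gz
gradio==4.31.0
#unstructured
"""

if __name__ == "__main__":
    for name, (pinned, installed) in check_pins(parse_pins(EXAMPLE)).items():
        print(f"{name}: pinned {pinned}, installed {installed}")
```

In practice one would read the real requirements.txt with `open(...).read()` instead of using the inline example, and running the check at app start-up could surface a version drift before it shows up as a server-side issue.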