document_redaction / tools /aws_functions.py

Commit History

Laid groundwork for passing in AWS API keys. Duplicate pages option should now work for pages with no text.
7907ad4

seanpedrickcase commited on

Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page
23f8ca3

seanpedrickcase commited on

Can now specify the root path that the app will run on with an environment variable
b8e245f

seanpedrickcase commited on

Can now define queue size, max file size, and server port in environment variables
dc17f6e

seanpedrickcase commited on

Added option for running redact function through CLI (i.e. not going through Gradio UI or API). Test functions for running this through AWS Lambda.
e5dfae7

seanpedrickcase commited on

Only shows AWS options when AWS functions enabled. Can now upload previous review files to continue review later. Some review debugging.
e2aae24

seanpedrickcase commited on

Comprehend now uses custom spacy recognisers on top of defaults. Added zoom functionality to annotator. Fixed some pdf mediabox issues and redacted image output issues.
ec98119

seanpedrickcase commited on

When on AWS, now loads in a default allow_list to exclude common words from redaction. Improved checks on AWS Comprehend calls.
390bef2

seanpedrickcase commited on

Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
f0f9378

seanpedrickcase commited on

Allowed for time limits on redact to avoid timeouts. Improved review interface. Now accepts only one file at a time. Upgraded Gradio version
eea5c07

seanpedrickcase commited on

Added 'Review redactions' tab to the app. You can now visually inspect suggested redactions and modify/add with a point and click interface.
ebf9010

seanpedrickcase commited on

Improved allow list, handwriting/signature identification, logging
6ea0852

seanpedrickcase commited on

Updated default AWS_FUNCTION value. Logs seconds values from outputs correctly.
7aa4d5f

seanpedrickcase commited on

Enhanced logging of usage. Small buffer added to redaction rectangles as it seems to miss the tops of text often.
34addbf

seanpedrickcase commited on

Handles multiple runs with multiple files correctly now. Logging and feedback improvements.
bbf818d

seanpedrickcase commited on

Decision process now saved as log files. Other log files and feedback added
8c33828

seanpedrickcase commited on

Can now redaction text or csv/xlsx files. Can redact multiple files. Embeds redactions as image-based file by default
7810536

seanpedrickcase commited on

Changed sts region to EU from US
19846ba

seanpedrickcase commited on

Separated file preparation and file redaction functions. Hopefully sts endpoint access now works on AWS
0f18146

seanpedrickcase commited on