Poor Man's Textract
Amazon Textract is a (paid) service that "automatically extracts text and data from scanned documents. Amazon Textract goes beyond simple optical character recognition (OCR) to also identify the contents of fields in forms and information stored in tables." We want to build a free alternative that produces output of similar quality.
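To make "identify the contents of fields in forms" more concrete, here is a minimal sketch of one possible post-processing step. It assumes an OCR engine has already produced words with pixel positions. The Word type, the function names, the tolerance value, and the "Key: value" convention are all hypothetical and chosen for illustration; they are not taken from Amazon Textract or from the existing PMT project.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One OCR result (hypothetical shape): recognized text plus its position."""
    text: str
    x: int  # left edge in pixels
    y: int  # top edge in pixels


def group_into_lines(words, y_tolerance=5):
    """Group words into text lines by comparing top edges.

    A word joins the current line when its top edge is within
    y_tolerance pixels of the first word placed on that line.
    """
    lines = []
    for word in sorted(words, key=lambda w: (w.y, w.x)):
        if lines and abs(lines[-1][0].y - word.y) <= y_tolerance:
            lines[-1].append(word)
        else:
            lines.append([word])
    # Within each line, read the words from left to right.
    return [
        " ".join(w.text for w in sorted(line, key=lambda w: w.x))
        for line in lines
    ]


def extract_fields(lines):
    """Treat every 'Key: value' line as a form field (a simplifying assumption)."""
    fields = {}
    for line in lines:
        key, sep, value = line.partition(":")
        if sep and key.strip() and value.strip():
            fields[key.strip()] = value.strip()
    return fields


if __name__ == "__main__":
    # Made-up sample input standing in for real OCR output.
    sample = [
        Word("Name:", 10, 12), Word("Ada", 80, 10), Word("Lovelace", 120, 11),
        Word("Date:", 10, 40), Word("1843", 80, 42),
        Word("Notes", 10, 70),
    ]
    print(extract_fields(group_into_lines(sample)))
```

Real scanned forms are messier than this (skewed pages, multi-column layouts, labels placed above their values), so a sketch like this is only a starting point for thinking about the problem.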
Improve upon the existing PMT project: https://github.com/kenAlparslan/Texttract
Previous (GCI) tasks that did something similar (albeit simpler):
Take a look at this page.