Amazon has unveiled a new service named Textract, which lets Amazon Web Services (AWS) customers accurately extract text from documents. The name is a portmanteau of ‘Text’ and ‘Extract’, and it does exactly what it says. The software performs the same basic task as a standard OCR (Optical Character Recognition) reader, but it is designed to handle the cases where a conventional OCR scanner falls short.
Amazon’s software is designed to identify various document formats and analyze their contents so that they can be processed in the desired manner. For instance, if the document contains a table, Textract will reproduce the data in a structured form that is [...]
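To make the table case concrete, here is a minimal Python sketch of how table data returned by the service might be turned back into rows and columns. It assumes the block layout of a Textract response (a list of blocks with `BlockType`, `Relationships`, `RowIndex` and `ColumnIndex` fields). The function name `table_rows` and the sample data are hypothetical, written by hand for illustration, and are not real service output.

```python
def table_rows(blocks):
    """Rebuild every TABLE block as a list of rows, each row a list of cell strings.

    `blocks` is assumed to be the "Blocks" list from a Textract-style response.
    """
    by_id = {b["Id"]: b for b in blocks}

    def child_ids(block):
        # A block points at its children through "CHILD" relationships.
        for rel in block.get("Relationships") or []:
            if rel["Type"] == "CHILD":
                yield from rel["Ids"]

    tables = []
    for block in blocks:
        if block["BlockType"] != "TABLE":
            continue
        cells = {}
        for cell_id in child_ids(block):
            cell = by_id[cell_id]
            if cell["BlockType"] != "CELL":
                continue
            words = [
                by_id[w]["Text"]
                for w in child_ids(cell)
                if by_id[w]["BlockType"] == "WORD"
            ]
            # RowIndex and ColumnIndex are 1-based.
            cells[(cell["RowIndex"], cell["ColumnIndex"])] = " ".join(words)
        n_rows = max((r for r, _ in cells), default=0)
        n_cols = max((c for _, c in cells), default=0)
        tables.append(
            [[cells.get((r, c), "") for c in range(1, n_cols + 1)]
             for r in range(1, n_rows + 1)]
        )
    return tables


# Hand-written sample (hypothetical, not real service output): a 2x2 table.
sample_blocks = [
    {"Id": "t1", "BlockType": "TABLE",
     "Relationships": [{"Type": "CHILD", "Ids": ["c1", "c2", "c3", "c4"]}]},
    {"Id": "c1", "BlockType": "CELL", "RowIndex": 1, "ColumnIndex": 1,
     "Relationships": [{"Type": "CHILD", "Ids": ["w1"]}]},
    {"Id": "c2", "BlockType": "CELL", "RowIndex": 1, "ColumnIndex": 2,
     "Relationships": [{"Type": "CHILD", "Ids": ["w2"]}]},
    {"Id": "c3", "BlockType": "CELL", "RowIndex": 2, "ColumnIndex": 1,
     "Relationships": [{"Type": "CHILD", "Ids": ["w3"]}]},
    {"Id": "c4", "BlockType": "CELL", "RowIndex": 2, "ColumnIndex": 2,
     "Relationships": [{"Type": "CHILD", "Ids": ["w4"]}]},
    {"Id": "w1", "BlockType": "WORD", "Text": "Item"},
    {"Id": "w2", "BlockType": "WORD", "Text": "Price"},
    {"Id": "w3", "BlockType": "WORD", "Text": "Pen"},
    {"Id": "w4", "BlockType": "WORD", "Text": "2"},
]

if __name__ == "__main__":
    for row in table_rows(sample_blocks)[0]:
        print(row)
```

In practice the block list would come from the service itself, for example from the `Blocks` field of the response to `analyze_document` on a boto3 `textract` client called with `FeatureTypes=["TABLES"]`. That call needs AWS credentials and is left out of the sketch so that it runs offline.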