Tesseract is an open source Optical Character Recognition (OCR) engine, which is used by MyQ in the MyQ OCR Server.
The Tesseract OCR engine supports the following formats:
-
PDF
-
PDFA (the compliance level of PDFA is PDFA-1B)
-
TXT
Tesseract can be used to process documents in many language – for more information, see Supported Languages.
For further information about the Tesseract engine, see the dedicated documentation from its developer.