Tesseract (DISABLED - will be removed 8/18/2017)

NOTE: Due to lack of use, this software module has been disabled and will be removed from MSI systems on 8/18/2017. If you need this software, please contact help@msi.umn.edu. Tesseract is a free optical character recognition engine developed originally by HP and currently being maintained by Google. It has been voted as one of the best OCR engine in the world. It has no layout engine, no output formatting and no GUI. It has been trained to perform recognition on many languages like English, French, German etc. It can also be taught to recognize other languages. Currently it can only read tiff and bmp images.

To run this software interactively in a Linux environment run the commands:
module load tesseract
tesseract imagename textfileoutputname
The image file corresponding to 'imagename' is transcribed and the output is stored in the text file, 'textfileoutputname.txt'.
 
If you need more details, visit the official documentation.
SW version: 
3.01
Support level: 
Access level: 
Software category: 
Platform: