Tesseract

Open-source OCR (optical character recognition) engine

https://github.com/tesseract-ocr/tesseract

=Application=
 * http://hortonworks.com/hadoop-tutorial/indexing-and-searching-text-within-images-with-apache-solr/