Posts

Showing posts with the label windows

Preparing Windows for Tesseract "Makefile training" (LSTM training)

Image
The Tesseract Makefile training/LSTM training from existing images was created on Linux (unix like) system using the usual Unix tools.  Fortunately, many these tools can also be installed on Windows.

pdf to image/multipage (in python)

Many people love pdf format as such files are readable on all common devices and it provide stable representation of document. But somehow also scanner started to support it but with one feature: they just encapsulate images to it without OCRing it, so such pdf are not reachable.

Building tesserocr on MS Windows 64bit

If you search for efficient solution of using tesseract OCR in python you will need to use tessocer. But there are no recent version of project for current version of python on windows. So you have build it by yourself.