Posts

Showing posts with the label mupdf

OCR pdf file in python on the fly

With   PyMuPDF and tesserocr you can OCR image pdf easily

pdf to image/multipage (in python)

Many people love pdf format as such files are readable on all common devices and it provide stable representation of document. But somehow also scanner started to support it but with one feature: they just encapsulate images to it without OCRing it, so such pdf are not reachable.