pdfsandwich - Automatic OCR with text injection into PDFs

Petter Reinholdtsen pere at hungry.com
Wed Jun 14 15:35:04 CEST 2017


I just came across <URL: http://www.tobias-elze.de/pdfsandwich/ >, which
seems interesting to us:

  pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which
  contain only images (no text) will be processed by optical character
  recognition (OCR) and the text will be added to each page invisibly
  "behind" the images.

I know some copier/scanner machines do this, but now we can do it in
batch. :)
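
As a rough idea of what the batch part could look like, here is a
minimal Python sketch that runs pdfsandwich on every PDF in a
directory.  It rests on a few assumptions that should be checked against
the installed version: that the program is on the PATH as "pdfsandwich",
that it takes the input file as its last argument, that "-lang" selects
the OCR language, and that output files get an "_ocr" suffix by default.
The function names build_commands and run_batch are made up for
illustration.

```python
import subprocess
from pathlib import Path


def build_commands(directory, lang=None):
    """Return one pdfsandwich command line per PDF found in directory."""
    commands = []
    for pdf in sorted(Path(directory).glob("*.pdf")):
        # Assumption: earlier output is named <input>_ocr.pdf, so skip
        # those files to avoid running OCR on them a second time.
        if pdf.stem.endswith("_ocr"):
            continue
        cmd = ["pdfsandwich"]
        if lang:
            # Assumption: -lang passes the language on to the OCR engine.
            cmd += ["-lang", lang]
        cmd.append(str(pdf))
        commands.append(cmd)
    return commands


def run_batch(directory, lang=None):
    """Run pdfsandwich on each PDF in turn, stopping at the first failure."""
    for cmd in build_commands(directory, lang):
        subprocess.run(cmd, check=True)
```

Calling run_batch("scans") would then process every PDF in a directory
named "scans".  Building the command lines separately from running them
makes it possible to inspect what would be executed before starting a
long OCR job.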

-- 
Happy hacking
Petter Reinholdtsen

