pdfsandwich - Automatic OCR with text injection into PDFs
Petter Reinholdtsen
pere at hungry.com
Wed Jun 14 15:35:04 CEST 2017
I just came across <URL: http://www.tobias-elze.de/pdfsandwich/ >, which
seems interesting to us:
pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which
contain only images (no text) will be processed by optical character
recognition (OCR) and the text will be added to each page invisibly
"behind" the images.
I know some copier/scanner machines do this, but now we can do it in
batch. :)
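
A minimal sketch of what such a batch run could look like, assuming
pdfsandwich is installed and on the PATH. The helper names
(build_command, pending_pdfs, ocr_directory) and the directory layout
are made up for illustration. The -lang and -o options, and the
"_ocr.pdf" output naming, are taken from my reading of the pdfsandwich
manual page, so please check them against the installed version.

```python
import subprocess
from pathlib import Path


def build_command(pdf: Path, lang: str = "eng") -> list[str]:
    """Return the argument list for one pdfsandwich run on `pdf`.

    Assumption: `-lang` selects the OCR language and `-o` names the
    output file, as described in the pdfsandwich manual page.
    """
    out = pdf.with_name(pdf.stem + "_ocr.pdf")
    return ["pdfsandwich", "-lang", lang, "-o", str(out), str(pdf)]


def pending_pdfs(directory: Path) -> list[Path]:
    """List PDFs in `directory` that have no `_ocr.pdf` companion yet."""
    result = []
    for pdf in sorted(directory.glob("*.pdf")):
        if pdf.stem.endswith("_ocr"):
            continue  # already an OCR output, do not process it again
        if pdf.with_name(pdf.stem + "_ocr.pdf").exists():
            continue  # OCR output already present, skip
        result.append(pdf)
    return result


def ocr_directory(directory: Path, lang: str = "eng") -> None:
    """Run pdfsandwich on every pending PDF, stopping at the first failure."""
    for pdf in pending_pdfs(directory):
        subprocess.run(build_command(pdf, lang), check=True)
```

Calling ocr_directory(Path("scans")) would then process every PDF in a
(hypothetical) "scans" directory. Skipping files that already have an
"_ocr.pdf" companion makes it safe to rerun after an interrupted batch.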
--
Happy hacking
Petter Reinholdtsen