Code Yarns ‍👨‍💻
Tech BlogPersonal Blog

How to perform OCR on a DJVU document

📅 2012-Nov-29 ⬩ ✍️ Ashwin Nanjappa ⬩ 🏷️ cuneidjvu, djvu, ocr ⬩ 📚 Archive

OCR on DJVU using CuneiDjVu

A DJVU document typically contains both a layer of scanned image and a layer of the text in that image. Sometimes, a DJVU document is produced which does not have the text layer. This makes it hard to search and find text in the document.

Recognising the text in the DJVU document using OCR and adding that as a text layer to that document is easy:

  1. Download CuneiDjVu and unzip its contents.

  2. Run the CuneiDjVu program.

  3. Choose the DJVU document as input, choose the output folder and the OCR language.

  4. Press Process and the resulting file will be CuneiDjVu Result.djvu on your Desktop.

Tried with: CuneiDjVu 1.4 and Windows 7 Professional x64


© 2022 Ashwin Nanjappa • All writing under CC BY-SA license • 🐘 @codeyarns@hachyderm.io📧