OCR_Python

OCR Foreign Language PDFs with Python and KNIME

This workflow shows you how to OCR a Foreign Language (Japanese, but this can be changed in the Python script) from PDFs which are text-based or image-based using Python and KNIME.

This workflow requires several installations via the terminal and the location of those installation locations must be entered into the component to run this workflow.

If you have any questions please post to the KNIME Forum and tag me using @victor_palacios

This was primarily created for Mac users who want to OCR, but Windows users will find instructions in the Python node.

Nodes

Catch Errors (Data Ports)2 ×
Chunk Loop Start2 ×
Component Input2 ×
Component Output2 ×
Loop End2 ×
Show all 21 nodes

Extensions

FeatureKNIME Base nodes
FeatureKNIME JavaScript Views
FeatureKNIME Javasnippet
FeatureKNIME Python 2 Integration (legacy)
FeatureKNIME Quick Forms
Show all 6 modules

OCR_​Python

Nodes

Extensions

Links

Download

OCR_Python