This node is currently not available in KNIME v5.12 — instead we’re showing this page for KNIME v5.11. You can use the version menu in the title bar to permanently switch your preferred version. This will also show the link to the update site.

Process PDF With OCR

Go to Product

This endpoint processes a PDF file using OCR (Optical Character Recognition). Users can specify languages, sidecar, deskew, clean, cleanFinal, ocrType, ocrRenderType, and removeImagesAfter options. Uses OCRmyPDF if available, falls back to Tesseract. Input:PDF Output:PDF Type:SI-Conditional

Options

File Input

Languages

List of languages to use in OCR processing, e.g., 'eng', 'deu'

Set Sidecar

Enable to set the optional field Sidecar

Sidecar

Include OCR text in a sidecar text file if set to true

Set Deskew

Enable to set the optional field Deskew

Deskew

Deskew the input file if set to true

Set Clean

Enable to set the optional field Clean

Clean

Clean the input file if set to true

Set Clean Final

Enable to set the optional field Clean Final

Clean Final

Clean the final output if set to true

Ocr Type

Specify the OCR type, e.g., 'skip-text', 'force-ocr', or 'Normal'

Ocr Render Type

Specify the OCR render type, either 'hocr' or 'sandwich'

Set Remove Images After

Enable to set the optional field Remove Images After

Remove Images After

Remove images from the output PDF if set to true

Result Format

Specify how the response should be mapped to the table output. The following formats are available:

Raw Response: Returns the raw response in a single row with the following columns:

body: Response body
status: HTTP status code

Input Ports

: Configuration data.

Output Ports

: Result of the request depending on the selected Result Format.
: Configuration data (this is the same as the input port; it is provided as passthrough for sequentially chaining nodes to declutter your workflow connections).

Popular Predecessors

No recommendations found

Popular Successors

No recommendations found

Views

This node has no views

Workflows

No workflows found

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Go to Product

Installation

To use this node in KNIME, install the extension Stirling PDF Nodes from the below update site following our NodePit Product and Node Installation Guide:

v5.11

A zipped version of the software site can be downloaded here.

Plugin provider: NodePit

Plugin version: 2.0.0.202601121736

On NodePit since: 2026-03-10

Last update: 2026-07-13

Tags: Modern UI

KNIME versions: From v5.3 to v5.11

NodePit ExclusiveOnly available on NodePit

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!