This node collection integrates the full functionality of Stirling PDF into KNIME, enabling flexible, automated, and privacy-friendly PDF processing directly in your workflows.
You can connect to either:
• The public Stirling PDF API, or
• Your own self-hosted Stirling PDF server, ideal for privacy-sensitive and compliant document workflows.
You can run Stirling PDF via Docker on your own system or server with a single command and connect using the Server node.
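As a rough illustration of what a server connection consists of, the sketch below models it as a base URL plus an optional API key. The `ServerConfig` class, the port `8080`, the `X-API-KEY` header name, and the endpoint path are all assumptions made for this example. They are not taken from the Server node's settings, so check your own server's API documentation before relying on them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ServerConfig:
    """Connection settings for one Stirling PDF server (hypothetical helper)."""
    base_url: str                  # e.g. "http://localhost:8080" for a local container (assumed port)
    api_key: Optional[str] = None  # only needed if the server enforces authentication

    def url(self, path: str) -> str:
        # Join base URL and endpoint path with exactly one slash between them.
        return self.base_url.rstrip("/") + "/" + path.lstrip("/")

    def headers(self) -> dict:
        # "X-API-KEY" is an assumed header name; verify it against your server.
        return {"X-API-KEY": self.api_key} if self.api_key else {}


local = ServerConfig("http://localhost:8080/")
# The endpoint path below is an assumed example, not a documented guarantee.
print(local.url("/api/v1/general/merge-pdfs"))
```

Switching between the public API and a self-hosted server then only means swapping the `base_url` (and key), which is the same idea as pointing the Server node at a different host.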
Automate PDF processing (such as split, merge, convert, and OCR) through Stirling PDF’s public or self-hosted API.
Check basic properties of PDFs before you process them: page count, file size, page size/rotation, and whether they contain text or images. Use these endpoints as a "pre-check" step to decide what to do with a file next.
Typical uses:
• Reject files that are too big or too small
• Detect image-only PDFs that should go through OCR
• Ensure a document has enough pages before it enters a workflow
• Check orientation of pages before printing or merging
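The pre-check idea can be sketched as a small routing function. This is only an illustration of the decision logic. The `PdfInfo` fields, the `route` function, and the thresholds are hypothetical and do not mirror the actual output columns of the nodes or the API response format.

```python
from dataclasses import dataclass


@dataclass
class PdfInfo:
    """Basic properties a pre-check step might report (hypothetical field names)."""
    page_count: int
    size_bytes: int
    has_text: bool
    has_images: bool


def route(info: PdfInfo, min_pages: int = 1, max_bytes: int = 50 * 1024 * 1024) -> str:
    """Return the next step for a file: 'reject', 'ocr', or 'process'."""
    if info.size_bytes == 0 or info.size_bytes > max_bytes:
        return "reject"   # empty file or larger than the allowed maximum
    if info.page_count < min_pages:
        return "reject"   # not enough pages to enter the workflow
    if info.has_images and not info.has_text:
        return "ocr"      # image-only scan: make it searchable first
    return "process"


scan = PdfInfo(page_count=3, size_bytes=200_000, has_text=False, has_images=True)
print(route(scan))
```

In a KNIME workflow the same branching would typically be done with row filters or switch nodes on the pre-check results.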
Security APIs
Run several PDF operations in one configured pipeline instead of calling multiple endpoints yourself. Useful when you always do the same steps in sequence (for example: convert → OCR → compress → watermark).
Typical uses:
• Process incoming invoices in one go (clean, OCR, compress, stamp, etc.)
• Normalise documents before they enter an archive
• Wrap a complex document flow behind a single API call for your own apps
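Conceptually, a pipeline is a fixed sequence of steps where each step's output becomes the next step's input. The sketch below shows only that idea with placeholder steps. It is not the configuration format of Stirling PDF's pipeline feature, and `make_pipeline`, `tag`, and `process_invoice` are hypothetical names introduced for this example.

```python
from functools import reduce
from typing import Callable, Sequence

Step = Callable[[bytes], bytes]


def make_pipeline(steps: Sequence[Step]) -> Step:
    """Compose steps so the output of each one feeds the next."""
    def run(document: bytes) -> bytes:
        return reduce(lambda data, step: step(data), steps, document)
    return run


def tag(name: str) -> Step:
    # Placeholder step: appends its name so the execution order is visible.
    # A real step would send the bytes to the matching PDF operation instead.
    return lambda data: data + f"|{name}".encode()


process_invoice = make_pipeline(
    [tag("convert"), tag("ocr"), tag("compress"), tag("watermark")]
)
print(process_invoice(b"invoice"))  # b'invoice|convert|ocr|compress|watermark'
```

Defining the sequence once and reusing it is what lets a caller treat the whole flow as a single operation.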
Tools that don't fit neatly elsewhere: OCR, compress, repair, flatten, extract images, update metadata, add stamps/page numbers/images, and more. These endpoints help fix problem PDFs and prepare them for sharing, storage, or further processing.
Typical uses:
• Repair a damaged PDF or remove blank pages
• Run OCR on scanned PDFs so they become searchable
• Compress large PDFs for email or web download
• Extract embedded images or scans
• Add page numbers, stamps, or overlay an image (e.g. logo, seal)
• Update PDF metadata (title, author, etc.)
Convert PDFs to and from other formats (Word, images, HTML, Markdown, PDF/A, CBZ/CBR, EML, etc.). This group also powers the text-editor / jobId-based editing flow for incremental PDF edits.
Typical uses:
• Turn PDFs into Word or text for editing
• Convert office files, images, HTML, or email (EML) into PDFs
• Create PDF/A for long-term archiving
• Export PDFs as images, HTML, CSV, or Markdown for search, analysis, or reuse
Page-level PDF editing: split, merge, rotate, crop, rearrange, and scale pages. These endpoints handle most daily "I opened a PDF editor just to…" type tasks.
Typical uses:
• Split a large PDF into smaller files (by pages, chapters, or size)
• Merge several PDFs into one report or pack
• Rotate or reorder pages before sending or archiving
• Turn a multi-page document into one long scrolling page
Node to configure the server endpoints
Nodes for different authorization methods
Work with PDF form fields: read them, fill them, edit them, or remove them. These endpoints treat a PDF as a structured form instead of just flat pages.
Typical uses:
• Inspect which form fields exist in a PDF
• Autofill forms from your own systems (e.g. CRM, ERP)
• Change or delete form fields before sending out a final, non-editable copy
• Unlock read-only form fields when you need to update them
Nodes for advanced configuration
To use this product in KNIME, install the extension Stirling PDF Nodes from the update site below:
A zipped version of the software site can be downloaded here.