Your survival toolkit in the daily jungle of Web Information Extraction, Text Classification, and Geo Data.

Palladian is a Java-based toolkit which provides functionality to perform typical Internet Information Retrieval tasks. It provides a collection of algorithms for text processing focused on classification, extraction of various types of information, and retrieval of data from the Web.

The growing collection of Palladian KNIME nodes provide the possibility to use Palladian’s capabilities directly within KNIME, to complement and extend existing workflows, or to allow for quick prototyping without having to write any code.

The Palladian nodes are entirely free if you use them with free versions of the KNIME Analytics Platform – no matter if you use them commercially, for research, for educational, or for private purposes. Only if you run them on a non-free, paid KNIME product (i.e. the “KNIME Server”), a paid license is required. Follow the yellow “Buy Now” button to purchase a license for KNIME Server use. This “free-for-free” licensing model allows to fund the ongoing improvement, maintenance, and support of this software and make it available to a broad audience.

More information about the Palladian toolkit is available here:

If you have any questions, comments, or problems, we are happy to hear from you:

The Palladian KNIME Nodes were created by Philipp Katz, Klemens Muthmann, David Urbansky; 2011 – 2024.

There’s even more — check out the Selenium Nodes!

For advanced web scraping, task automatization and web application testing, also check out the Selenium Nodes, which allow you to control your browser from KNIME.


VendorPhilipp Katz
AddressBienertstraße 33,01187 Dresden,Germany
VAT ID NumberDE310347520



The Palladian Nodes are your survival toolkit in the daily jungle of Web Information Extraction, Text Classification, and Geo Data.


Nodes for building dictionary-based classifiers for text documents. Using a set of labeled sample documents, one can build a dictionary and use it to classify uncategorized documents. Typical use cases for text classification are e.g. automated email spam detection, language identification, or sentiment analysis.


Nodes for working with collection data cells.


Nodes for extracting various kind of information mainly from unstructured text.


Nodes for working with geographic data. The Geo Nodes contain basic components, such as a “Geo Coordinate” cell type which represents a WGS84 latitude/longitude pair, a Haversine-based distance measure and aggregation methods for coordinate collections. The nodes include an extractor for location data from text, street address geocoding and reverse coordinate lookup.


Nodes for working with images


Nodes with various evaluation and scoring measures.


Utilities for running KNIME workflow tests with the KNIME Testing Framework.


Nodes to interact with HTTP- and REST-based services and for parsing HTML data. Using the “HTTP Retriever” node, different HTTP methods can be executed: GET, POST, HEAD, PUT, DELETE, and PATCH. Utility nodes allow to convert HTTP payloads in different formats, see the “Form Encoded HTTP Entity Creator” and “Multipart Encoded HTTP Entity Creator”. For extracting data from HTML pages, use the “HTML Parser” node.


Further Links


To use this product in KNIME, install the extensions Palladian for KNIME and Palladian for KNIME: Additional Hashing Algorithms from the below update site:


A zipped version of the software site can be downloaded here.

NodePit Exclusive Only available on NodePit