Palladian

Palladian is a Java-based toolkit which provides functionality to perform typical Internet Information Retrieval tasks. It provides a collection of algorithms for text processing focused on classification, extraction of various types of information, and retrieval of data from the Web.

The growing collection of Palladian KNIME nodes makes Palladian’s capabilities available directly within KNIME, to complement and extend existing workflows or to prototype quickly without writing any code.

The nodes are intended to integrate with existing KNIME nodes, such as the KNIME Textprocessing and the KNIME XML-Processing nodes.

More information about the Palladian toolkit is available here: https://palladian.ai/

If you have any questions, comments, or problems, we are happy to hear from you: mail@palladian.ai

The Palladian KNIME Nodes were created by Philipp Katz, Klemens Muthmann, and David Urbansky, 2011 – 2021.

For advanced web scraping, task automation, and web application testing, also check out the Selenium Nodes, which allow you to control your browser from KNIME.

Content

Palladian

The Palladian Nodes are your survival toolkit in the daily jungle of Web Information Extraction, Text Classification, and Geo Data.

Classification

Nodes for building dictionary-based classifiers for text documents. From a set of labeled sample documents, one can build a dictionary and use it to classify uncategorized documents. Typical use cases for text classification include automated email spam detection, language identification, and sentiment analysis.
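The idea of building a dictionary from labeled documents and using it to classify new ones can be sketched in a few lines of Java. This is a minimal illustration only, not Palladian’s actual implementation: the class name `DictionaryClassifierSketch`, the word-level tokenization, and the scoring rule (summing each term’s relative frequency per category) are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch of a dictionary-based text classifier (not Palladian's API).
final class DictionaryClassifierSketch {

    // Dictionary: term -> (category -> number of training occurrences).
    private final Map<String, Map<String, Integer>> dictionary = new HashMap<>();

    // Adds every term of a labeled sample document to the dictionary.
    void train(String text, String category) {
        for (String term : tokenize(text)) {
            dictionary.computeIfAbsent(term, t -> new HashMap<>())
                      .merge(category, 1, Integer::sum);
        }
    }

    // Scores each category by summing, over all known terms in the text,
    // the share of that term's training occurrences seen in the category.
    Optional<String> classify(String text) {
        Map<String, Double> scores = new HashMap<>();
        for (String term : tokenize(text)) {
            Map<String, Integer> counts = dictionary.get(term);
            if (counts == null) {
                continue; // term never seen during training
            }
            int total = counts.values().stream().mapToInt(Integer::intValue).sum();
            counts.forEach((category, count) ->
                    scores.merge(category, (double) count / total, Double::sum));
        }
        return scores.entrySet().stream()
                .max(Map.Entry.comparingByValue())
                .map(Map.Entry::getKey);
    }

    // Assumed tokenization: lower-cased runs of letters and digits.
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String part : text.toLowerCase().split("[^\\p{L}\\p{N}]+")) {
            if (!part.isEmpty()) {
                terms.add(part);
            }
        }
        return terms;
    }
}
```

After training with a few labeled samples, `classify` returns the best-scoring category, or an empty `Optional` when none of the text’s terms occur in the dictionary.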

Extraction

Nodes for extracting various kinds of information, mainly from unstructured text.

Geo

Nodes for working with geographic data. The Geo Nodes contain basic components, such as a “Geo Coordinate” cell type which represents a WGS84 latitude/longitude pair, a Haversine-based distance measure, and aggregation methods for coordinate collections. The nodes also include an extractor for location data from text, street address geocoding, and reverse coordinate lookup.
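For readers unfamiliar with the Haversine formula, the following is a minimal Java sketch of a great-circle distance between two WGS84 latitude/longitude pairs. It is not taken from the Palladian source: the class name `HaversineSketch`, the method `distanceKm`, and the mean Earth radius constant are assumptions chosen for illustration.

```java
// Hypothetical sketch of a Haversine distance (not Palladian's API).
final class HaversineSketch {

    // Assumed mean Earth radius in kilometres; other implementations may use a different value.
    private static final double EARTH_RADIUS_KM = 6371.0088;

    // Great-circle distance in kilometres between two points given in decimal degrees.
    static double distanceKm(double lat1, double lon1, double lat2, double lon2) {
        double dLat = Math.toRadians(lat2 - lat1);
        double dLon = Math.toRadians(lon2 - lon1);
        double sinLat = Math.sin(dLat / 2);
        double sinLon = Math.sin(dLon / 2);
        double a = sinLat * sinLat
                + Math.cos(Math.toRadians(lat1)) * Math.cos(Math.toRadians(lat2))
                * sinLon * sinLon;
        // Clamp to 1.0 to guard against rounding errors for near-antipodal points.
        return 2 * EARTH_RADIUS_KM * Math.asin(Math.min(1.0, Math.sqrt(a)));
    }

    public static void main(String[] args) {
        // Example coordinates (approximately Dresden and Berlin).
        System.out.printf("%.1f km%n", distanceKm(51.0504, 13.7373, 52.5200, 13.4050));
    }
}
```

The formula treats the Earth as a sphere, so results deviate slightly from ellipsoidal (true WGS84) distances; for most distance filtering and clustering tasks that deviation is small.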

Images

Nodes for working with images.

Scoring

Nodes with various evaluation and scoring measures.

Testing

Utilities for running KNIME workflow tests with the KNIME Testing Framework.

Web

Nodes for interacting with HTTP- and REST-based services and for parsing HTML data. The “HTTP Retriever” node can execute the HTTP methods GET, POST, HEAD, PUT, DELETE, and PATCH. Utility nodes convert HTTP payloads into different formats; see the “Form Encoded HTTP Entity Creator” and “Multipart Encoded HTTP Entity Creator” nodes. For extracting data from HTML pages, use the “HTML Parser” node.
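To illustrate what a form-encoded HTTP payload looks like outside of KNIME, here is a minimal sketch using only the Java standard library (Java 11 or later). It does not show how the Palladian nodes work internally; the class `FormEncodingSketch`, its methods, and the URL `https://example.org/submit` are hypothetical and exist only for this example. The request is built but never sent.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical sketch of form encoding with the JDK (not Palladian's API).
final class FormEncodingSketch {

    // Encodes key/value pairs as an application/x-www-form-urlencoded body.
    static String encode(Map<String, String> fields) {
        return fields.entrySet().stream()
                .map(e -> URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8)
                        + "=" + URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8))
                .collect(Collectors.joining("&"));
    }

    // Builds (but does not send) a POST request carrying the encoded body.
    static HttpRequest buildPost(String url, String body) {
        return HttpRequest.newBuilder(URI.create(url))
                .header("Content-Type", "application/x-www-form-urlencoded")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
    }

    public static void main(String[] args) {
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("q", "geo data");
        fields.put("lang", "en");
        String body = encode(fields);
        HttpRequest request = buildPost("https://example.org/submit", body);
        System.out.println(request.method() + " " + request.uri() + " with body: " + body);
    }
}
```

A multipart-encoded payload follows the same pattern but separates the parts with a boundary string instead of `&`, which is what makes it suitable for file uploads.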

Installation

To use this product in KNIME, install the extensions “Palladian for KNIME”, “Palladian for KNIME: MaxMind Nodes”, and “Palladian for KNIME: Additional Hashing Algorithms” from the update site below:

v4.4

A zipped version of the software site can be downloaded here.