Palladian is a Java-based toolkit which provides functionality to perform typical Internet Information Retrieval tasks. It provides a collection of algorithms for text processing focused on classification, extraction of various types of information, and retrieval of data from the Web.
The nodes are intended to integrate with existing KNIME nodes, such as the KNIME Textprocessing and the KNIME XML-Processing nodes.
The growing collection of Palladian KNIME nodes makes Palladian’s capabilities available directly within KNIME, whether to complement and extend existing workflows or to prototype quickly without writing any code.
More information about the Palladian toolkit is available here: https://palladian.ai/
If you have any questions, comments, or problems, we are happy to hear from you: firstname.lastname@example.org
The Palladian extension is released under the Palladian Free Software License Version 2.1.
The Palladian KNIME Nodes were created by Philipp Katz, Klemens Muthmann, David Urbansky; 2011 – 2019.
For advanced web scraping, task automation, and web application testing, also check out the Selenium Nodes, which allow you to control your browser from KNIME.
The Palladian Nodes are your survival toolkit in the daily jungle of Web Information Extraction, Text Classification, and Geo Data.
Nodes for building dictionary-based classifiers for text documents. Using a set of labeled sample documents, one can build a dictionary and use it to classify uncategorized documents. Typical use cases for text classification include automated email spam detection, language identification, and sentiment analysis.
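To illustrate the general idea of a dictionary-based classifier, here is a minimal sketch in plain Java. It is not Palladian's implementation: the class name, the word-level tokenization, and the scoring by summed relative term frequencies are all assumptions chosen to keep the example short.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative dictionary-based text classifier (hypothetical, not Palladian's code). */
final class DictionaryClassifierSketch {
    // term -> (category -> how often the term was seen in samples of that category)
    private final Map<String, Map<String, Integer>> dictionary = new HashMap<>();

    /** Adds the tokens of one labeled sample document to the dictionary. */
    void train(String text, String category) {
        for (String token : tokenize(text)) {
            dictionary.computeIfAbsent(token, t -> new HashMap<>())
                      .merge(category, 1, Integer::sum);
        }
    }

    /** Returns the best-scoring category, or null if no token is in the dictionary. */
    String classify(String text) {
        Map<String, Double> scores = new HashMap<>();
        for (String token : tokenize(text)) {
            Map<String, Integer> counts = dictionary.get(token);
            if (counts == null) {
                continue; // unknown term, contributes nothing
            }
            int total = counts.values().stream().mapToInt(Integer::intValue).sum();
            // Each known term votes for its categories with its relative frequency.
            for (Map.Entry<String, Integer> e : counts.entrySet()) {
                scores.merge(e.getKey(), e.getValue() / (double) total, Double::sum);
            }
        }
        String best = null;
        double bestScore = 0.0;
        for (Map.Entry<String, Double> e : scores.entrySet()) {
            if (e.getValue() > bestScore) {
                best = e.getKey();
                bestScore = e.getValue();
            }
        }
        return best;
    }

    /** Assumption: lowercase word tokens split on anything that is not a letter or digit. */
    private static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase().split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        DictionaryClassifierSketch classifier = new DictionaryClassifierSketch();
        classifier.train("cheap pills buy now", "spam");
        classifier.train("meeting agenda for monday", "ham");
        System.out.println(classifier.classify("buy cheap pills"));
    }
}
```

In KNIME the same two steps, building the dictionary from labeled samples and applying it to new documents, are carried out by nodes rather than code.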
Nodes for extracting various kinds of information, mainly from unstructured text.
Nodes for working with geographic data. The Geo Nodes contain basic components, such as a “Geo Coordinate” cell type which represents a WGS84 latitude/longitude pair, a Haversine-based distance measure and aggregation methods for coordinate collections. The nodes include an extractor for location data from text, street address geocoding and reverse coordinate lookup.
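For readers unfamiliar with the Haversine measure mentioned above, the following Java sketch computes the great-circle distance between two WGS84 latitude/longitude pairs. It is an illustration only, not the nodes' actual code: the class and method names are hypothetical, and the mean Earth radius of 6371.0088 km is an assumed constant.

```java
/** Illustrative Haversine distance between two WGS84 coordinates (hypothetical sketch). */
final class HaversineSketch {
    // Assumption: mean Earth radius in kilometers (spherical approximation).
    static final double EARTH_RADIUS_KM = 6371.0088;

    /** Great-circle distance in kilometers; all arguments are in decimal degrees. */
    static double distanceKm(double lat1, double lng1, double lat2, double lng2) {
        double phi1 = Math.toRadians(lat1);
        double phi2 = Math.toRadians(lat2);
        double sinHalfDPhi = Math.sin(Math.toRadians(lat2 - lat1) / 2);
        double sinHalfDLambda = Math.sin(Math.toRadians(lng2 - lng1) / 2);
        // a = sin^2(dPhi/2) + cos(phi1) * cos(phi2) * sin^2(dLambda/2)
        double a = sinHalfDPhi * sinHalfDPhi
                + Math.cos(phi1) * Math.cos(phi2) * sinHalfDLambda * sinHalfDLambda;
        // Clamp to 1 to guard against rounding errors for near-antipodal points.
        return 2 * EARTH_RADIUS_KM * Math.asin(Math.min(1.0, Math.sqrt(a)));
    }

    public static void main(String[] args) {
        // Approximate coordinates, chosen only for illustration.
        double km = distanceKm(52.52, 13.405, 51.05, 13.74);
        System.out.printf("Distance: %.1f km%n", km);
    }
}
```

Because the formula treats the Earth as a sphere, results differ slightly from ellipsoidal methods; for most distance-based filtering and aggregation this is an acceptable trade-off.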
Nodes with various evaluation and scoring measures.
Utilities for running KNIME workflow tests with the KNIME Testing Framework.
Nodes for interacting with HTTP- and REST-based services and for parsing HTML data. The “HTTP Retriever” node can execute the HTTP methods GET, POST, HEAD, PUT, and DELETE. Utility nodes convert HTTP payloads into different formats; see the “Form Encoded HTTP Entity Creator” and the “Multipart Encoded HTTP Entity Creator”. For extracting data from HTML pages, use the “HTML Parser” node.
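As a rough picture of what a form-encoded HTTP entity contains, the Java sketch below builds an application/x-www-form-urlencoded body from key/value pairs using the standard library. The class name and field names are hypothetical, and the sketch makes no claim about how the entity creator nodes are implemented.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

/** Illustrative builder for an application/x-www-form-urlencoded body (hypothetical sketch). */
final class FormEncodingSketch {

    /** Percent-encodes each key and value and joins the pairs with '&'. */
    static String encode(Map<String, String> fields) {
        return fields.entrySet().stream()
                .map(e -> URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8)
                        + "=" + URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8))
                .collect(Collectors.joining("&"));
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the insertion order of the form fields.
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("q", "knime nodes");
        fields.put("lang", "en");
        System.out.println(encode(fields)); // prints "q=knime+nodes&lang=en"
    }
}
```

A body like this would typically be sent with a POST request and the Content-Type header set to application/x-www-form-urlencoded.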
To use this product in KNIME, install Palladian for KNIME from the following update site: