Palladian

Your survival toolkit in the daily jungle of Web Information Extraction, Text Classification, and Geo Data.

Changelog

This log gives an overview of the most prominent changes in each release. Minor fixes, changes “under the hood” and refactoring changes are not listed here. While we were rather sloppy with the versioning in the past, we follow the Semantic Versioning scheme and the guidelines from “Keep a Changelog” since 2019.

version-3.1.0 (2024-09-01)

Add
Web Searcher: Add Marginalia
Add
Coordinate to Open Location Code (OLC): Converter from Latitude/Longitude coordinates to Open Location Code
Add
Open Location Code (OLC) to Coordinate: Converter from Open Location Code to Latitude/Longitude coordinates
Add
Handlebars: New node to apply Handlebars templates
Add
Web Searcher: Add Noun Project
Change
OAuth Connector (Custom OAuth 2.0): No longer requires a paid license (see here)
Fix
Coordinate to Geohash: Fixed node description
Fix
Geohash to Coordinate: Fixed node description

version-3.0.1 (2024-07-15)

Info
KNIME 5.3 compatibility only - further changes see 3.0.0

version-3.0.0 (2024-07-15)

Add
Sitemap Retriever: New node to read and parse sitemap XML file
Add
Sitemap Finder: New node to find sitemaps for given URLs
Add
Ranking Services: Re-add new node with new and updated ranking sources: BibSonomy, Facebook Link Stats, Google Cached Page, Hacker News, OpenPageRank, Pinterest, Reddit, SharedCount, and Wikipedia
Add
Web Searcher: Re-add new node with new and updated search engines for web pages, videos, images, and audio: Flickr, Google Programmable Search Engine, Hackernews, MetMuseum, Openverse, Pexels, Pixabay, Qwant, Reddit, Stack Exchange, Unsplash, Vimeo, Wikipedia, YouTube.
Change
Base64 Encoder: Allow encoding string cells
Change
Base64 Decoder: Allow decoding to string cell
Change
Hash Calculator: Allow input of path cells
Change
HTML Parser: Allow input of path cells
Change
Base64 Encoder: Allow input of path cells
Remove
Removed plugin ws.palladian.nodes.deprecated with long-deprecated (see v2.0.0) nodes: Date Extractor (use the updated Date Extractor node instead), Palladian NER, Content Extractor (use Web Page Content Extractor instead), NekoHtmlParser (use the HTML Parser node).
Remove
Removed defunct MapQuest tile cell renderer.
Fix
Hash Calculator: I/O performance

version-2.10.0 (2024-03-18)

Info
Requires at least KNIME 4.7 (please make sure you’re using an update site URL corresponding to your KNIME version)
Info
Note: We’re currently evaluating for how long we’re going to support KNIME 4.7 and your opinion matters – if you’re deliberately not upgrading to KNIME 5 (for now), please let us know!
Add
Empty String to Missing Value: Node to replace empty and blank strings with missing values
Add
Regex Extractor: Add template for extracting link tags
Add
New string distance measures: Damerau Levenshtein, Hamming Distance, Jaro, Longest Common Subsequence, Longest Common Substring, Needleman-Wunsch, Ratcliff-Obershelp, Smith-Waterman
Add
Collection Distances: Node for collection similarities / distances; supported measures: Block Distance, Cosine, Dice, Euclidean, Hamming Distance, Jaccard, Overlap, Tanimoto
Change
Regex Extractor: Add output modes “Is Match (Boolean)” and “Match Count (Number)”
Change
Column Distances: Improved UX and simplified configuration
Change
Remove separate features ws.palladian.nodes.maxmind.feature.feature.group, ws.palladian.nodes.oauth.feature.feature.group, ws.palladian.nodes.phone.feature.feature.group, ws.palladian.nodes.distances.feature.feature.group - they are all contained within ws.palladian.nodes.feature.feature.group now
Deprecate
Old “Column Distances” node is deprecated
Fix
Retry failing Geonames API requests

version-2.9.0 (2023-09-28)

Add
Input License Key: Node which allows to set the license key (as an alternative way to the KNIME preferences; useful if you’re having issues to set the license on a KNIME Server or “Hub”; see here)
Add
Phone Number Formatter: New node for formatting and parsing phone numbers
Fix
Coordinate to Geohash: Configuration validation of input column
Fix
Geohash to Coordinate: Configuration validation of input column

version-2.8.1 (2023-03-01)

Fix
Revert using Palladian’s “fat JAR” library which obviously could casue issues on Windows (see here; kudos to bobpeers)

version-2.8.0 (2023-02-28)

Info
Requires at least KNIME 4.5 (please make sure you’re using an update site URL corresponding to your KNIME version)
Info
Works with current KNIME 5.0 nightly
Add
Map Viewer: Allow to render map to image (see here, here, here)
Add
OAuth Connector (Custom OAuth 2.0): New node for connecting to arbitrary OAuth 2.0 services
Change
Regex Extractor: Add “streaming” functionality (see here)
Change
OAuth Connector: Add Flow Variable support (see here)
Change
Regex Extractor: The format of output RowIDs in case of “Rows” or “Rows or Missing” has changed – instead of strictly incrementing the IDs (Row0, Row1, Row2, …) we now use the input row ID suffixed with a running index (Row0, Row1#0, Row1#1, …)
Change
Regex Extractor: Input port can explicitly be removed (using the three dots on the node) to allow direct string input
Change
Regex Extractor: Add improved templates for matching URLs (see here)
Change
Regex Extractor: Template for extracting hashtags
Change
Location Extractor: Make node streamable
Change
Geohash to Coordinate: Make node streamable
Change
GeoIP2 Extractor: Make node streamable
Change
Text Classifier Predictor: Make node streamable
Change
Column Distance: Make node streamable
Change
Date Extractor: Make node streamable
Change
Reverse Location Lookup: Make node streamable
Change
TF-IDF Similarity: Make node streamable
Change
Corpus Creator: Make node streamable
Deprecate
Old “Map Viewer” node is deprecated and replaced with improved version (can be automatically migrated)
Deprecate
Old “Regex Extrator” node is deprecated and replace with improved version (can be automatically migrated)
Fix
Main Location Extractor: Fix auto-configuration
Fix
TF-IDF Similarity: Missing value handling
Fix
Corpus Creator: Check presence of input column

version-2.7.0 (2022-11-09)

Add
OpenCage Geocoder: New node
Add
GeoNames Geocoder: New node
Add
Empirical Distribution: Added aggregation function to be used in GroupBy node (see Empirical distribution function)
Add
Coordinate to Geohash: New node to create a Geohash from a coordinate pair
Add
Geohash to Coordinate: New node to convert a Geohash back to a coordinate pair
Add
Main Location Extractor: New node to extract the main location for texts and address strings (geocoder functionality)
Add
Location Extractor: Add new model “ML (730-docs-10T)”
Fix
MapQuest Geocoder: Fix API endpoint

version-2.6.1 (2022-10-23)

Change
Coordinate Parser: Allow parsing decimal degree (DD) coordinates
Change
Move to NodePit hierarchy
Fix
Local Location Source: Fix error “primaryName must not be null” for some locations
Fix
Reverse Location Lookup: Set default mapping value
Fix
HTML Parser: Use proper encoding for string input

version-2.6.0 (2022-09-27)

Add
Local Location Source: New node which allows to run a local gazetteer for the “Location Extractor” and “Reverse Location Lookup” nodes – use this if you want to keep your data private or you have high volumes of data to extract
Add
GeoNames Location Source: Access to GeoNames API for “Location Extractor” and “Reverse Location Lookup”
Change
Caching of Geocoder requests
Change
OAuth Connector: Renew token upon execution if API returned a refresh token
Change
OAuth Connector: Append column with expiry time/date
Change
OAuth Connector: Append column with raw response
Change
OAuth Connector: Improved dialog UX and validation
Change
Location Extractor: The location sources are now configured through via input port
Change
Location Extractor: Provide additional output columns (Row ID, Annotation Value/Start/End, Location ID, Alternative Names, Ancestor IDs)
Change
Location Extractor: Allow to specify output column name prefix
Change
Location Extractor: Add additional output modes (“Rows”, “Rows or Missing”, “JSON”)
Change
Location Extractor: Add ML-based disambiguation
Change
Reverse Location Lookup: Provide additional output columns (Location ID, Alternative Names, Ancestor IDs)
Change
Reverse Location Lookup: Allow distances below 1.0 km
Change
Reverse Location Lookup: Allow to specify output column name prefix
Change
Reverse Location Lookup: Add additional output modes (“Rows”, “Rows or Missing”, “JSON”)
Change
Multipart Encoded HTTP Entity Creator: Return content type as flow variable
Deprecate
Old “OAuth Connector” is deprecated and replaced with improved version
Deprecate
Old “Location Extractor” is deprecated and replaced with improved version
Deprecate
Old “Reverse Location Lookup” is deprecated and replaced with improved version
Fix
Regex Extractor: Add “No Full Match” option when output type “List” is selected (see here; kudos to mwiegand)
Fix
URL Domain Extractor: Update domain lists to contain new TLDs (see here and here)
Fix
URL Domain Extractor: Operate case-insensitively (see here)
Fix
Additional splash screen icon sizes
Fix
Don’t include KNIME Textprocessing on the update site
Fix
OAuth Connector: Show error message when creating authorization URL fails (see here; kudos to badger101)
Fix
OAuth Connector: Request refresh tokens from Dropbox API
Fix
OAuth Connector: Explicity specify perm param for Flickr to avoid error (see here)
Fix
Location Extractor: Handling of missing value cells

version-2.5.2 (2022-08-01)

Fix
OAuth-related dependencies on KNIME 4.7 (nightly)

version-2.5.1 (2022-06-12)

Info
Requires at least KNIME 4.4 (please make sure you’re using an update site URL corresponding to your KNIME version)
Fix
Map Viewer: Open window with proper zoom level, set proper zoom level for none, only one coordinate pair
Fix
Column Distance: Handle missing value cells
Fix
OAuth Connector [BETA]: Prevent exception when checking response (see here; kudos to nxfxcom)
Fix
OAuth Connector [BETA]: Fix java.lang.NoClassDefFoundError when accessing e.g. Google (see here; kudos to fryfella)

version-2.5.0 (2022-04-12)

Add
Coordinate Formatter: Node for formatting Geo Coordinate cells to DMS strings (see here).
Add
Coordinate Parser: Node for parsing DMS strings to Geo Coordinate cells.
Change
OAuth 1 Signer: Allow to set signature method (HMAC-SHA1, HMAC-SHA256, HMAC-SHA512, PLAINTEXT)
Change
OAuth 1 Signer: Allow to set additional Authorization headers (e.g. for realm)
Change
OAuth 1 Signer: Make node streamable
Change
OAuth 1 Signer: Thank you to wisemanleo for supporting this upgrade (see here, here)
Change
OAuth Connector [BETA]: Add new sources (Instagram, Доктор на работе, The Things Network)
Deprecate
Old “OAuth” node is deprecated and replaced with new version (called “OAuth 1 Signer”)

version-2.4.1 (2021-03-28)

Fix
Date Extractor: Fix naming of extraction method “Date and Time Cell (first)” (was incorrectly called “String Cell (first)” before)
Fix
Regex Extractor: Fix styling of preview table
Fix
Regex Extractor: Fix potential race condition which would not properly update the preview (see also here – thanks to PBJ)

version-2.4.0 (2021-02-16)

Info
Requires at least KNIME 4.1 (please make sure you’re using an update site URL corresponding to your KNIME version)
Add
Trim Image Whitespace: Node to remove white space surrounding a PNG image
Add
HTTP Retriever: Allow to override default proxy configuration in the “Proxy” tab (see here)
Add
HTTP Retriever: Store redirected location in HTTP Results
Add
HTTP Result Data Extrator: Add setting “Append redirected locations”
Add
HTTP Retriever: Allow PATCH
Add
N-Gram Extractor: Allow to specify output column name
Add
N-Gram Extractor: Allow to drop input column
Add
Base64 Encoder, Base64 Decoder: New nodes for encoding/decoding Base64
Add
OAuth Connector: New node for connecting to 50+ OAuth-based APIs. This node is currently labled as “BETA” -- there might be bugs or later versions might change the functionality. In case of feedback or bug reports, please do reach out!
Change
Use version 2.0 of Palladian Toolkit library
Change
HTTP Retriever: Show exection warnings on node additionally to logging them (e.g. when HTTP method is missing or invalid, when URL contains whitespace, in case of network errors)
Change
HTTP Retriever: Automatically trim whitespace around URLs (see here)
Change
AP Calculator: Make node streamable
Change
Coordinate to Latitude/Longitude: Make node streamable
Change
Form Encoded HTTP Entity Creator: Make node streamable
Change
Hash Calculator: Make node streamable
Change
HTML Parser: Make node streamable
Change
Latitude/Longitude to Coordinate: Make node streamable
Change
Multipart Encoded HTTP Entity Creator: Make node streamable
Change
String Similarity: Make node streamable
Change
Trim Image Whitespace: Make node streamable
Change
URL Domain Extractor: Make node streamable
Change
URL Normalizer: Make node streamable
Change
URL Resolver: Make node streamable
Change
Web Page Content Extractor: Make node streamable
Remove
Ranking Services: Remove obsolete Compete, Delicious, DMOZ
Fix
HTTP Retriever: Required validation for User Agent input in dialog
Fix
HTTP Retriever: Prevent entering negative values for Socket Timeout
Fix
Text Classifier Learner, Text Classifier Predictor: Fix link to press release in node documentation (kudos to Armin Ghassemi Rudd)
Fix
TF-IDF Similarity: Fix NaN values (see here)
Fix
HTML Parser: Fix absolute URLs on redirected requests
Fix
HTTP Retriever: Properly handle URLs with ? which are not query params (see here)
Fix
HTTP Retriever: Make parsing of cookie “expires” attribute more lenient and behave more like a web browser (see here)
Fix
HTTP Retriever: Automatically strip away URL #fragments (see here)
Fix
HTTP Retriever: Properly honor the given “Maximum file size” limit, even when below 1024 bytes (see here)
Fix
GeoIP2 Extractor: Improve error message when DB file cannot be accessed
Fix
Hash Calculator: Correctly honor “Remove input column” setting
Fix
HTTP Retriever: Catch potential NullPointerException in SSL-related code (see here)
Fix
Fix potential version conflicts with NodePit licensing plugin

version-2.3.0 (2020-09-25)

Info
Requires at least KNIME 4.0 (please make sure you’re using an update site URL corresponding to your KNIME version)
Add
Regex Extractor: Add a “Rows or Missing” output mode which appends a row with missing value cells in case of a no-match (see here)
Add
Text Classifier Model Writer: Report progress while writing model
Add
Text Classifier Model Reader: Report progress whiel reading model
Add
GeoIP2 Extractor, GeoIP2 DB Connector, GeoIP2 WS Connector: New nodes to get information for IP addresses using the MaxMind API or MMDB files
Change
More efficient storage of HttpResult cells
Change
Improved renderer for HttpResult cells showing headers and payload
Change
HTML Parser: Add “Drop input column” setting (see here)
Change
HTML Parser: Allow to input HTML strings
Change
Regex Extractor: Timeout presumably endless regexes in dialog after 15 seconds
Change
Regex Extractor: Allow to cancel long running regexes during node execution
Change
String Similarities: Allow to configure name of output column (see here)
Fix
Text Classifier Model Writer: Ensure that model file is always written in GZIP format (see here)
Fix
Text Classifier Model Writer: Ensure that .palladianDictionaryModel extension is appended
Fix
String Similarities: Handle missing value input

version-2.2.0 (2020-05-15)

Add
Regex Extractor: Add a “Columns” output mode which appends a column for each matched group.
Fix
Google Address Geocoder: Fix pointer to preferences in node documentation – kudos to joan_beneyto
Fix
Location Extractor: Fix pointer to preferences in node documentation
Fix
MapQuest Geocoder: Fix pointer to preferences in node documentation

version-2.1.0 (2020-05-08)

Add
Regex Extractor: Add “Drop Full Match” option (see here)

version-2.0.2 (2020-02-01)

Fix
Regex Extractor: Fix configuration logic which would prevent output when picking a different input column than the first – kudos to Armin Ghassemi Rudd

version-2.0.1 (2020-01-26)

Fix
Date Extractor: Fix execution exception which would happen for some settings combinations – kudos to Armin Ghassemi Rudd
Fix
Improve KNIME server detection, avoid false alarms on “normal” KNIME when “KNIME Executor connector” is installed

version-2.0.0 (2020-01-24)

Info
Requires at least KNIME 3.7 (please make sure you’re using an update site URL corresponding to your KNIME version)
Info
Provide a zipped version of the update site -- simply append .zip to the update site URL, e.g. http://download.nodepit.com/palladian/4.1.zip (see here, here)
Info
Rename “Palladian Nodes for KNIME Workbench” to simply “Palladian for KNIME”, update the license to version 2.1.1 to reflect this change
Info
Update Palladian library to version 1.0
Add
Add “Virtual Earth” tiles to “Map Viewer” node
Add
Add “Stamen” tiles “Toner”, “Terrain”, and “Watercolor” to “Map Viewer” node
Add
Add “Wikimedia” maps to “Map Viewer” node
Add
Add new node “Regex Extractor” -- create your regular expressions as easy as a breeze; build, preview, and test your regexes in real time with your real data
Add
Add new node “Web Page Content Extractor” -- replace old “Content Extractor” and outputs the results as plain String and XML cells instead of prorietary “Document” cell from KNIME textprocessing, which makes use much more flexible
Add
Add new “Hash Calculator” node with additional hashing algorithms (MD2, MD5, SHA, SHA-224, SHA-256, SHA-384, SHA-512), possibility to hash binary data beside string data, and option the drop input input column
Change
Date Extractor: Allow to extract dates into collection cells, individual rows, or only extract first data occurrence (see here).
Change
Date Extractor: Allow to append column with input Row ID
Change
Date Extractor: Make use of KNIME’s “Local Date Time” cells
Change
Date Extractor: Allow to remove input column
Change
Date Extractor: Allow to specify output column name
Change
Threshold Analyzer: Update node documentation to mention “Accuracy” measure (see here)
Change
Update “Map Viewer” to new JXMapViewer2 library
Change
Moved nodes which depend on KNIME Textprocessing (“Date Extractor”, “Palladian NER”, “Content Extractor”) and deprecated nodes (“NekoHtmlParser”) to a separate, optional feature; this avoids having to install the heavyweight Textprocessing dependency
Change
Restructure Palladian-related preferences to common entry, allow to enter license key
Deprecate
Date Extractor: Old node is deprecated -- replace with new version for additional functionality
Deprecate
URL Extractor: Mark node as deprecated -- we recommend to use the new “Regex Extractor” instead which has a dedicated “URL” preset
Deprecate
Content Extractor: Mark node as deprecated -- we recommend to use the new “Web Page Content Extractor” from now on
Deprecate
Hash Calculator -- replaced with new version
Remove
Remove MapQuest tiles in “Map Viewer” node; they do not offer direct tile access any longer
Remove
Ranking Services: Remove Facebook ranking source
Fix
Fix missing OSM tiles in “Map Viewer” node
Fix
Google Address Geocoder: Fix link to API key in node documentation

version-1.8.0 (2019-07-27)

Change
Updated categories structure; move “Palladian” entry to root, organize nodes into sub-categories
Change
Adding categories description for better presentation on NodePit
Change
Changed node labels from “CamelCase” to proper spacing
Change
Added new Palladian logo
Change
Minor typograhpy fixes in node documentation
Change
Added additional content to node documentation

version-1.7.1 (2019-06-29)

Info
KNIME 4 compatibility

version-1.7.0 (2018-07-04)

Change
Rename “ColumnDistanceNode” to “ColumnDistance”
Deprecate
RankingServices: Deprecate node

version-1.7.0 (2018-06-20)

Deprecate
freegeoip: Deprecate node due to changed API

version-1.7.0 (2017-04-18)

Change
TextClassifierLearner: Option to disable listening for memory warnings; see https://forum.knime.com/t/textclassifierlearner-received-memory-warning-at-30-of-dedicated-mem-usage/10766
Change
GoogleAddressGeocoder: Add option to specify API key
Deprecate
MapzenGeocoder, ReverseGeocoder: Deprecate nodes and remove logic due to Mapzen shutdown

version-1.7.0 (2017-12-07)

Change
MultipartEncodedHttpEntityCreator: Make sure empty filenames are transformed to null
Change
MultipartEncodedHttpEntityCreator: Improve usability -- set default entity name when selecting input column in dialog
Change
HttpRetriever: Add content type 'text/plain'
Change
MultipartEncodedHttpEntityCreator: Add validation
Change
MultipartEncodedHttpEntityCreator: Allow StringValue input as well
Change
MultipartEncodedHttpEntityCreator: Allow to specify name of output column
Change
MultipartEncodedHttpEntityCreator: Allow 10 instead of 5 inputs
Change
MultipartEncodedHttpEntityCreator: Documentation

version-1.6.100 (2017-10-12)

Change
HttpRetriever: Add “Fail on network error” setting

version-1.6.100 (2017-05-29)

Add
TfIdfSimilarity: Node to calculate similarity between two strings based on their tf—idf vectors and their cosine similarities
Add
CorpusCreator: Node to create a corpus which contains counts for each unique term within the given texts
Add
NGramExtractor: New node for creating token-/word-n-grams as lightweight alternative to the n-gram creator from the Text Processing plugin which works on simple strings as input and produces string collections as output

version-1.6.100 (2017-05-07)

Add
HtmlNodeToText: New node to convert HTML documents/nodes to human-readable strings
Change
TextClassifier: Allow setting minimum and maximum term lengths

version-1.6.100 (2017-05-06)

Change
TextClassifier: Improve handling of large dictionary models: Load model data lazily to speed up opening of workflows

version-1.6.100 (2017-04-12)

Remove
WebSearcher: Removing Bing, DuckDuckGo, Social Mention, WebKnox as they are no longer functional

version-1.6.100 (2017-04-06)

Change
TextClassifierPredictor: Enable parallel processing
Fix
TextClassifierPredictor: Fix overriding classification column
Fix
TextClassifierLearner: Fix enabling/disabling of applicable checkboxes in configuration dialog

version-1.6.100 (2017-02-20)

Fix
Fix node description and column guessing in ReverseGeocoder

version-1.6.100 (2016-10-31)

Change
Stricter throttling in GoogleGeocoder to avoid being blocked

version-1.6.100 (2016-09-10)

Change
Setting to retrieve location hierarchy from Geonames, which greatly improves extraction quality in LocationExtractor

version-1.6.100 (2016-07-27)

Remove
Remove obsolete searchers (old Google API, Topsy), add Flickr searcher to WebSearcher node

version-1.6.100 (2016-07-07)

Change
Return image- and video-specific properties from WebSearcher node

version-1.6.100 (2016-06-08)

Add
Reverse geocoder node using MapZen
Add
Added node to create Multipart encoded HTTP entities
Change
Option to fail HttpRetriever node execution in case a non-success HTTP status code is returned (>= 400)
Change
Make FreeGeoIP lookup more robust

version-1.6.100 (2016-03-31)

Change
Skip-gram features for TextClassifierLearner

version-1.6.100 (2016-02-27)

Add
Added node to set Eclipse preferences for testing purposes
Add
Added Mapzen geocoder

version-1.6.100 (2016-02-26)

Fix
Fix synchronization issue with cookie store

version-1.6.100 (2016-02-02)

Change
Accept any kind of NominalValue for InformationGainCalculator

version-1.6.100 (2016-01-15)

Add
Added FreeGeoIP node

version-1.6.100 (2016-01-04)

Change
Added RankingService for Hacker News
Fix
Missing value handling in RankingServices node

version-1.6.100 (2015-12-20)

Add
Added new column-based distance calculation node

version-1.6.100 (2015-12-09)

Fix
Fixed proxy issue in HttpRetriever node

version-1.6.100 (2015-12-01)

Remove
Removed obsolete RankingServices: Friendfeed Stats, Friendfeed Aggregated Stats, Twitter
Remove
Removed obsolete WebSearchers: WebKnox News

version-1.6.100 (2015-11-01)

Add
UrlDomainExtractor node to extract domain from URLs, optionally without subdomains

version-1.6.100 (2015-10-09)

Info
Adaption to KNIME 3.0

version-1.6.0 (2015-09-17)

Change
Replaced “accept self-signed certificates” by “accept all certificates” option in HttpRetriever

version-1.6.0 (2015-06-22)

Change
HttpRetriever also accepts StringValues as HTTP entity, HttpRetriever allows to specify an arbitrary content type.

version-1.6.0 (2015-06-22)

Change
Additional preprocessing options for TextClassifierLearner node: stemming, stop word removal for German and English language

version-1.6.0 (2015-06-01)

Change
Setting for HttpRetriever to allow self-signed SSL certificates

version-1.6.0 (2015-05-31)

Fix
Remove temporary debugging code in HtmlParser, which was causing exception with invalid encoding string

version-1.6.0 (2015-05-28)

Change
HtmlParser node additionally accepts binary object cells as input

version-1.6.0 (2015-05-27)

Fix
Improve missing value handling in FeedDiscovery node

version-1.6.0 (2015-05-23)

Add
Cookie support for new HttpRetriever node (optional input and output tables)
Add
Ability to specify HTTP methods in new HttpRetriever node by input column
Add
HttpResultDataExtractor node optionally creates a binary instead of a string cell
Add
New HttpRetrieverNode can send binary data, which can be specified through an optional input column
Add
Added FormEncodedHttpEntityCreator node to convert key-value data to form-encoded input for HttpRetriever
Add
Possibility to input HTTP headers in HttpRetriever requests.
Deprecate
Mark old HttpRetriever node as deprecated

version-1.6.0 (2015-05-01)

Change
Change default file extension for text classifier models from '.gz' to 'palladianDictionaryModel', ability to drop models and create appropriate TextClassifierModelReader dialog

version-1.6.0 (2015-04-30)

Change
Stop training via TextClassifierLearner when memory is getting full (using KNIME's MemoryWarningSystem)

version-1.6.0 (2015-04-20)

Fix
Fix guessing of category column in TextClassifierLearner node

version-1.6.0 (2015-04-17)

Fix
Better handling of missing values in HttpRetriever, return IntCell instead of LongCell for HTTP status codes

version-1.6.0 (2015-04-16)

Fix
Better handling of missing values in ContentExtractor and HtmlParser nodes

version-1.6.0 (2015-04-10)

Add
Added MapQuest cell renderer

version-1.5.0 (2015-04-07)

Add
Added MapQuestGeocoder node

version-1.5.0 (2015-03-28)

Add
Added GoogleAddressGeocoder node

version-1.5.0 (2015-03-26)

Add
Added Jaro–Winkler string distance measure

version-1.5.0 (2015-02-24)

Add
Added CoordinateToLatitudeLongitude node
Add
Added ReverseLocationLookupNode
Change
Renamed CoordinateParser to LatitudeLongitudeToCoordinate node
Change
Renamed CoordinateParser to LatitudeLongitudeToCoordinate.

version-1.5.0 (2015-02-23)

Add
Adding HttpResultToStringNode

version-1.2.0 (2014-11-20)

Change
Output warning in HtmlParser when processing http URLs (should use HttpRetriever)

version-1.2.0 (2014-11-10)

Fix
Use explicitly given encoding in HtmlParser node when processing HttpResults

version-1.2.0 (2014-08-28)

Change
Possibility for weighted inputs for TextClassifierLearner node

version-1.2.0 (2014-08-27)

Change
Additional preprocessing options for TextClassifierLearner node: case sensitivity, border padding

version-1.2.0 (2014-08-05)

Change
Offer additional languages in WebSearcher node

version-1.2.0 (2014-08-04)

Change
Add SocialMentionSearcher to WebSearcher node
Deprecate
Mark RMSE node as deprectated

version-1.2.0 (2014-07-10)

Change
Provide accuracy values in ThresholdAnalyzer node
Change
Changed pruning capabilities to updated Palladian functionality

version-1.2.0 (2014-06-17)

Change
Automatically trim spaces when entering API keys in preferences

version-1.2.0 (2014-05-14)

Change
WebSearcher node append column with tags
Change
Provide paging for Twitter searcher in WebSearcher node

version-1.2.0 (2014-05-03)

Change
Ability to switch scoring algorithms in TextClassifierPredictor node (expert mode)

version-1.2.0 (2014-04-30)

Add
InformationGain node

version-1.2.0 (2014-04-25)

Fix
Fixing shifted month in DateParserNode (+ adding test)

version-1.2.0 (2014-04-23)

Change
TextClassifierModelToTable outputs second table with category priors

version-1.2.0 (2014-03-14)

Change
Setting for maximum number of terms for TextClassifierLearner

version-1.2.0 (2014-03-10)

Change
TextClassifierModelToTable provides the term counts as column

version-1.2.0 (2014-02-27)

Change
Cutoff irrelevant parts of graph in ThresholdAnalyzer node (values on the left, which are no different from their successors)

version-1.2.0 (2014-02-25)

Change
Greatly reduce memory consumption when training with TextClassifierLearner node
Fix
Fix NullPointerException in ThresholdAnalyzer node

version-1.2.0 (2014-02-21)

Change
Try to auto-select positive class column in ThresholdAnalyzer node

version-1.2.0 (2014-02-20)

Add
TextClassifierModelToTable node to write a Palladian text classifier dictionary to a KNIME table.
Change
Give statistics about text classifier dictionary on output port's tooltip

version-1.2.0 (2014-01-23)

Change
Output warning to log, in case a deprecated searcher is used.
Change
WebSearcher allows to append column with total number of results available for a given query (in case the specific searcher provides this information)

version-1.2.0 (2014-01-18)

Change
FeedParser now allows input of XML documents

version-1.2.0 (2013-12-26)

Change
DateExtractor now optionally appends a column with the parse pattern used for extracting a specific date.
Change
WebSearcher node adds a column providing GeoCoordinate values (in case this information is provided by the actual search engine; currently, YouTube, Twitter, Instagram, Flickr, Panoramio provide coordinates for some results)
Fix
DateExtractor now handles date/time precision correctly (e.g. only extract date without time in case it is appropriate)

version-1.2.0 (2013-12-25)

Change
Provide additional short version rendering for GeoCoordinate values (beside full precision and DMS)
Change
Log output from the Palladian library is now piped to KNIME's integrated node logger.