String Similarity

Palladian for KNIME version 2.3.0.202009251618 by palladian.ws; Philipp Katz, Klemens Muthmann, David Urbansky

Calculates various string similarity metrics between two strings, such as n-gram overlap, Levenshtein, and Jaro-Winkler.

If either input cell in a row contains a “missing value”, the output for that row is a missing value as well.
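
To make the measures more concrete, here is a minimal Python sketch of two of them, with missing values modeled as None. It is not the Palladian implementation: the function names are hypothetical, and both the normalization of the Levenshtein distance by the longer string's length and the use of the Jaccard coefficient for n-gram overlap are assumptions, since this page does not say how the node scales its results.

```python
from typing import Optional


def levenshtein_distance(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions, and substitutions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def levenshtein_similarity(a: Optional[str], b: Optional[str]) -> Optional[float]:
    """Similarity in [0, 1]; None (a missing value) if either input is None.

    Assumption: the distance is normalized by the longer string's length.
    """
    if a is None or b is None:
        return None
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein_distance(a, b) / longest


def ngram_overlap(a: Optional[str], b: Optional[str], n: int = 2) -> Optional[float]:
    """Jaccard overlap of character n-gram sets; None if either input is None.

    Assumption: overlap is |A intersect B| / |A union B| on sets of n-grams.
    """
    if a is None or b is None:
        return None
    grams_a = {a[i:i + n] for i in range(len(a) - n + 1)}
    grams_b = {b[i:i + n] for i in range(len(b) - n + 1)}
    if not grams_a and not grams_b:
        # Both strings are shorter than n, so fall back to exact comparison.
        return 1.0 if a == b else 0.0
    return len(grams_a & grams_b) / len(grams_a | grams_b)
```

For example, levenshtein_similarity("kitten", "sitting") uses an edit distance of 3 over a maximum length of 7, and any call with None as an argument returns None, mirroring the missing-value rule above.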

Options

Text input 1
The column in the input table with the first string.
Text input 2
The column in the input table with the second string.
Similarity column name
The name of the column in the output table which contains the similarity value.
Similarity measure
The similarity measure to calculate.

Input Ports

Table which contains two columns with strings to compare to each other.

Output Ports

Table with an appended column holding the similarity between the strings for each row.
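
The relationship between the input table, the options, and the output table can be sketched in plain Python outside of KNIME. Everything here is illustrative: append_similarity and stand_in_measure are hypothetical names, rows are modeled as dictionaries, and difflib's SequenceMatcher ratio is only a stand-in from the standard library, not one of the node's measures.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List, Optional

Row = Dict[str, Optional[object]]


def stand_in_measure(a: str, b: str) -> float:
    """Stand-in similarity in [0, 1]; NOT one of the node's own measures."""
    return SequenceMatcher(None, a, b).ratio()


def append_similarity(
    rows: List[Row],
    first_column: str,
    second_column: str,
    output_column: str,
    measure: Callable[[str, str], float] = stand_in_measure,
) -> List[Row]:
    """Return copies of the rows with one extra column holding the similarity."""
    result = []
    for row in rows:
        a = row.get(first_column)
        b = row.get(second_column)
        # A missing input (None) yields a missing output for that row.
        value = None if a is None or b is None else measure(a, b)
        new_row = dict(row)  # copy, so the input table is left untouched
        new_row[output_column] = value
        result.append(new_row)
    return result


table = [
    {"left": "Palladian", "right": "Palladian"},
    {"left": "KNIME", "right": None},
]
result = append_similarity(table, "left", "right", "similarity")
```

The four arguments correspond to the Text input 1, Text input 2, Similarity column name, and Similarity measure options; every input row is kept, and only the new column is added.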

Installation

To use this node in KNIME, install Palladian for KNIME from the following update site:

KNIME 4.2

A zipped version of the software site can be downloaded here. Read our FAQs to get instructions about how to install nodes from a zipped update site.
