Fuzzy Address Matching

This node matches a set of string columns against a reference table and appends an identifying column (the ID column) of the closest match to the original table. The individual columns are compared with their counterparts in the reference table using the Levenshtein Distance and can be weighted.

Please note that at least 2 columns need to be selected.

Options

Comparison Columns: The columns to use for the fuzzy matching.
ID Column: The column from the reference table that identifies an address.
Column Weights: Comma-separated list of weights for the columns selected above. The higher the weight, the more important a column is for the selection of a match from the reference table.

Input Ports

: The table with the addresses to be matched against the reference table.
: The table with the reference addresses.

Output Ports

: The input table with the ID of the closest match from the reference table.

Nodes

Table Row to Variable5 ×
Aggregated Distance2 ×
Cell To Model2 ×
Model to Cell2 ×
Row Filter2 ×
Show all 22 nodes

Extensions

FeatureKNIME Base nodes
FeatureKNIME Distance Matrix
FeatureKNIME Ensemble Learning Wrappers
FeatureKNIME Math Expression (JEP)
FeatureKNIME Quick Forms