Fuzzy Address Matching

This node matches a set of string columns against a reference table and appends an identifying column (the ID column) of the closest match to the original table. The individual columns are compared with their counterparts in the reference table using the Levenshtein Distance and can be weighted.

Please note that at least 2 columns need to be selected.

Options

Comparison Columns
The columns to use for the fuzzy matching.
ID Column
The column from the reference table that identifies an address.
Column Weights
Comma-separated list of weights for the columns selected above. The higher the weight, the more important a column is for the selection of a match from the reference table.

Input Ports

Icon
The table with the addresses to be matched against the reference table.
Icon
The table with the reference addresses.

Output Ports

Icon
The input table with the ID of the closest match from the reference table.

Nodes

Extensions

Links