Products
Nodes
Workflows

v5.5
Select your KNIME version:v5.5 v5.4 v5.3 v5.2 v5.1 v5.0 v4.7 v4.6 v4.5 v4.4 v4.3 v4.2 v4.1 v4.0 v3.7 v3.6
v5.6Nightly

Log In Sign Up

Home
Workflows
KNIME Hub
Users
knime
Examples
50_Applications
13_Address_Deduplication
01_Deduplication_of_Address_Data

01_Deduplication_of_Address_Data

Address Deduplication

The workflow looks for duplicate records of restaurants by searching for similar names and addresses in a reference table. The similarity is based on the mean of the 2-gram dice distances of the restaurant name and address.

Nodes

String Distances2 ×
ARFF Reader1 ×
Aggregated Distance1 ×
Column Rename (Regex)1 ×
GroupBy1 ×
Show all 9 nodes

Extensions

FeatureKNIME Base nodes
FeatureKNIME Distance Matrix
FeatureKNIME Javasnippet

Links

Restaurant data set
http://www.cs.utexas.edu/users/ml/riddle/data.html
Address Deduplication
www.knime.com/blog/address-deduplication
Hierachical Clustering, more weight to start of Stringby izaychik63 on 2024-04-24
forum.knime.com/p/255384

Download

To use this workflow in KNIME, download it from the below URL and open it in KNIME:

Download Workflow

Created by: wiswedel

Created at: 2014-03-28

On NodePit since: 2019-07-12

Last update: 2025-07-02

Created with KNIME version: v4.1.0

Tags: similarity searchdistance measurement

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!

Intro Nodes Extensions Links

Contact

Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com.

Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.

NodePit is the world’s first and most comprehensive search engine that allows you to easily search, find and install KNIME nodes and workflows. Explore the KNIME community’s variety.

Impressum / Imprint
Datenschutzerklärung / Privacy Policy

Thank you to the following supporters — without them, NodePit would not be possible!

Legal info: The KNIME® trademark is registered in the United States and Germany by the KNIME GmbH. NodePit is not affiliated with KNIME®.