Dubletten Adressen_MZ

Address Deduplication

The workflow finds duplicate restaurant records by searching a reference table for similar names and addresses. Similarity is measured as the mean of two 2-gram Dice distances: one computed on the restaurant name and one on the address.
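The distance described above can be sketched outside KNIME as follows. This is a minimal illustration, not the implementation used by the String Distances or Aggregated Distance nodes. It assumes character bigrams treated as a set, lowercased input, and an unweighted mean of the two field distances. The function names (`bigrams`, `dice_distance`, `record_distance`, `closest_match`) are hypothetical.

```python
def bigrams(text):
    """Return the set of character 2-grams of a lowercased, trimmed string."""
    s = text.lower().strip()
    return {s[i:i + 2] for i in range(len(s) - 1)}


def dice_distance(a, b):
    """1 - Dice coefficient on bigram sets; 0.0 means identical bigram sets."""
    ga, gb = bigrams(a), bigrams(b)
    if not ga and not gb:
        # Assumption: two strings without any bigram count as identical.
        return 0.0
    return 1.0 - 2 * len(ga & gb) / (len(ga) + len(gb))


def record_distance(rec_a, rec_b):
    """Mean of the name distance and the address distance.

    Each record is a (name, address) tuple.
    """
    name_d = dice_distance(rec_a[0], rec_b[0])
    addr_d = dice_distance(rec_a[1], rec_b[1])
    return (name_d + addr_d) / 2


def closest_match(record, reference):
    """Return (reference_record, distance) for the nearest reference row."""
    scored = ((ref, record_distance(record, ref)) for ref in reference)
    return min(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical rows, for illustration only (not from the restaurant data set).
    reference = [("Cafe Roma", "12 Main St"), ("Blue Door", "7 Oak Ave")]
    match, distance = closest_match(("Caffe Roma", "12 Main Street"), reference)
    print(match, round(distance, 3))
```

A record whose nearest reference row falls below a chosen distance threshold would be flagged as a likely duplicate. The threshold is a tuning choice and is not stated in the workflow description.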

Nodes

String Distances (5 ×)
Aggregated Distance (2 ×)
Column Rename (Regex) (2 ×)
GroupBy (2 ×)
Joiner (2 ×)

Extensions

No modules found

Links

Restaurant data set
http://www.cs.utexas.edu/users/ml/riddle/data.html
Address Deduplication
www.knime.com/blog/address-deduplication
How can I define and list the duplication in an adress data set sucessfully, with using String Distances node and Similarity Search node?
forum.knime.com/p/156055

Download

To use this workflow, download it and open it in KNIME.

Created by: wiswedel

Created at: 2014-03-28

On NodePit since: 2022-05-06

Last update: 2026-08-03

Created with KNIME version: v4.5.2

Tags: similarity search, distance measurement
