freshest Workflow_Deduplication_of_Address_Data

Address Deduplication

The workflow detects duplicate restaurant records by searching a reference table for entries with similar names and addresses. Similarity is computed as the mean of two 2-gram Dice distances: one between the restaurant names and one between the addresses.
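The distance described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the workflow's actual node configuration: the function names, the lowercasing step, the use of bigram sets (rather than multisets), and the 0.3 threshold are all hypothetical choices, and the KNIME nodes may tokenize or normalize differently.

```python
def bigrams(text):
    """Return the set of character 2-grams of a lowercased string (assumed normalization)."""
    s = text.lower()
    return {s[i:i + 2] for i in range(len(s) - 1)}


def dice_distance(a, b):
    """2-gram Dice distance: 1 - 2*|X & Y| / (|X| + |Y|), in [0, 1]."""
    x, y = bigrams(a), bigrams(b)
    if not x and not y:
        # Both strings are too short to yield any 2-gram; treat them as identical.
        return 0.0
    return 1.0 - 2.0 * len(x & y) / (len(x) + len(y))


def record_distance(rec_a, rec_b):
    """Mean of the name distance and the address distance.

    Each record is a (name, address) tuple.
    """
    name_d = dice_distance(rec_a[0], rec_b[0])
    addr_d = dice_distance(rec_a[1], rec_b[1])
    return (name_d + addr_d) / 2.0


def find_duplicates(records, reference, threshold=0.3):
    """For each record, find the closest reference entry.

    Returns (record, reference_entry, distance) triples whose distance is
    at or below the threshold. The threshold value is a hypothetical default.
    """
    matches = []
    for rec in records:
        best = min(reference, key=lambda ref: record_distance(rec, ref))
        d = record_distance(rec, best)
        if d <= threshold:
            matches.append((rec, best, d))
    return matches
```

A lower threshold flags fewer, more certain duplicates; a higher one catches more spelling variants at the cost of false matches. Averaging the two distances means a strong match on the address can compensate for a moderately different name, and vice versa.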

Links

Restaurant data set
http://www.cs.utexas.edu/users/ml/riddle/data.html
Address Deduplication
www.knime.com/blog/address-deduplication
How can I define and list the duplication in an address data set successfully, using the String Distances node and the Similarity Search node?
forum.knime.com/p/157330