
03_Example_for_Fuzzy_Address_Matching

Indexing and Searching plug-in for address database cleansing
This workflow demonstrates the use of the Indexing & Searching plugin for address database cleansing. For more information, see the workflow metadata under View -> Description.

Evaluate the results by filtering the generated noisy addresses.

Workflow nodes and their annotations:

File Reader: 1k addresses
Table Indexer: create an index from the addresses for searching
Number To String (deprecated): convert numbers such as the PLZ (postal code) to strings for fuzzy matching
Row Filter: Lucene score > 2
GroupBy: duplicates to review; each row represents an address with the typos that occurred
Search Duplicates: use Configure to change the sensitivity level of the fuzzy search
Generate Noise: duplicate 50% of the addresses by adding noisy entries, mutating letters and generating given-name abbreviations
Extract Duplicates: filter the results to show only the duplicate addresses
Reference Row Filter: duplicates that were not detected
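The core idea of the workflow, generating noisy copies of addresses and then recovering them by fuzzy matching, can be sketched outside KNIME. The example below is only an illustration under stated assumptions: it uses Python's standard `difflib.SequenceMatcher` instead of a Lucene index, its similarity ratio (0 to 1) is not comparable to a Lucene score, and the function names, sample addresses, and the 0.85 threshold are all hypothetical.

```python
import random
from difflib import SequenceMatcher

LETTERS = "abcdefghijklmnopqrstuvwxyz"


def add_noise(text, rng):
    """Replace one character with a random letter (a stand-in for noise generation)."""
    if not text:
        return text
    pos = rng.randrange(len(text))
    return text[:pos] + rng.choice(LETTERS) + text[pos + 1:]


def similarity(a, b):
    """Case-insensitive similarity ratio between 0 and 1 (not a Lucene score)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_duplicates(addresses, candidates, threshold=0.85):
    """For each candidate, return (candidate, best match, score) if the score passes the threshold."""
    matches = []
    for cand in candidates:
        best = max(addresses, key=lambda addr: similarity(addr, cand))
        score = similarity(best, cand)
        if score >= threshold:
            matches.append((cand, best, score))
    return matches


# Hypothetical sample data; the workflow itself reads 1k addresses from a file.
addresses = [
    "Anna Schmidt, Hauptstrasse 12, 78462 Konstanz",
    "Peter Mueller, Bahnhofstrasse 5, 10115 Berlin",
    "Maria Keller, Seestrasse 40, 8002 Zuerich",
]

rng = random.Random(42)  # fixed seed so the noise is reproducible
noisy = [add_noise(addr, rng) for addr in addresses[:2]]
matches = find_duplicates(addresses, noisy)

for cand, best, score in matches:
    print(f"{cand!r} matches {best!r} (ratio {score:.2f})")
```

Lowering the threshold plays a role similar to changing the sensitivity of the fuzzy search: more typos are tolerated, at the cost of more false matches to review.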

Nodes

Extensions