
Just KNIME It S03 _ CH10 _ Correcting Postal Addresses

You work as a data analyst for a delivery company, and some packages were not delivered last week because of address typos. The postal carriers flagged every address they could not find due to a typo. Given a dataset of successful deliveries (no typos) and unsuccessful ones (typos), your goal is to fix the incorrect addresses automatically by matching them against the correct ones.

Author: Aline Bessa


URL: Solutions to “Just KNIME It!” Challenge 10 - Season 3 https://forum.knime.com/t/solutions-to-just-knime-it-challenge-10-season-3/81162

Workflow: Just KNIME It - Season 3 - Challenge 10: Correcting Postal Addresses
https://hub.knime.com/-/spaces/-/~WGVlAUUueRlTquyV/current-state/

Workflow annotation: JKI S03 CH 10 Results

Nodes and their labels, in the order listed on the page:
CSV Reader: usa_addresses.csv in 'Current workflow data area'
Similarity Search: corrected typo $nearest neighbor - duplicate$
Row Splitter: has typos split, upper == YES [4], lower == NO [2340]
String Distances: distance definition configured == Levenshtein Distance
Regex Split: get $number$, $postal_d number$
String Manipulation: compare vector
Column Expressions: duplicate == $full address$ typo warnings
Joiner: merge matches splits
Duplicate Row Filter: remove duplicates
Column Filter: column clean-up
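For readers who want to see the core idea outside KNIME, here is a minimal Python sketch of the matching step: split the rows on the typo flag, then replace each flagged address with its nearest neighbor among the correct addresses under Levenshtein distance. It is not the workflow itself. The dictionary keys `address` and `has_typo` and the sample rows are hypothetical, and the sketch leaves out the workflow's extra comparison of the house number and postal code.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep if equal)
            ))
        previous = current
    return previous[-1]


def correct_addresses(rows):
    """Map each flagged address to its closest correct address.

    `rows` is a list of dicts with the hypothetical keys
    'address' and 'has_typo' ('YES' or 'NO').
    """
    # Equivalent of the row split: one table per flag value.
    correct = [r["address"] for r in rows if r["has_typo"] == "NO"]
    flagged = [r["address"] for r in rows if r["has_typo"] == "YES"]
    if not correct:
        return {}  # nothing to match against
    # Nearest neighbor by Levenshtein distance.
    return {
        bad: min(correct, key=lambda good: levenshtein(bad, good))
        for bad in flagged
    }


# Hypothetical sample rows, for illustration only.
rows = [
    {"address": "12 Main Street, Springfield, 62701", "has_typo": "NO"},
    {"address": "48 Oak Avenue, Austin, 73301", "has_typo": "NO"},
    {"address": "12 Mian Stret, Springfield, 62701", "has_typo": "YES"},
]
fixes = correct_addresses(rows)
```

The brute-force search compares every flagged address with every correct one. That is acceptable when only a handful of rows are flagged, as the split counts in the workflow suggest, but a larger dataset would likely need blocking on a field such as the postal code.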
