Icon

Address Normalization & Matching

<p>This use case demonstrates how <strong>M|Box’s Alias Generator, Phrase Indexer, and Approximate Phrase Index Matcher </strong>nodes can be combined to efficiently <strong>index and search address data</strong>, even in the presence of abbreviations, spelling variants, and other fuzzy input patterns.</p><p>Workflow Overview:</p><ol><li><p>The workflow loads:</p><ul><li><p>A reference table of full, normalized addresses (without abbreviations)</p></li><li><p>An alias dictionary mapping canonical terms (e.g. <em>“Street”</em>) to known alternatives (e.g. <em>“St”, “Str”</em>)</p></li><li><p>A search table containing addresses with abbreviations and other variants</p></li></ul></li><li><p>The reference addresses are indexed using the <strong>Phrase Indexer</strong>, while the <strong>Alias Generator</strong> ensures that defined aliases are treated as equivalent terms during indexing.</p></li><li><p>Incoming search addresses are matched against the index using the <strong>Approximate Phrase Index Matcher</strong>, allowing abbreviations in the input without negatively affecting match quality.</p></li></ol><p>Index-based matching with the <strong>M|Box Phrase Indexing and Alias Generation nodes</strong> ensures robust and scalable address search, even in the presence of abbreviations and spelling variants. This makes it ideal for:</p><ul><li><p>Harmonizing addresses from heterogeneous sources with inconsistent formatting</p></li><li><p>Matching user-entered or third-party address data against a standardized reference set</p></li><li><p>Improving address search and matching quality without sacrificing performance or explainability</p></li></ul><p></p>

URL: exorbyte GmbH https://exorbyte.ai/

Request/Activate Exorbyte License

Request and register your exorbyte license before running any M|Box nodes.

If you do not have an active license, within the License Requester:

  1. Choose Demo (30 days) or Production.

  2. Enter your email (and Customer Token if production).

  3. Execute the node – it sends a secure request to the exorbyte team.

  4. When you receive the .lic file, reopen the node → Use available license fileand run the node

Afterwards, or if you already have an active license, run License Activator

⚠️ Each KNIME installation or Hub environment needs its own license

👉 See full exorbyte License Activation Guide

Import Data

Import three data tables:

  • Aliases: A table containing full address terms (e.g. "Street", "Apartment") and the corresponding aliases (e..g "St", "Apt")

  • Reference table: A table containing the full, normalized addresses (without any abbreviations)

  • Search table: Search addresses, with some abbreviations

Aliases & Indexing

  • Create aliases out of the alias dictionary. These will be viewed by the matcher as equivalent to their true word counterparts (e.g. "St" will be equivalent to "Street").

  • Create an index out of the full addresses using the Phrase Indexer.

Matching

Find the search addresses in the index. The Approximate Phrase Matcher:

  • Quickly and reliably finds the correct addresses

  • Takes into account possible typos, incomplete inputs etc.

  • Considers abbreviations, thanks to the Alias Generator.

Generate Aliases
Alias Generator (Labs)
Read Aliases
CSV Reader
Phrase Indexer (Labs)
Addresses with abbreviations
CSV Reader
Reference Table
CSV Reader
Approximate Phrase Index Matcher (Labs)
License Activator (Labs)
License Requester (Labs)

Nodes

Extensions

Links