Icon

Identity Resolution in Customer Data with Table Indexer

<p>This use case demonstrates how <strong>M|Box's Table Indexing Nodes</strong> can be combined to perform <strong>efficient, fast and fault-tolerant Customer Identity Resolution </strong>across multiple data fields.</p><p>Workflow Overview:</p><ol><li><p>The workflow loads:</p><ul><li><p>A master data table containing 100,000 customer records (IDs, names, addresses etc.)</p></li><li><p>A search table with an incoming customer record that needs to be verified against the master data.</p></li></ul></li><li><p>The <strong>Character Mapper</strong> creates an invisible mapping that expands umlauts (e.g. ö → oe). The <strong>Table Indexer</strong> then builds a multi-field index over relevant attributes such as customer names, birthdates, and telephone numbers.</p></li><li><p>The incoming customer record is matched against the index using the <strong>Table Index Matcher</strong>, which performs a weighted, multi-field search. Fields that are not part of the search criteria are passed through the indexing nodes, ensuring that the result contains the complete matched customer record.</p></li></ol><p>Index-based matching with the <strong>M|Box Table Indexing Nodes</strong> enables fast and scalable identity resolution across multiple customer attributes. By combining multi-field indexing with weighted search, customer records can be reliably matched even when individual fields contain variations or incomplete information.</p>

URL: exorbyte GmbH https://exorbyte.ai/
URL: exorbyte/KNIME https://exorbyte.ai/knime

Request/Activate Exorbyte License

Request and register your exorbyte license before running any M|Box nodes.

If you do not have an active license, within the License Requester:

  1. Choose Demo (30 days) or Production.

  2. Enter your email (and Customer Token if production).

  3. Execute the node – it sends a secure request to the exorbyte team.

  4. When you receive the .lic file, reopen the node → Use available license fileand run the node

Afterwards, or if you already have an active license, run License Activator

⚠️ Each KNIME installation or Hub environment needs its own license

👉 See full exorbyte License Activation Guide

Import Data

Import two data tables:

  • Reference table: A table containing the full customer data (IDs, names, addresses etc.)

  • Search table: A table with incoming customer records to be verified against master data.

Mapping & Indexing

  • Use the Character Mapper to create a hidden mapping, expanding umlauts (e.g. ö → oe)

  • Use the Table Indexer to create a multi-field index on customer names, birthdates and telephone numbers. All other fields are passed through the index to be included in the output without querying them.

Matching

  • Query the index with incoming customer records using the Table Index Matcher

  • Query fields are weighted and search options are adapted to the field's data type (e.g. looking for exact ID matches)

  • Data Columns that are not queried are still passed through to the output, yielding a complete master data match.

Search the indexed master data using weighted multi-field matching.
Table Index Matcher (Labs)
Full Customer Data
CSV Reader
Incoming customer records to be verified against master data.
CSV Reader
Expand Umlauts
Character Mapper
Create a multi-fieldindex
Table Indexer (Labs)
License Activator
License Requester
Include only master dataand match quality columns
Column Filter

Nodes

Extensions

Links