Icon

Email Lookup & Validation (1M Records)

<p>This workflow demonstrates <strong>fast and fault-tolerant email address validation</strong> using the Exorbyte MatchMaker engine integrated into KNIME.<br>A dataset of more than <strong>1 million customer emails</strong> is indexed using the <em>Single-Field Indexer (Labs)</em> node to create a highly optimized lookup structure. The index is stored once and reused instantly without the need for re-indexing.</p><p>The <strong>Email Validation Data App</strong> allows users to type any email address and perform a real-time lookup against the full dataset. MatchMaker handles typos, substitutions, insertions, and other variations, returning the closest matching email if one exists.<br>The result is visualized using <strong>color-coded character alignment</strong>, making it easy to see how the input differs from the matched address.<br>If no match exists, the app clearly displays a <strong>“no results found”</strong> message.</p><p>This workflow is ideal for scenarios such as <strong>customer data quality checks</strong>, <strong>email deduplication</strong>, and <strong>contact list validation</strong>.</p>

URL: exorbyte GmbH https://www.exorbyte.ai/

Building the Index

The Single-Field Indexer (Labs) node converts the `Email` column into an optimized exorbyte index.
This enables fast, fault-tolerant lookups across the entire dataset.

Emails Dataset

This dataset contains approximately 1M records, each including a fake email address.

exorbyte License Setup

Activates the exorbyte Nodes license to enable indexing and matching functionality.

👉 See full workflow guide: How to license exorbyte Extension

Saving the Pre-Built Index

The Index Writer (Labs) node stores the generated `Emails` index in the workflow’s data folder.

Interactive Email Lookup & Validation


The Index Reader (Labs) node loads the pre-built email index, enabling instant access to over one million email records without re-indexing.
This index is used by the Email Validation Data App, which allows users to input an email address, perform a fast and fault-tolerant lookup, and view the results interactively.
If a match is found, the corresponding email is visualized with color-coded character alignment; if no match exists, a clear “no results found” message is displayed.

Email Addresses (1M Records)
CSV Reader
Creating the index from Emails column
Term Indexer
License Activator
Saving the Pre-Built Index
Index Writer
Loading the pre-built index
Index Reader
Email Validation Data App

Nodes

Extensions

Links