
Alias-Tolerant Sanction List Matching with Phrase Indexing

<p>This workflow demonstrates how exorbyte's <strong>M|Box nodes</strong> enable fast, efficient and fault-tolerant sanction list matching, taking into account name variations.</p><p>Workflow Overview:</p><ol><li><p>Data Import &amp; Preprocessing</p><ul><li><p>The workflow loads a list of full names of sanctioned individuals.</p></li><li><p>First names are extracted separately, as aliases are only generated for given names to keep matching precise.</p></li></ul></li><li><p>Synonym Generation</p><ul><li><p>The <strong>Contextual Synonym Generator</strong> component uses generative AI to create synonyms of the first names (e.g. "Thomas" → "Tom").</p></li></ul></li><li><p>Character Mapping &amp; Alias Creation</p><ul><li><p>The <strong>Character Mapper</strong> creates a mapping to normalize the sanctioned names.</p></li><li><p>The <strong>Alias Creator</strong> generates an alias mapping of the true first names to the synonyms from the previous step.</p></li></ul></li><li><p>The <strong>Phrase Indexer</strong> builds a fast and fault-tolerant search index from the sanctioned names, taking into account the character mapping, as well as the first name aliases.</p></li><li><p>The <strong>Customer Sanction Screening</strong> Data App allows users to search for customers' names and returns the sanction status using the <strong>Phrase Index Matcher</strong> for rapid and reliable matching.</p></li></ol><p>Index-based matching with M|Box ensures robust and scalable sanction screening, even in the presence of name variations, transcription differences, and alternative transliterations. This makes it ideal for:</p><ul><li><p>Normalizing names from heterogeneous data sources with inconsistent formatting or character sets</p></li><li><p>Delivering fast, reliable screening results without sacrificing transparency or matching explainability</p></li></ul>

URL: exorbyte GmbH https://exorbyte.ai/
URL: exorbyte/KNIME https://exorbyte.ai/knime

OpenAI API Credentials

Please enter your username and API key in the Credentials Configuration node.

Request/Activate Exorbyte License

Request and register your exorbyte license before running any M|Box nodes.

If you do not have an active license, within the License Requester:

  1. Choose Demo (30 days) or Production.

  2. Enter your email (and Customer Token if production).

  3. Execute the node – it sends a secure request to the exorbyte team.

  4. When you receive the .lic file, reopen the node → Use available license file and run the node.

Afterwards, or if you already have an active license, run the License Activator.

⚠️ Each KNIME installation or Hub environment needs its own license

👉 See full exorbyte License Activation Guide

Import Data & Isolate First Names

  • Import a table containing the full names of sanctioned individuals.

  • Use the Cell Splitter to isolate the first names, which are used for alias creation.
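
Outside KNIME, the first-name isolation step can be sketched in a few lines of Python. This is only an illustration, not what the Cell Splitter does internally: the helper name `split_first_name` is hypothetical, and it assumes the given name is the first space-separated token of each full name.

```python
def split_first_name(full_name: str) -> tuple[str, str]:
    """Split a full name into (first name, remainder) at the first space.

    Assumption: the given name is the first space-separated token.
    """
    first, _, rest = full_name.strip().partition(" ")
    return first, rest


# Example names taken from this workflow's description.
names = ["Thomas Breckmann", "Erik Lindqvist"]
first_names = [split_first_name(n)[0] for n in names]
last_names = [split_first_name(n)[1] for n in names]
```

Keeping the remainder alongside the first name makes it easy to reassemble the full name after aliases have been generated.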

Generate Synonyms

Use the Contextual Synonym Generator component to generate synonyms of the sanctioned first names, since such variants may be used to avoid triggering sanction screening.

  • e.g. "James" → "Jimmy"

Last names will be kept the same to avoid false matches based on surname variations.

Character Mapping & Alias Creation

Use the Character Mapper to normalize sanctioned names:

  • De-accentuate (e.g. "Ríos" → "Rios")

  • Expand umlauts (e.g. "Übermeier" → "Uebermeier")

  • Map spaces (e.g. "Erik Lindqvist" → "Erik Lindqvist")

  • Latinize (e.g. "Марат Джаксыбеков" → "Marat Dzhaksybekov")
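
The four normalization rules above can be approximated in plain Python as a rough sketch. This is not the Character Mapper's implementation: the function names and both lookup tables are hypothetical, the Cyrillic table covers only the letters needed for the example, and treating "map spaces" as collapsing whitespace variants (including non-breaking spaces) into single spaces is an assumption.

```python
import unicodedata

# Hypothetical lookup tables, deliberately small and for illustration only.
UMLAUTS = {"ä": "ae", "ö": "oe", "ü": "ue",
           "Ä": "Ae", "Ö": "Oe", "Ü": "Ue", "ß": "ss"}
CYRILLIC = {"а": "a", "б": "b", "в": "v", "д": "d", "е": "e", "ж": "zh",
            "к": "k", "м": "m", "о": "o", "р": "r", "с": "s", "т": "t",
            "ы": "y"}


def latinize(text: str) -> str:
    """Transliterate the Cyrillic letters listed in CYRILLIC, keeping case."""
    out = []
    for ch in text:
        latin = CYRILLIC.get(ch.lower())
        if latin is None:
            out.append(ch)  # not in the table: leave unchanged
        else:
            out.append(latin.capitalize() if ch.isupper() else latin)
    return "".join(out)


def expand_umlauts(text: str) -> str:
    """Replace German umlauts with their two-letter expansions."""
    return "".join(UMLAUTS.get(ch, ch) for ch in text)


def deaccent(text: str) -> str:
    """Decompose characters and drop the combining accent marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def normalize_name(name: str) -> str:
    """Apply all mappings; umlauts are expanded before accents are stripped."""
    name = unicodedata.normalize("NFC", name)  # compose so table lookups hit
    name = deaccent(expand_umlauts(latinize(name)))
    return " ".join(name.split())  # collapse any run of whitespace
```

The order matters: stripping accents first would turn "Ü" into "U" before the umlaut expansion could produce "Ue".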

Use the Alias Creator to create aliases of the sanctioned first names, which are treated as equivalent to the original names during matching.
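
The idea of an alias mapping can be illustrated with a small Python sketch. It is an assumption about the general technique, not the Alias Creator's actual data format: `build_alias_map` and `canonicalize` are hypothetical helpers, and the synonym pairs are the examples given in this description.

```python
def build_alias_map(synonyms: dict[str, list[str]]) -> dict[str, str]:
    """Map every synonym (lower-cased) to its canonical first name."""
    alias_to_canonical: dict[str, str] = {}
    for canonical, variants in synonyms.items():
        alias_to_canonical[canonical.lower()] = canonical
        for variant in variants:
            # setdefault keeps the first mapping if a synonym appears twice
            alias_to_canonical.setdefault(variant.lower(), canonical)
    return alias_to_canonical


def canonicalize(full_name: str, alias_map: dict[str, str]) -> str:
    """Replace the first name by its canonical form; keep the rest as-is."""
    first, _, rest = full_name.strip().partition(" ")
    first = alias_map.get(first.lower(), first)
    return f"{first} {rest}".strip()


alias_map = build_alias_map({"Thomas": ["Tom"], "James": ["Jimmy"]})
```

With this map, `canonicalize("Tom Breckmann", alias_map)` yields "Thomas Breckmann", while surnames and unknown first names pass through unchanged.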

Indexing

Use the Phrase Indexer to build a fast, fault-tolerant index on the sanctioned names, taking into account:

  • The character mappings

  • The first name aliases

Sanction Screening – Fault- and Alias-Tolerant Search

Find out whether a customer may be included on a sanctions list. The Customer Sanction Screening Data App lets the user enter a customer's name and searches the index, accounting for typos or incomplete inputs, as well as name variations.

  • For example, searching for "Tom Breckmann" will result in an exact match for "Thomas Breckmann", who is on the sanction list.
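
To make the combination of alias tolerance and typo tolerance concrete, here is a minimal Python sketch that uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure. It is not how the Phrase Indexer or Phrase Index Matcher work: it scans a list linearly instead of using an index, and the names `SANCTIONED`, `ALIASES`, `canonical`, `screen`, and the 0.85 threshold are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

# Illustrative data only, taken from the examples in this description.
SANCTIONED = ["Thomas Breckmann", "Erik Lindqvist"]
ALIASES = {"tom": "thomas", "jimmy": "james"}


def canonical(name: str) -> str:
    """Lower-case the name and replace a first-name alias by its base form."""
    tokens = name.lower().split()
    if tokens:
        tokens[0] = ALIASES.get(tokens[0], tokens[0])
    return " ".join(tokens)


def screen(query: str, threshold: float = 0.85) -> list[tuple[str, float]]:
    """Return sanctioned entries whose similarity to the query meets the threshold."""
    q = canonical(query)
    hits = []
    for entry in SANCTIONED:
        score = SequenceMatcher(None, q, canonical(entry)).ratio()
        if score >= threshold:
            hits.append((entry, round(score, 2)))
    # Best match first
    return sorted(hits, key=lambda hit: hit[1], reverse=True)
```

Because the alias is resolved before comparison, `screen("Tom Breckmann")` scores "Thomas Breckmann" as an exact match, and a misspelled query such as "Tom Brekmann" still clears the threshold. A real index avoids the linear scan, which is what makes screening scale to large lists.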

Activate License
License Activator
Alias Creator
Customer Sanction Screening
Cell Splitter
CSV Reader
Phrase Indexer
generate synonyms of first names
Contextual Synonym Generator
Character Mapper
Put the last names back together
Column Aggregator
Column Renamer
License Requester
Credentials Configuration
OpenAI Authenticator
Double to Integer
add a penalty column
Expression
OpenAI LLM Selector
