Approximate Phrase Index Matcher (Labs)

The Approximate Phrase Index Matcher node compares each phrase in the Comparison Table to phrases stored in a prebuilt Index Object (from Phrase Indexer).
It uses exorbyte’s deterministic, subword-aware engine to compute fuzzy similarity over multi-word strings.
Optionally, an Alias Object can expand the search with synonymous, canonical, or normalized forms (e.g., "NYC" ⇄ "New York City"), improving recall without sacrificing determinism.
This node is suited for entity normalization, product/title matching, review/snippet clustering, and duplicate detection. It can emit scores, alignment sequences, hit coverage, and best-match candidates, and it can filter rows by threshold.
This node may only be used for private and non-commercial purposes. Commercial use requires a valid license from exorbyte GmbH. All rights reserved.
For more information contact consulting@exorbyte.com.

Options

Select Settings Group
Allowing the user to navigate through different sections of the configuration options
  • Input
  • Search
  • Output
Select Columns in Comparison Input
Select columns applicable to comparison to the Reference Terms
Add Column with Numeric Matching Value
Appends a column showing the calculated similarity score (character count or percentage).
Add Column with Character Match Sequence
Adds a symbolic alignment string visualizing matching and mismatching characters.
'=' -> Match
'x' -> Mismatch
'+' -> Insertion
'/' or '\' -> transition
Add Column with Hit Characters Sequence
Appends a column showing which parts of the reference phrase were matched by the comparison.
Add Column with Best Reference Match
Appends the most similar reference phrase for each comparison row, identifying the best match candidate.
Row Filter Condition
Controls which rows are included in the node output based on the match result.
Options:
  • Output matching rows - Only outputs rows that meet or exceed the similarity threshold.
  • Output non-matching rows - Only outputs rows that do not meet the threshold.
  • No Filtering - Outputs all rows with match metadata for analysis.
Matching Value Threshold - Minimal Matching Percentage
This setting allows you to set the filter criteria based on Match Quality.
This setting only appears if filtering is actually switched on by the previous setting.

Input Ports

Icon
Searchable phrase index produced by Phrase Indexer. Includes metadata and normalization settings.
Icon
Table with phrases to be matched against the index.

Output Ports

Icon
Original comparison rows enriched with numeric score, match coverage, alignment sequence, best-match index phrase, and auxiliary metadata.

Popular Predecessors

  • No recommendations found

Popular Successors

  • No recommendations found

Views

This node has no views

Workflows

  • No workflows found

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.