Term Indexer

The Term Indexer node creates a searchable Index Object from a selected string column of an input data table.
The node abstracts internal engine complexity and exposes two generalized index strategies:

  • Generic - Identity
  • Generic - Term

These strategies optimize indexing behavior depending on the nature of the indexed values (structured identifiers vs. natural terms).
The generated index can be passed to downstream nodes, such as the Term Index Matcher, to quickly retrieve similar values within large datasets. By storing the indexed values in a structured format, the node enables fast and scalable similarity lookups, even in the presence of spelling variations, typos, or inconsistent text data.

Options

Select Column to Index
Selects the string column from the input table that will be used to build the index. Only string-convertible columns are available.
Index Type
Specifies the indexing strategy used to construct the searchable index.
Available options:
Generic - Identity
Optimized for identity-like or structured values such as:
  • IDs
  • Codes
  • Reference numbers
  • Single structured tokens
Designed for exact or near-exact matching scenarios where tolerance is limited and precision is critical.
Generic - Term
Optimized for natural-language-like terms such as:
  • Names
  • Brands
  • Product titles
  • Single-word entities
Provides more tolerant matching behavior for handling typos, spelling variations, and inconsistencies.
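As a rough illustration of the difference between the two strategies, the sketch below applies a strict similarity cutoff for identity-like values and a looser one for terms. It is not the node's implementation: the cutoff values, the `lookup` helper, and the use of Python's `difflib` as the similarity measure are all assumptions made for this example.

```python
from difflib import SequenceMatcher

# Hypothetical cutoffs; the node does not document its real tolerances.
CUTOFFS = {"identity": 0.95, "term": 0.75}

def lookup(query, indexed_values, index_type):
    """Return (value, score) pairs whose similarity meets the cutoff, best first."""
    cutoff = CUTOFFS[index_type]
    hits = []
    for value in indexed_values:
        score = SequenceMatcher(None, query, value).ratio()
        if score >= cutoff:
            hits.append((value, round(score, 3)))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)

codes = ["INV-2024-0071", "INV-2024-0017"]
brands = ["Mercedes", "Michelin"]

# Strict cutoff: only the exact code survives, the transposed one is rejected.
identity_hits = lookup("INV-2024-0071", codes, "identity")

# Loose cutoff: a one-character typo still finds the intended brand.
term_hits = lookup("Mercedez", brands, "term")
```

With the strict cutoff, two reference numbers that differ only by swapped digits are kept apart, which is usually what is wanted for IDs and codes. With the loose cutoff, the same pair would both be returned.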
Representation of Indexed Strings
Defines how indexed values are represented in downstream results (e.g., in Term Index Matcher).
Indexing internally applies normalization to improve match precision. This option determines whether matched index values appear in match outputs in their original form or in their normalized (transformed) form.
Possible values:
  • Original
    Returns values exactly as they appear in the input column.
  • Normalized
    Returns the internally transformed representation used for indexing.
Users may apply the Character Mapper node upstream to override default normalization behavior.
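The relationship between the two representations can be pictured with a small sketch. The normalization shown here (accent stripping, case folding, whitespace collapsing) is only an assumed stand-in for whatever the node applies internally, and `normalize`, `build_index`, and `match` are hypothetical helper names.

```python
import unicodedata

def normalize(value: str) -> str:
    """Assumed normalization: drop accents, fold case, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", value)
    no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_marks.casefold().split())

def build_index(values):
    """Map each normalized key to the original values that produced it."""
    index = {}
    for value in values:
        index.setdefault(normalize(value), []).append(value)
    return index

def match(index, query, representation="Original"):
    """Look up a query and return hits in the requested representation."""
    key = normalize(query)
    originals = index.get(key, [])
    if representation == "Normalized":
        return [key] if originals else []
    return list(originals)

index = build_index(["Nestlé", "NESTLE ", "Müller"])
original_hits = match(index, "nestle")                  # values as they appear in the input
normalized_hits = match(index, "nestle", "Normalized")  # the internal key
```

In this sketch the lookup always runs on the normalized key; the option only changes which form is handed back to the caller.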
Aliases
Select the alias set you want to apply to the index.
An Alias Object (created by the Alias Creator node) enables deterministic synonym expansion or canonicalization during indexing.
When an alias set is applied:
  • Terms may be expanded or rewritten according to alias rules
  • Penalty values influence downstream similarity scoring
  • Deterministic synonym handling improves recall while preserving explainability
If no alias set is selected, indexing proceeds without synonym expansion.
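One way to picture alias handling is the sketch below, in which each indexed term is also stored under its alias-rewritten variant, and a hit reached through an alias has its score reduced by that alias's penalty. The alias table, the penalty scale, and the `expand`, `build_index`, and `score` helpers are assumptions for illustration; the real Alias Object format and scoring formula are not described here.

```python
# Hypothetical alias set: alias token -> (canonical token, penalty in [0, 1]).
ALIASES = {
    "intl": ("international", 0.05),
    "corp": ("corporation", 0.10),
}

def expand(term):
    """Return (variant, penalty) pairs: the term itself plus alias rewrites."""
    variants = [(term, 0.0)]
    tokens = term.split()
    for i, token in enumerate(tokens):
        if token in ALIASES:
            canonical, penalty = ALIASES[token]
            rewritten = " ".join(tokens[:i] + [canonical] + tokens[i + 1:])
            variants.append((rewritten, penalty))
    return variants

def build_index(terms, use_aliases=True):
    """Index every variant, remembering the source term and its penalty."""
    index = {}
    for term in terms:
        variants = expand(term) if use_aliases else [(term, 0.0)]
        for variant, penalty in variants:
            index.setdefault(variant, []).append((term, penalty))
    return index

def score(index, query):
    """Exact lookup; alias-derived hits score lower by their penalty."""
    return [(term, round(1.0 - penalty, 2)) for term, penalty in index.get(query, [])]

with_aliases = build_index(["acme intl"])
without_aliases = build_index(["acme intl"], use_aliases=False)
```

Because every rewrite comes from an explicit rule with a known penalty, a hit such as `("acme intl", 0.95)` for the query `"acme international"` can be traced back to the alias that produced it.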

Input Ports

Table containing the canonical string values to match against.
A mapping object defined by the Character Mapper node.
Optional Alias Object (created by the Alias Creator node) containing synonym/canonicalization mappings used to expand or rewrite terms during indexing.

Output Ports

Contains the indexed representation of the selected column, normalization rules, alias configuration, and metadata. Used as input for the Term Index Matcher node.

Views

This node has no views
