The Phrase Indexer node creates a searchable index from a selected string column
that contains multi-word phrases.
Before indexing, each phrase is split into individual tokens based on a user-defined
delimiter (default: blank space). Each token is then indexed, enabling efficient
approximate string matching over multi-word data.
This node is particularly useful for text fields containing full names, product
descriptions, or address lines, where indexing based on words rather than entire
strings improves recall and match flexibility.
The generated index can be passed to downstream node Approximate Phrase Index
Matcher, enabling rapid retrieval of similar phrases or partial matches from large
datasets.
You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension exorbyte matchmaker toolbox from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!