Icon

Product Title Normalization using Phrase Indexer (E-Commerce)

<p>This use case demonstrates how <strong>M|Box’s Phrase Indexer</strong> and <strong>Approximate Phrase Index Matcher</strong> nodes can be combined to efficiently normalize product titles from e-commerce sources by matching them to a standardized internal product catalog.</p><p>Workflow Overview:</p><ol><li><p>The workflow loads:</p><ul><li><p>A reference table containing clean, standardized internal product names.</p></li><li><p>A second table containing e-commerce product titles, which may include additional attributes, reordered terms, or marketplace-specific wording</p></li></ul></li><li><p>The internal product names are indexed using the <strong>Phrase Indexer</strong>, creating a searchable phrase index optimized for approximate matching.</p></li><li><p>The e-commerce product titles are matched against this index using the <strong>Approximate Phrase Index Matcher</strong>. This allows robust matching even when:</p><ul><li><p>Titles contain additional descriptive terms (e.g., colors, storage sizes, marketing phrases)</p></li><li><p>Word order differs between title and reference name</p></li><li><p>Minor formatting variations occur</p></li></ul></li></ol><p>The result is a mapping from noisy marketplace titles to clean, normalized internal product names.</p>

URL: exorbyte GmbH https://exorbyte.ai/
URL: exorbyte/KNIME https://exorbyte.ai/knime

Request/Activate Exorbyte License

Request and register your exorbyte license before running any M|Box nodes.

If you do not have an active license, within the License Requester:

  1. Choose Demo (30 days) or Production.

  2. Enter your email (and Customer Token if production).

  3. Execute the node – it sends a secure request to the exorbyte team.

  4. When you receive the .lic file, reopen the node → Use available license fileand run the node

Afterwards, or if you already have an active license, run License Activator

⚠️ Each KNIME installation or Hub environment needs its own license

👉 See full exorbyte License Activation Guide

Import Data

Import two data tables:

  • Reference table: A table containing canonical internal product names

  • Search table: A table containing product titles from an e-commerce site. These titles can be messy and include additional details that don't appear in the product name

Indexing & Matching

  • Use the Phrase Indexer to create an index out of the internal product names

  • Use the Approximate Phrase Index Matcher to match the e-commerce titles to the internal names, finding the best match

Phrase Indexer
Ungroup
Phrase Index Matcher
License Activator
License Requester
Product Names
CSV Reader
E-Commerce Titles
CSV Reader

Nodes

Extensions

Links