0 ×

Spell Checker: Create Rules

Spellchecker Nodes version 1.1.0.201906221527 by Philipp Katz; Selenium Nodes

Please have a look at the “Spell Checker (simple)” node for a general description of the spell checking and correction mechanism and the supported languages.

This node creates spelling correction rules. For each potential error in the given text, a new row with a so called “rule match” is created. The rule match points to the corresponding text fragment, contains several replacement suggestions and an identifier and description of the specific rule.

The structure of the created rule table looks as follows:

Column Description
Input Row ID The Row ID of the input text.
From Position The rule’s start offset in characters in the input text.
To Position The rule’s end offset in characters in the input text.
Value The original, uncorrected value in the input text.
Suggested Replacements A list of potential replacement suggestions.
Rule ID Identifier of the rule.
Message Human-readable message of the rule.

You can use standard KNIME nodes (e.g. “Row Filter”) to manipulate the list of rules before applying them with the “Spell Checker: Apply Rules” node.

Options

Input
Input column with the text.
Language
Language of the text.

N-gram Data

Selected Directory
(Optional) Directory which contains an n-gram data set. This allows to detect errors with words that are often confused, like their and there. The n-gram data sets need to be downloaded separately and are available here. After downloading, unzip the file and put it in its own directory named “en”, “de”, “fr”, or “es”, depending on the language. The path you need to set in the configuration is the directory that the “en” etc. directory is in, not that directory itself. More information can be found here.

Input Ports

Icon
Table with a string column which contains the texts to correct.
Icon
Table with a list of phrases which should not be corrected.

Output Ports

Icon
Table with correction rules.

Best Friends (Incoming)

Best Friends (Outgoing)

Installation

To use this node in KNIME, install Spellchecker Nodes from the following update site:

KNIME 4.3

A zipped version of the software site can be downloaded here.

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.