Spell Checker (simple)

Go to Product

Spell checker and corrector based on LanguageTool. LanguageTool is an Open Source proof­reading program for more than 25 languages. It finds many errors that a simple spell checker cannot detect.

This node automatically corrects the input text by applying all rules which LanguageTool detected (hence the appendix “simple”). In case, you need more control over the corrections (e.g. only correct certain categories of errors), use the “Spell Checker: Create Rules” node, inspect and filter the rules using standard KNIME nodes (e.g. “Row Filter”), and then finally apply your rules using the “Spell Checker: Apply Rules” node.

Supported languages:

  • Arabic
  • Asturian
  • Belarusian
  • Breton
  • Catalan
  • Catalan (Balearic)
  • Catalan (Valencian)
  • Chinese
  • Crimean Tatar
  • Danish
  • Dutch
  • Dutch (Belgium)
  • English (Australian)
  • English (Canadian)
  • English (GB)
  • English (New Zealand)
  • English (South African)
  • English (US)
  • Esperanto
  • French
  • French (Belgium)
  • French (Canada)
  • French (Switzerland)
  • Galician
  • German (Austria)
  • German (Germany)
  • German (Swiss)
  • Greek
  • Irish
  • Italian
  • Japanese
  • Khmer
  • Persian
  • Polish
  • Portuguese (Angola preAO)
  • Portuguese (Brazil)
  • Portuguese (Moçambique preAO)
  • Portuguese (Portugal)
  • Romanian
  • Russian
  • Simple German
  • Slovak
  • Slovenian
  • Spanish
  • Spanish (voseo)
  • Swedish
  • Tagalog
  • Tamil
  • Ukrainian

Options

Input
Input column with the text.
Language
Language of the text.

N-gram Data

Selected Directory
(Optional) Directory which contains an n-gram data set. This allows to detect errors with words that are often confused, like their and there. The n-gram data sets need to be downloaded separately and are available here. After downloading, unzip the file and put it in its own directory named “en”, “de”, “fr”, or “es”, depending on the language. The path you need to set in the configuration is the directory that the “en” etc. directory is in, not that directory itself. More information can be found here.

Input Ports

Icon
Table with a string column which contains the texts to correct.
Icon
Table with a list of phrases which should not be corrected.

Output Ports

Icon
Table with an appended column which contains the corrected texts.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.