
Kuhlen Stemmer

Deprecated

KNIME Textprocessing Plug-in version 4.0.0.v201906171531 by KNIME AG, Zurich, Switzerland

This node reduces terms to their stems using the Kuhlen stemming algorithm. The stemmed terms are stored in the outgoing DataTable, together with the documents containing these terms. Be aware that the Kuhlen stemmer only stems English words correctly.

Options

Deep preprocessing options

Deep preprocessing
If deep preprocessing is checked, the terms contained inside the documents are preprocessed as well. This means the documents themselves are changed, which is more time consuming.
Document column
Specifies the column containing the documents to preprocess.
Append unchanged documents
If checked, the documents contained in the specified "Original Document column" are appended unchanged even if deep preprocessing is checked. This helps to keep the original documents in the output data table without the agonizing pain of joining.
Original Document column
Specifies the column containing the original documents which can be attached unchanged.
Ignore unmodifiable tag
If checked, terms tagged as unmodifiable are preprocessed too.
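
How the options above interact can be sketched with a small toy model. This is not KNIME code and not the Kuhlen algorithm: the `Term` class, `toy_stem`, `stem_term` and `process_row` are hypothetical names invented for illustration, the stemmer is a placeholder that only strips a trailing "s", and the row layout (one term plus the document it occurs in) is an assumption based on the port descriptions.

```python
from dataclasses import dataclass, replace
from typing import Dict, List


@dataclass(frozen=True)
class Term:
    """Hypothetical stand-in for a term; 'unmodifiable' mimics the unmodifiable tag."""
    text: str
    unmodifiable: bool = False


def toy_stem(word: str) -> str:
    # Placeholder only, NOT the Kuhlen rules: strip a single trailing "s".
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        return word[:-1]
    return word


def stem_term(term: Term, ignore_unmodifiable: bool) -> Term:
    # Unmodifiable terms are skipped unless "Ignore unmodifiable tag" is set.
    if term.unmodifiable and not ignore_unmodifiable:
        return term
    return replace(term, text=toy_stem(term.text))


def process_row(term: Term, document: List[Term], deep: bool = True,
                append_unchanged: bool = False,
                ignore_unmodifiable: bool = False) -> Dict[str, object]:
    row: Dict[str, object] = {"term": stem_term(term, ignore_unmodifiable)}
    if deep:
        # Deep preprocessing: the terms inside the document are stemmed too.
        row["document"] = [stem_term(t, ignore_unmodifiable) for t in document]
    else:
        row["document"] = list(document)
    if append_unchanged:
        # Keep the original document next to the changed one.
        row["original_document"] = list(document)
    return row


doc = [Term("cats"), Term("Paris", unmodifiable=True)]
row = process_row(Term("cats"), doc, deep=True, append_unchanged=True)
print(row["term"].text, [t.text for t in row["document"]])
```

In this sketch, enabling the deep option is what makes the document column differ from the original, which is why appending the unchanged documents only matters in combination with it.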

Input Ports

The input table which contains the terms to stem.

Output Ports

The output table which contains the stemmed terms.

Installation

To use this node in KNIME, install KNIME Textprocessing Plug-in from the following update site:

KNIME 4.0