Stop word Filter

This Node Is Deprecated — This node is kept for backwards-compatibility, but the usage in new workflows is no longer recommended. The documentation below might contain more information.

Filters all terms contained in the specified stop word file. Stop words need to be written among each other in one column, in a way that each line contains only one stop word.

Options

Preprocessing options

Case sensitive
If checked the stop words are matched case sensitive otherwise not.
Use built-in list
Specifies if built-in stop word list is used or not. If checked the specified path will be ignored.
Stopword lists
Available built-in stop word lists. If "Use built-in list" is checked, the built-in stop word list to use can be specified here.
Selected file
The location of the stop word file. Stop words need to be written below each other in one column, in a way that each line contains only one stop word.

Deep preprocessing options

Deep preprocessing
If deep preprocessing is checked, the terms contained inside the documents are preprocessed too, this means that the documents themselves are changed too, which is more time consuming.
Document column
Specifies the column containing the documents to preprocess.
Append unchanged documents
If checked, the documents contained in the specified "Original Document" column are appended unchanged even if deep preprocessing is checked. This helps to keep the original documents in the output data table without the agonizing pain of joining.
Original Document column
Specifies the column containing the original documents which can be attached unchanged.
Ignore unmodifiable tag
If checked unmodifiable terms will be preprocessed too.

Input Ports

Icon
The input table which contains the terms to convert.

Output Ports

Icon
The output table which contains the preprocessed terms.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.