Stop Word Filter

This node is deprecated. It has been replaced by a new and improved version. The old version is kept for backwards compatibility, but for all new workflows we suggest using the replacement node: Stop Word Filter.

Filters out all terms of the input documents that are contained in the specified stop word list. The node provides built-in stop word lists for various languages. Alternatively, stop words can be loaded from a file, whose path is specified in the node dialog. The file must contain exactly one stop word per line.
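
As a rough illustration of the expected file format and of the filtering step, here is a minimal Python sketch. It is not the node's implementation: the function names load_stop_words and filter_terms, the sample stop words, and the choice to skip empty lines are all assumptions made for this example.

```python
import os
import tempfile


def load_stop_words(path):
    """Read a stop word file that contains one stop word per line."""
    with open(path, encoding="utf-8") as handle:
        # Strip surrounding whitespace and skip empty lines
        # (an assumption of this sketch, not documented node behavior).
        return {line.strip() for line in handle if line.strip()}


def filter_terms(terms, stop_words):
    """Return the terms that are not contained in the stop word list."""
    return [term for term in terms if term not in stop_words]


# Write a small example stop word file: one stop word per line.
with tempfile.NamedTemporaryFile(
    "w", suffix=".txt", delete=False, encoding="utf-8"
) as tmp:
    tmp.write("the\nis\na\n")
    stop_word_path = tmp.name

stop_words = load_stop_words(stop_word_path)
os.remove(stop_word_path)  # clean up the temporary example file

filtered = filter_terms(["the", "cat", "is", "a", "mammal"], stop_words)
print(filtered)  # ['cat', 'mammal']
```

A set is used for the stop words so that each membership check is a constant-time lookup, which matters when many documents are filtered against a long list.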

Options

Filter options

Case sensitive
If checked, stop words are matched case-sensitively; otherwise case is ignored.
Use built-in list
Specifies whether a built-in stop word list is used. If checked, the specified file path is ignored.
Stopword lists
The available built-in stop word lists. If "Use built-in list" is checked, select the list to use here.
Selected file
The location of the stop word file. The file must contain exactly one stop word per line.
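
The effect of the "Case sensitive" option can be sketched as follows. This is a hedged illustration only: the function name filter_stop_words and its case_sensitive parameter are hypothetical, and lowercasing both sides is just one plausible way to ignore case.

```python
def filter_stop_words(terms, stop_words, case_sensitive=False):
    """Drop every term that matches a stop word.

    With case_sensitive=True a term must match a stop word exactly.
    Otherwise both sides are lowercased before comparison
    (an assumption of this sketch).
    """
    if case_sensitive:
        return [term for term in terms if term not in stop_words]
    lowered = {word.lower() for word in stop_words}
    return [term for term in terms if term.lower() not in lowered]


terms = ["The", "cat", "the", "mat"]
stop_words = {"the"}

strict = filter_stop_words(terms, stop_words, case_sensitive=True)
relaxed = filter_stop_words(terms, stop_words, case_sensitive=False)

print(strict)   # ['The', 'cat', 'mat']
print(relaxed)  # ['cat', 'mat']
```

With case-sensitive matching, the capitalized "The" at the start of a sentence survives the filter unless the stop word list also contains that spelling.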

Preprocessing options

Document column
The column containing the documents to preprocess.
Replace documents
If checked, the documents are replaced by the new preprocessed documents. Otherwise, the preprocessed documents are appended as a new column.
Append column
The name of the newly appended column that contains the preprocessed documents.
Ignore unmodifiable tag
If checked, terms tagged as unmodifiable are preprocessed too.
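
The interplay of the preprocessing options can be sketched in Python as below. This is an illustrative model under stated assumptions, not the node's code: the Term class, the preprocess and apply_to_row functions, the dictionary-based row, and the default column name "Preprocessed Document" are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Term:
    """Hypothetical stand-in for a document term with an unmodifiable flag."""
    text: str
    unmodifiable: bool = False


def preprocess(terms, stop_words, ignore_unmodifiable=False):
    """Remove stop words; unmodifiable terms are kept unless the flag is ignored."""
    kept = []
    for term in terms:
        protected = term.unmodifiable and not ignore_unmodifiable
        if protected or term.text not in stop_words:
            kept.append(term)
    return kept


def apply_to_row(row, document_column, stop_words, replace=True,
                 append_column="Preprocessed Document",
                 ignore_unmodifiable=False):
    """Return a copy of the row with the document replaced or appended."""
    result = dict(row)
    processed = preprocess(row[document_column], stop_words, ignore_unmodifiable)
    # Either overwrite the document column or write to a new column.
    result[document_column if replace else append_column] = processed
    return result


row = {"Document": [Term("the", unmodifiable=True), Term("cat"), Term("is")]}
stop_words = {"the", "is"}

replaced = apply_to_row(row, "Document", stop_words, replace=True)
appended = apply_to_row(row, "Document", stop_words, replace=False)
forced = apply_to_row(row, "Document", stop_words, ignore_unmodifiable=True)

print([t.text for t in replaced["Document"]])               # ['the', 'cat']
print([t.text for t in appended["Preprocessed Document"]])  # ['the', 'cat']
print([t.text for t in forced["Document"]])                 # ['cat']
```

In this model the unmodifiable "the" survives filtering by default and is removed only when the unmodifiable tag is ignored, while the append variant leaves the original document column untouched.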

Input Ports

The input table which contains the documents to preprocess.

Output Ports

The output table which contains the preprocessed documents.

Views

This node has no views
