Dictionary Filter

Filters all terms of the input documents, which are contained in the dictionary provided by the second input port. As dictionary a string column of the second input port can be specified.

Options

Filter options

Case sensitive
If checked the words are matched case sensitive otherwise not.
Column containing the strings to filter
The column containing the strings (terms) to filter.

Preprocessing options

Document column
The column containing the documents to preprocess.
Replace documents
If checked, the documents will be replaced by the new preprocessed documents. Otherwise the preprocessed documents will be appended as new column.
Append column
The name of the new appended column, containing the preprocessed documents.
Ignore unmodifiable tag
If checked, unmodifiable terms will be preprocessed too.

Input Ports

Icon
The input table which contains the documents to preprocess.
Icon
The input table containing at least one the string column (dictionary).

Output Ports

Icon
The output table which contains the preprocessed documents.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.