Dictionary tagger

This Node Is Deprecated — This node is kept for backwards-compatibility, but the usage in new workflows is no longer recommended. The documentation below might contain more information.

This node recognizes named entities specified in a dictionary column and assigns a specified tag value and type. Optionally the recognized named entity terms can be set unmodifiable, meaning that the terms are not modified or filtered afterwards by any following node.

Options

Tagger options

Dictionary column
Specifies the dictionary column containing the terms to search for.
Set named entities unmodifiable
Sets recognized named entity terms unmodifiable.
Case sensitive
If checked, case sensitive named entity recognition will be applied, otherwise not.
Exact match
If checked, terms are tagged as as named entities only if they match exactly with an entity to find. Otherwise terms are tagged if they contain the entity string.
Tag type
Specifies the tag type of which tag values can be chosen.
Tag value
Specifies the tag value to use for tagging recognized named entities.

General options

Number of maximal parallel tagging processes
Defines the maximal number of parallel threads that are used for tagging. Please note, that for each thread a tagging model will be loaded into memory. If this value is set to a number greater than 1, make sure that enough heap space is available, in order to be able to load the models. If you are not sure how much heap is available for KNIME, leave the number to 1.
Word tokenizer
Select the tokenizer used for word tokenization. Go to Preferences -> KNIME -> Textprocessing to read the description for each tokenizer.

Input Ports

Icon
The input table containing the documents to tag.
Icon
The input table containing the dictionary column.

Output Ports

Icon
An output table containing the tagged documents.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.