POS tagger

Streamable, Deprecated
KNIME Textprocessing Plug-in version 4.0.2.v201909251213 by KNIME AG, Zurich, Switzerland

This node assigns a part-of-speech (POS) tag to each term of a document. It uses the Penn Treebank tag set; for details see http://www.cis.upenn.edu/~treebank. The tagger model that decides which tag to assign to which term is a model from the OpenNLP framework, version 1.5.2 (see http://opennlp.apache.org/documentation.html for details).
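As a rough illustration of what Penn Treebank tagging output looks like, the sketch below pairs each term with a tag using a tiny hand-written lookup table. The lexicon, the `toy_tag` function, and the fallback to `NN` for unknown terms are assumptions made for this example only; the node itself relies on a trained OpenNLP model, not on a dictionary.

```python
# Toy illustration only: a hand-written lookup, not the OpenNLP model used by the node.
PENN_TAGS = {
    "DT": "determiner",
    "JJ": "adjective",
    "NN": "noun, singular or mass",
    "VBZ": "verb, 3rd person singular present",
    "IN": "preposition or subordinating conjunction",
}

# Hypothetical mini-lexicon mapping lowercase terms to Penn Treebank tags.
LEXICON = {
    "the": "DT",
    "quick": "JJ",
    "fox": "NN",
    "jumps": "VBZ",
    "over": "IN",
    "dog": "NN",
}


def toy_tag(terms):
    """Return (term, tag) pairs; unknown terms fall back to NN (an assumption)."""
    return [(t, LEXICON.get(t.lower(), "NN")) for t in terms]


if __name__ == "__main__":
    for term, tag in toy_tag("The quick fox jumps over the dog".split()):
        print(f"{term}/{tag}  ({PENN_TAGS[tag]})")
```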

Options

General options

Number of maximal parallel tagging processes
Defines the maximum number of parallel threads used for tagging. Note that a tagging model is loaded into memory for each thread. If this value is set to a number greater than 1, make sure that enough heap space is available to load the models. If you are not sure how much heap is available to KNIME, leave the value at 1.
Word tokenizer
Select the tokenizer used for word tokenization. Go to Preferences -> KNIME -> Textprocessing to read the description for each tokenizer.
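The memory note on the parallel tagging option can be sketched as follows: each worker thread lazily loads its own copy of a model, so the number of model instances held in memory grows with the thread count. `ToyModel`, `tag_documents`, and the capitalization-based tagging rule are hypothetical stand-ins for illustration and are not part of KNIME or OpenNLP.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class ToyModel:
    """Hypothetical stand-in for a tagger model; counts how many copies get loaded."""

    loaded = 0
    _lock = threading.Lock()

    def __init__(self):
        with ToyModel._lock:
            ToyModel.loaded += 1

    def tag(self, terms):
        # Placeholder rule: capitalized terms become NNP, everything else NN.
        return [(t, "NNP" if t[:1].isupper() else "NN") for t in terms]


_local = threading.local()


def _model_for_this_thread():
    # One model per worker thread, mirroring "a model is loaded for each thread".
    if not hasattr(_local, "model"):
        _local.model = ToyModel()
    return _local.model


def tag_documents(documents, max_threads=1):
    """Tag each document (a list of terms) using at most max_threads workers."""
    with ThreadPoolExecutor(max_workers=max_threads) as pool:
        return list(pool.map(lambda doc: _model_for_this_thread().tag(doc), documents))


if __name__ == "__main__":
    docs = [["KNIME", "tags", "documents"], ["Zurich", "is", "nice"]] * 4
    tagged = tag_documents(docs, max_threads=4)
    print(len(tagged), "documents tagged;", ToyModel.loaded, "model copies loaded")
```

With `max_threads=1` only one model copy is held at a time, which matches the recommendation to keep the setting at 1 when the available heap is uncertain.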

Input Ports

The input table containing the documents to tag.

Output Ports

An output table containing the tagged documents.

Installation

To use this node in KNIME, install KNIME Textprocessing from the following update site:

KNIME 4.0