Spacy Tokenizer

The node converts a String or Document column containing raw text into a KNIME Document column, using the tokenizer of the provided spaCy model.
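As a rough illustration of what tokenizing with a spaCy model looks like outside KNIME, the sketch below runs only the tokenizer of a pipeline on a raw string. It is not the node's implementation. The blank English pipeline and the helper name `tokenize` are assumptions made for the example; in a workflow, the model comes from the node's model input port.

```python
import spacy

# Assumption: a blank English pipeline stands in for the spaCy model
# supplied on the node's model port; it needs no model download.
nlp = spacy.blank("en")


def tokenize(text: str) -> list[str]:
    """Return the token texts produced by the pipeline's tokenizer alone."""
    doc = nlp.tokenizer(text)  # runs only tokenization, no other components
    return [token.text for token in doc]


tokens = tokenize("KNIME nodes can wrap spaCy.")
print(tokens)  # ['KNIME', 'nodes', 'can', 'wrap', 'spaCy', '.']
```

Calling `nlp.tokenizer` directly, rather than `nlp(text)`, skips any other pipeline components, which matches the node's stated scope of tokenization only.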

Options

Select column
Select the String or Document column to be tokenized and converted to a Document column.
Replace column
If checked, the selected column will be replaced by the new documents. Otherwise the documents will be appended as a new column.
Append column
The name of the newly appended column, containing the preprocessed documents.
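The replace and append options can be pictured with a small sketch over rows held as dictionaries. This is only a hypothetical model of the behavior described above, not KNIME code: the function `apply_output_option`, its parameters, and the default column name `"Document"` are all invented for illustration.

```python
def apply_output_option(rows, selected, documents, replace, append_name="Document"):
    """Return new rows where each document replaces `selected` or is appended.

    rows        -- list of dicts, one per table row (hypothetical table model)
    selected    -- name of the column that was tokenized
    documents   -- one tokenized document per row
    replace     -- True mimics "Replace column" being checked
    append_name -- name of the new column when `replace` is False
    """
    target = selected if replace else append_name
    # Build new dicts so the input rows are left unmodified.
    return [{**row, target: doc} for row, doc in zip(rows, documents)]


rows = [{"Text": "Hello world."}]
docs = [["Hello", "world", "."]]

replaced = apply_output_option(rows, "Text", docs, replace=True)
appended = apply_output_option(rows, "Text", docs, replace=False, append_name="Tokens")
print(replaced)  # [{'Text': ['Hello', 'world', '.']}]
print(appended)  # [{'Text': 'Hello world.', 'Tokens': ['Hello', 'world', '.']}]
```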

Python

Python
Select one of the Python execution environment options:
  • use default Python environment for the Redfield NLP nodes
  • use Conda environment specified by a Conda flow variable (only selectable if such a flow variable is available)

Input Ports

The spaCy model.
The input table which contains the documents to preprocess.

Output Ports

The spaCy model.
The output table which contains the preprocessed documents.

Views

This node has no views.
