Spacy Lemmatizer

The node converts all tokens to their root form (lemma), removing cases, plurals, conjugations, etc. Not all spaCy models contain lemmatizer. Different spaCy models might lack this model.

Options

Select column
Select a Document column that will be affected by lemmatization.
Replace column
If checked, the document column will be replaced by the new preprocessed documents. Otherwise the preprocessed documents will be appended as a new column.
Append column
The name of the new appended column, containing the preprocessed documents.

Model

spaCy model
Pick one of the official spaCy models, or refer to a custom model stored in the filesystem. In the latter case refer to a folder with meta.json and config files.

Python

Python
Select one of Python execution environment options:
  • use default Python environment for Deep Learning
  • use Conda environment

Input Ports

Icon
The input table which contains the documents to preprocess.

Output Ports

Icon
The output table which contains the preprocessed documents.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.