spaCy pipeline — NodePit

There are 8 nodes that can be used as predessesor for a node with an input port of type

The node converts all tokens to their root form (lemma), removing cases, plurals, conjugations, etc.

The node allows to select and load a spaCy model.

The node performs morphology analysis of the text and assigns the tags for singular/plural, gender, case, conjugation, animacy, etc. for the tokens.

The node assigns named entity tags to the words of the document.

The node assigns part of speech to each token of the document.

The node filters out words that are identified as stop words by the provided spaCy model.

The node converts a string column with raw text to a KNIME Document column using the tokenizer of the provided spaCy model.

Maps String or Document data to a numerical vector (list of doubles) according to the embedder provided by the spaCy model.

Node Connectivity