There are 41 nodes that can be used as successor
for a node with an output port of type spaCy pipeline.
The node converts all tokens to their root form (lemma), removing cases, plurals, conjugations, etc.
The node performs morphology analysis of the text and assigns the tags for singular/plural, gender, case, conjugation, animacy, etc. for the tokens.
The node assigns named entity tags to the words of the document.
The node assigns part of speech to each token of the document.
The node filters out words that are identified as stop words by the provided spaCy model.
The node converts a string column with raw text to a KNIME Document column using the tokenizer of the provided spaCy model.
Maps String or Document data to a numerical vector (list of doubles) according to the embedder provided by the spaCy model.
Compares and scores phrase similarity between two tables using subword-based approximate matching algorithms.
Calculates similarities between strings and allows for filtering.
Builds a searchable edit-distance index over a multi-word phrase column. Allows splitting phrases into individual terms using a configurable delimiter.