Node Connectivity

There are 41 nodes that can be used as successor for a node with an output port of type spaCy pipeline.

Spacy Lemmatizer 

The node converts all tokens to their root form (lemma), removing cases, plurals, conjugations, etc.

Spacy Morphologizer 

The node performs morphology analysis of the text and assigns the tags for singular/plural, gender, case, conjugation, animacy, etc. for the tokens.

Spacy NER 

The node assigns named entity tags to the words of the document.

Spacy POS Tagger 

The node assigns part of speech to each token of the document.

Spacy Stop Word Filter 

The node filters out words that are identified as stop words by the provided spaCy model.

Spacy Tokenizer 

The node converts a string column with raw text to a KNIME Document column using the tokenizer of the provided spaCy model.

Spacy Vectorizer 

Maps String or Document data to a numerical vector (list of doubles) according to the embedder provided by the spaCy model.

Approximate Phrase Matcher (Labs) StreamableModern UI

Compares and scores phrase similarity between two tables using subword-based approximate matching algorithms.

Approximate String Matcher (Labs) StreamableModern UI

Calculates similarities between strings and allows for filtering.

Phrase Indexer (Labs) StreamableModern UI

Builds a searchable edit-distance index over a multi-word phrase column. Allows splitting phrases into individual terms using a configurable delimiter.