This category contains 8 nodes.
The node converts all tokens to their root form (lemma), removing cases, plurals, conjugations, etc.
The node allows to select and load a spaCy model.
The node performs morphology analysis of the text and assigns the tags for singular/plural, gender, case, conjugation, animacy, etc. for the tokens.
The node assigns named entity tags to the words of the document.
The node assigns part of speech to each token of the document.
The node filters out words that are identified as stop words by the provided spaCy model.
The node converts a string column with raw text to a KNIME Document column using the tokenizer of the provided spaCy model.
Maps String or Document data to a numerical vector (list of doubles) according to the embedder provided by the spaCy model.