There are 8 nodes that can be used as predecessors for a node with an input port of type spaCy pipeline.
The node converts all tokens to their root form (lemma), removing cases, plurals, conjugations, etc.
The node lets you select and load a spaCy model.
The node performs morphological analysis of the text and assigns tags such as number (singular/plural), gender, case, conjugation, and animacy to the tokens.
The node assigns named entity tags to the words of the document.
The node assigns a part-of-speech tag to each token of the document.
The node filters out words that are identified as stop words by the provided spaCy model.
The node converts a string column with raw text to a KNIME Document column using the tokenizer of the provided spaCy model.
The node maps String or Document data to a numerical vector (a list of doubles) using the embedder provided by the spaCy model.
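For readers who want to see roughly what the tokenizer and stop word nodes do, here is a minimal Python sketch using spaCy directly. It is not the nodes' actual implementation. It assumes spaCy is installed and uses a blank English pipeline (`spacy.blank("en")`) as a stand-in for the model a workflow would load. The helper names `tokenize` and `remove_stop_words` are hypothetical. Lemmatization, POS tagging, morphology, NER, and vectorization would additionally require a trained model and are not shown.

```python
import spacy

# Assumption: a blank English pipeline stands in for the spaCy model that
# would be selected in the workflow. It provides only the tokenizer and the
# language's default stop-word list, with no trained components.
nlp = spacy.blank("en")


def tokenize(text):
    """Split raw text into token strings using the pipeline's tokenizer."""
    return [token.text for token in nlp(text)]


def remove_stop_words(text):
    """Keep only the tokens that the pipeline does not flag as stop words."""
    return [token.text for token in nlp(text) if not token.is_stop]


if __name__ == "__main__":
    sentence = "The cats are sleeping."
    print(tokenize(sentence))
    print(remove_stop_words(sentence))
```

Swapping `spacy.blank("en")` for a trained pipeline loaded with `spacy.load(...)` would make attributes such as `token.lemma_`, `token.pos_`, and `doc.ents` available as well, provided that model is installed.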