Here we execute the workflow in a streming fashion. The aim of this workflow is to create a vector space with the collection of documents being analzsed, bz using the Document Vector Hashing node. The node creates document vectors with a fixed number of dimensions using various hashing methods.
This workflow starts reading the data and converts the strings into documents, which are then preprocessed, i.e. filtered and stemmed; all in a streaming fashion. All the preprocessing steps take place in the Streaming Pre-processing component. Then a bag of word is created and finally the documents are transformed into numerical/binary document vectors with the Document vector hashin node. The all workflow is executed in a streaming fashion.
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com, follow @NodePit on Twitter or botsin.space/@nodepit on Mastodon.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.