This workflow can be run on its own, but it is in fact complementary to a main workflow. The main workflow must be executed at least once beforehand, at least up to the node before the lexicon text mining module and the related table writer, so that the necessary .table file is saved in the working directory. The main workflow is in the same directory as this one on KNIME Hub.
This workflow performs lexicon-based text mining on the movie titles using the three features present in the VAD dictionary: Valence, Arousal and Dominance. After the title length is added as a new variable, words are tagged with a POS tagger and with the VAD dictionary. Then, in the pre-processing part, punctuation is erased, numbers are filtered out, everything is converted to lower case, and all titles without any tag are filtered out (for example, titles consisting only of proper names).
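The pre-processing steps described above can be sketched outside KNIME in a few lines of Python. This is only an illustration under stated assumptions, not the workflow's actual node configuration: the lexicon `VAD_WORDS`, the function names and the sample titles are hypothetical, the title length is assumed to be a character count, and the sketch cleans the text before the lexicon lookup, whereas the workflow tags first.

```python
import string

# Hypothetical mini-lexicon; the real word list comes from the VAD dictionary.
VAD_WORDS = {"dark", "love", "war"}


def preprocess_title(title):
    """Lower-case the title, erase punctuation and drop purely numeric tokens."""
    cleaned = title.lower().translate(str.maketrans("", "", string.punctuation))
    return [token for token in cleaned.split() if not token.isdigit()]


def tagged_titles(titles, lexicon):
    """Keep only the titles that contain at least one lexicon term."""
    kept = {}
    for title in titles:
        terms = [t for t in preprocess_title(title) if t in lexicon]
        if terms:  # titles without any tagged term are filtered out
            # Assumption: title length is measured in characters.
            kept[title] = {"length": len(title), "terms": terms}
    return kept


titles = ["The Dark Knight", "Amélie", "Love & War 2"]
kept = tagged_titles(titles, VAD_WORDS)
```

In this sketch "Amélie" is dropped because it contains no lexicon term, which mirrors the case of titles made up only of proper names.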
After the bag of words is created (and a Term column is added to keep each term without the tag attached), a Joiner is used to attach to each tagged term its Valence, Arousal and Dominance values from the VAD dictionary. Finally, a GroupBy node groups the terms by movie title, keeping the main data about the film and the Valence, Arousal and Dominance values of the title.
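The join and group-by steps can be illustrated with a small Python sketch. The description does not state which aggregation the GroupBy node applies, so the mean per title is an assumption here; the VAD values, the sample bag of words and the name `score_titles` are likewise invented for illustration.

```python
from statistics import mean

# Hypothetical (valence, arousal, dominance) values, invented for illustration.
VAD = {
    "dark": (0.25, 0.5, 0.5),
    "love": (1.0, 0.5, 0.75),
    "war": (0.0, 1.0, 0.5),
}

# Bag of words: one (title, term) row per tagged term.
bag = [
    ("The Dark Knight", "dark"),
    ("Love & War 2", "love"),
    ("Love & War 2", "war"),
]


def score_titles(bag_of_words, lexicon):
    """Join each term with its VAD values, then aggregate per title."""
    grouped = {}
    for title, term in bag_of_words:
        if term in lexicon:  # inner join on the term
            grouped.setdefault(title, []).append(lexicon[term])
    # Assumption: the title score is the mean of its terms' values.
    return {
        title: tuple(mean(dimension) for dimension in zip(*rows))
        for title, rows in grouped.items()
    }


scores = score_titles(bag, VAD)
```

Each title ends up with one (valence, arousal, dominance) triple; a different aggregation, such as a sum or a median, would only change the last step.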