Icon

03_​Applying_​Text_​and_​Network_​Analysis_​Techniques_​to_​Forums

Text and Network Analysis of the KNIME Forum

This workflow draws the network of contacts in the forum and the word cloud of the posts for each forum section.

Network Mining Text Mining General ReportingIn order to see the report execute the entire workflow and thenclick "Open the report" button in the toolbar. Reads contents from the KNIME Forum site after web-crawlingThis node reads the pages downloaded from the KNIME Forum web site () afterweb crawling.The web crawling part used to be here as well as an alternative to read theprepared data. However, the KNIME Forum web site has changed over theyears and the old workflows does not parse the extract content correctlyanymore.If you want to implement your own web crawler, use the Palladian or theSelenium nodes. Palladian extension is available in the community extensions.The Selenium nodes can be downloaded from: http://seleniumnodes.com/ KNIME Forum: Text and Network AnalysisThis workflow draws the network of contacts in the forum and the word cloud of the posts for each forum section.This workflow is quite computational intensive. Make sure to run it on a machine with enough memory. desc. abs. freq.top 200 terms by abs. term freq.by categoryposts with commentsgroup links between the same users and topicempty networkall componentsall user names1: posts2: commentscreate link tableAll connected componentsPOS, persons and KNIMEnode namesall categoriesall categoriesnetwork miningtext miningremove topicswith only one posttop 3 posterand commentersmost active usersby categoryby category & dateremote reading from servernew posts orcommentsappend category column Sorter Row Filter Extract Term Tags Group Loop Start Object Inserter GroupBy Network Creator Column Resorter Joiner GroupBy Strings To Document(deprecated) Tag Cloud GroupBy Filtering &Stemming Row Splitter Joiner Column Rename Loop End (2 ports) Image To Table Extract components Column Resorter Column Rename Column Rename Analyze LargestComponent Tagging Frequencies GroupBy Data to Report Data to Report Data to Report Nominal ValueRow Filter User Stats Data to Report Group Loop Start Loop End Sorter Time to String(legacy) Table Reader Rule Engine Double To Int Variable to TableColumn (deprecated) Network Mining Text Mining General ReportingIn order to see the report execute the entire workflow and thenclick "Open the report" button in the toolbar. Reads contents from the KNIME Forum site after web-crawlingThis node reads the pages downloaded from the KNIME Forum web site () afterweb crawling.The web crawling part used to be here as well as an alternative to read theprepared data. However, the KNIME Forum web site has changed over theyears and the old workflows does not parse the extract content correctlyanymore.If you want to implement your own web crawler, use the Palladian or theSelenium nodes. Palladian extension is available in the community extensions.The Selenium nodes can be downloaded from: http://seleniumnodes.com/ KNIME Forum: Text and Network AnalysisThis workflow draws the network of contacts in the forum and the word cloud of the posts for each forum section.This workflow is quite computational intensive. Make sure to run it on a machine with enough memory. desc. abs. freq.top 200 terms by abs. term freq.by categoryposts with commentsgroup links between the same users and topicempty networkall componentsall user names1: posts2: commentscreate link tableAll connected componentsPOS, persons and KNIMEnode namesall categoriesall categoriesnetwork miningtext miningremove topicswith only one posttop 3 posterand commentersmost active usersby categoryby category & dateremote reading from servernew posts orcommentsappend category column Sorter Row Filter Extract Term Tags Group Loop Start Object Inserter GroupBy Network Creator Column Resorter Joiner GroupBy Strings To Document(deprecated) Tag Cloud GroupBy Filtering &Stemming Row Splitter Joiner Column Rename Loop End (2 ports) Image To Table Extract components Column Resorter Column Rename Column Rename Analyze LargestComponent Tagging Frequencies GroupBy Data to Report Data to Report Data to Report Nominal ValueRow Filter User Stats Data to Report Group Loop Start Loop End Sorter Time to String(legacy) Table Reader Rule Engine Double To Int Variable to TableColumn (deprecated)

Nodes

Extensions

Links