Icon

Chapter4

This directory contains 7 workflows.

Icon01_​Frequencies 

After creating BoW we can compute frequencies and other important measures based on terms, characters, co-occurences, etc. The nodes applied are TF, IDF, […]

Icon02_​DocumentVector_​Creation 

Here we transform the collection of documents into numerical vectors. The dataset used in this example is the KNIME Forum Dataset. After the pre-processing […]

Icon03_​Streaming_​DocumentVector 

Here we execute the workflow in a streming fashion. The aim of this workflow is to create a vector space with the collection of documents being analzsed, bz […]

Icon04_​Keyword_​Extraction 

Here we perform an automated keyword extraction from a subset of the given collection of documents. In this example workflow the Chi-square keyword […]