Chapter6

This directory contains 6 workflows.

01_Topic_Detection Clustering

After classic pre-processing (in streaming mode!) we:
- extract the document keywords with a KeyGraph Keyword Extractor node.
- display the words […]

02_Topic_Detection_LDA

This workflow extracts topics from the KNIME Forum dataset using the Topic Extractor (Parallel LDA) node.

03_Document_Classification

This is a workflow for topic classification. After converting the Documents into word vectors, it becomes a traditional classification problem which can […]
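As a rough illustration of the idea (not the workflow itself, which is built from KNIME nodes), the sketch below turns documents into bag-of-words count vectors and then treats them as ordinary feature rows for a classifier. The example texts, the labels, and the nearest-centroid classifier are all assumptions chosen to keep the example short; the workflow may use a different learner.

```python
from collections import Counter
import math


def tokenize(text):
    # Very simple tokenizer: lowercase and split on whitespace.
    return text.lower().split()


def build_vocabulary(documents):
    # One vector position per distinct term seen in the training documents.
    return sorted({token for doc in documents for token in tokenize(doc)})


def to_word_vector(text, vocabulary):
    # Count how often each vocabulary term occurs in the document.
    counts = Counter(tokenize(text))
    return [counts[term] for term in vocabulary]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train_centroids(vectors, labels):
    # Average the word vectors of each class (nearest-centroid classifier).
    sums = {}
    counts = Counter(labels)
    for vector, label in zip(vectors, labels):
        acc = sums.setdefault(label, [0.0] * len(vector))
        for i, value in enumerate(vector):
            acc[i] += value
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def predict(vector, centroids):
    # Pick the class whose centroid is most similar to the document vector.
    return max(centroids, key=lambda label: cosine(vector, centroids[label]))


# Made-up training data, for illustration only.
train_docs = [
    "node fails with memory error",
    "workflow crashes with java error",
    "how to install text processing extension",
    "install extension from update site",
]
train_labels = ["bug", "bug", "setup", "setup"]

vocabulary = build_vocabulary(train_docs)
train_vectors = [to_word_vector(doc, vocabulary) for doc in train_docs]
centroids = train_centroids(train_vectors, train_labels)

prediction = predict(to_word_vector("java memory error in node", vocabulary), centroids)
print(prediction)
```

Once the documents are numeric vectors, any standard classifier can take the place of the centroid step.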

04_Word_Embedding_Clax

Word Embedding for Classification. The Word2Vec Learner node here is trained on a training set extracted from the Human AIDS vs. Mouse Cancer dataset to […]

05_Word_Embedding_Distance

Here we use word embedding instead of one-hot encoding, using a Word2Vec Learner node. The hidden layer size is set to 10, therefore producing an embedding with […]
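To make the contrast concrete, here is a minimal sketch comparing one-hot vectors with 10-dimensional dense vectors. The vocabulary is made up, and the dense values are random placeholders standing in for what a trained Word2Vec model would learn, so the distances carry no meaning here; only the vector sizes and the way distances are computed are the point.

```python
import math
import random

# Made-up vocabulary, for illustration only.
vocabulary = ["cancer", "tumor", "mouse", "human", "virus"]
EMBEDDING_SIZE = 10  # matches the hidden layer size in the description


def one_hot(word):
    # One position per vocabulary word; a single 1.0 marks the word.
    return [1.0 if term == word else 0.0 for term in vocabulary]


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Placeholder dense vectors: random values, NOT a trained embedding.
rng = random.Random(0)
embedding = {
    word: [rng.uniform(-1.0, 1.0) for _ in range(EMBEDDING_SIZE)]
    for word in vocabulary
}

# With one-hot encoding, every pair of distinct words is equally far apart.
one_hot_distances = {
    euclidean(one_hot(a), one_hot(b))
    for a in vocabulary
    for b in vocabulary
    if a != b
}

# With dense vectors, distances differ from pair to pair.
embedding_distances = {
    (a, b): euclidean(embedding[a], embedding[b])
    for i, a in enumerate(vocabulary)
    for b in vocabulary[i + 1:]
}

print(len(one_hot("cancer")), len(embedding["cancer"]))
print(one_hot_distances)
```

One-hot vectors grow with the vocabulary and cannot express that two words are related, while an embedding keeps a fixed size and lets a distance measure reflect similarity once the vectors are trained.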