0 ×

13 Clustering - solution

Workflow

Text Mining Course: Clustering (solution)
educationsolution
Reading Textual Data Enrichment Preprocessing Transformation Clustering Solution Solution: Clustering- What groups of documents are in the data?- Compute pairwise cosine distances- Apply hierarchical clustering- View dendrogram to find out the number of clusters (k)- Assign k clusters- Apply k-Medoids with k as number of clusters- Select documents of one cluster in dendrogram, hilite them, and inspect data in a table view Tagging (POS)Creation ofdocument vectorsFiltering, Stemming, ...Filtering based onoccurrencesCompute cosinedistancesDendrogramComplete LinkageRead documents fromclustering data setWITHOUT TITLESK=2Read documents fromclustering data setWITH TITLESAssign 2 clusters Enrichment Transformation Preprocessing I Preprocessing II Distance MatrixCalculate HierarchicalCluster View Hierarchical Clustering(DistMatrix) Table Reader k-Medoids Table Reader HierarchicalCluster Assigner Composite view of atable and dendrogram Table View Reading Textual Data Enrichment Preprocessing Transformation Clustering Solution Solution: Clustering- What groups of documents are in the data?- Compute pairwise cosine distances- Apply hierarchical clustering- View dendrogram to find out the number of clusters (k)- Assign k clusters- Apply k-Medoids with k as number of clusters- Select documents of one cluster in dendrogram, hilite them, and inspect data in a table view Tagging (POS)Creation ofdocument vectorsFiltering, Stemming, ...Filtering based onoccurrencesCompute cosinedistancesDendrogramComplete LinkageRead documents fromclustering data setWITHOUT TITLESK=2Read documents fromclustering data setWITH TITLESAssign 2 clusters Enrichment Transformation Preprocessing I Preprocessing II Distance MatrixCalculate HierarchicalCluster View Hierarchical Clustering(DistMatrix) Table Reader k-Medoids Table Reader HierarchicalCluster Assigner Composite view of atable and dendrogram Table View

Download

Get this workflow from the following link: Download

Nodes

13 Clustering - solution consists of the following 26 nodes(s):

Plugins

13 Clustering - solution contains nodes provided by the following 5 plugin(s):