Icon

13 Clustering - exercise

Text Mining Course: Clustering

- What groups of documents are in the data?
- Compute pairwise cosine distances
- Apply hierarchical clustering
- View dendrogram to find out the number of clusters (k)
- Assign k clusters
- Apply k-Medoids with k as number of clusters
- Select documents of one cluster in dendrogram, hilite them, and inspect data in a table view

Exercise: Clustering- What groups of documents are in the data?- Compute pairwise cosine distances- Apply hierarchical clustering- View dendrogram to find out the number of clusters (k)- Assign k clusters- Apply k-Medoids with k as number of clusters- Select documents of one cluster in dendrogram, hilite them, and inspect data in a table view Reading Textual Data Enrichment Preprocessing Transformation Tagging (POS)Creation ofdocument vectorsFiltering, Stemming, ...Filtering based onoccurrencesRead documents fromclustering data setWITHOUT TITLESRead documents fromclustering data setWITH TITLES Enrichment Transformation Preprocessing I Preprocessing II Table Reader Table Reader Exercise: Clustering- What groups of documents are in the data?- Compute pairwise cosine distances- Apply hierarchical clustering- View dendrogram to find out the number of clusters (k)- Assign k clusters- Apply k-Medoids with k as number of clusters- Select documents of one cluster in dendrogram, hilite them, and inspect data in a table view Reading Textual Data Enrichment Preprocessing Transformation Tagging (POS)Creation ofdocument vectorsFiltering, Stemming, ...Filtering based onoccurrencesRead documents fromclustering data setWITHOUT TITLESRead documents fromclustering data setWITH TITLES Enrichment Transformation Preprocessing I Preprocessing II Table Reader Table Reader

Nodes

Extensions

Links