Text clustering of Wikipedia articles II


This workflow performs text clustering of Wikipedia articles. Twelve Wikipedia articles, three each on the subjects of Philosophy, Religion, Law and Quantum Mechanics, were randomly selected, manually copied from the Internet and saved as twelve text files (*.txt) in one folder. The files were then read and text-processed, and finally hierarchical clustering was performed. The clustering is perfect, although the sample is only 12 files: at the lowest level of the dendrogram, the articles on each subject cluster together first. Any distance measure other than 'cosine' reduces accuracy drastically.
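The steps described above (text processing, bag of words, cosine distance, hierarchical clustering) can be sketched outside KNIME in plain Python. This is only an illustrative sketch, not the workflow itself. The function names, the four toy sentences standing in for the article files, and the choice of average linkage are all assumptions made for the example; the description does not say which linkage the workflow uses.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text, keep alphabetic tokens only, and count terms."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def agglomerate(docs, n_clusters):
    """Bottom-up clustering with average linkage (an assumed linkage).

    Returns the final clusters and the list of merges in the order they
    happened, which is the information a dendrogram displays.
    """
    names = list(docs)
    bows = {name: bag_of_words(docs[name]) for name in names}
    dist = {(p, q): cosine_distance(bows[p], bows[q])
            for p in names for q in names}
    clusters = [[name] for name in names]
    merges = []
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                pairs = [(p, q) for p in clusters[i] for q in clusters[j]]
                d = sum(dist[pq] for pq in pairs) / len(pairs)
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((sorted(clusters[i]), sorted(clusters[j]), d))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters], merges


# Made-up placeholder sentences, NOT the Wikipedia articles from the workflow.
toy_docs = {
    "law_1": "The court applied the statute and the judge ruled on the contract.",
    "law_2": "A judge in the court interpreted the statute governing the contract.",
    "qm_1": "The electron wave function collapses when the particle is measured.",
    "qm_2": "Measuring a particle changes the wave function of the electron.",
}

clusters, merges = agglomerate(toy_docs, n_clusters=2)
for left, right, d in merges:
    print(left, "+", right, "at cosine distance", round(d, 3))
print(clusters)
```

Cosine distance depends only on the angle between term-count vectors, so a long article and a short one on the same subject can still be close. That may be part of why other distance measures perform worse here, though the description does not give a reason.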

Nodes

Number Filter (×2)
Bag Of Words Creator (×1)
Case Converter (×1)
Category To Class (×1)
Column Expressions (×1)
(5 of 17 nodes listed)
