Icon

4. Clustering and Regression

KBL: Clustering and Regression
Workflow: Clustering and Regression This workflow shows a few more data analysisalgorithms:- a linear regression to predict hte number of hours/week given all other attribute values- a k-Means clustering to group together the mostsimilar data rows. Remember that k-Means, like Neural Networks,needs normalized data.The Statistics node mainly produces generalstatstical measures about one data column,including a roughly drawn histogram. training_set.csv filewritten by workflowPrepare Dataassign new datato clusterstest_set.csv filewritten by workflowPrepare Datageneral statspredict hours/weekon all remaining attrsMin-max normalizationin [0,1]3 clusterswith Euclideandistancerun predictions from model CSV Reader Cluster Assigner CSV Reader Statistics Linear RegressionLearner Normalizer k-Means RegressionPredictor Workflow: Clustering and Regression This workflow shows a few more data analysisalgorithms:- a linear regression to predict hte number of hours/week given all other attribute values- a k-Means clustering to group together the mostsimilar data rows. Remember that k-Means, like Neural Networks,needs normalized data.The Statistics node mainly produces generalstatstical measures about one data column,including a roughly drawn histogram. training_set.csv filewritten by workflowPrepare Dataassign new datato clusterstest_set.csv filewritten by workflowPrepare Datageneral statspredict hours/weekon all remaining attrsMin-max normalizationin [0,1]3 clusterswith Euclideandistancerun predictions from modelCSV Reader Cluster Assigner CSV Reader Statistics Linear RegressionLearner Normalizer k-Means RegressionPredictor

Nodes

Extensions

Links