Icon

SEM7_​Data_​Mining

Original Data set from Fontys' data base

External data from the weather forecast

Joining the internal and external data

Data Exploration

Data Preparation

Categorical models

Numeric models

Removes duplicate rows
Duplicate Row Filter
Date&Time Part Extractor
The amound of passes is almost two times lower than the amoun of fails therefore a split of 70/30 In this case I will do 60/40
Table Partitioner
Random Forest Predictor
Remove rows where the final grade is missing
Row Filter
Missing values are replaced based on the following criteria:- String -> Unknown - Integer -> most frequent value
Missing Value
Random Forest Learner (Regression)
Excel Reader
Scorer
Linear Correlation
Joiner
Random Forest Predictor (Regression)
Logistic Regression Learner
Excel Reader
Scatter Plot
Statistics
Statistics View
Logistic Regression Predictor
Random Forest Learner
No outliers
Box Plot
Scorer
No outliers
Box Plot
Decision Tree Predictor
Scorer
Histogram
Decision Tree Learner
Simple Regression Tree Learner
Numeric Scorer
Simple Regression Tree Predictor
Numeric Scorer
Numeric Scorer
Decision Tree Predictor
Decision Tree Learner
Scorer
Decision Tree Learner
Decision Tree Predictor
Table Partitioner
Decision Tree Learner
Column Filter
Row Filter
Linear Regression Learner
Regression Predictor
Decision Tree Predictor
Scorer
Logistic Regression Predictor
Scorer
Logistic Regression Learner
Scorer

Nodes

Extensions

Links