Icon

Covid

Data Reading

Read the adult data set file. There is one row for each person, plus demographic info and the income group. The file is located in TheData/Basics/.

Data Partitioning

Create two separate partitions from original data set: training set (80%) and test set (20%). The test set deliberately consists of unseen data that will not be used for training.

Descriptive Statistics

Calculate the statistical properties of the data set attributes.

Train a Model

This node builds a decision tree. Other Learner nodes train other models. Most Learner nodes output a PMML model (blue square output port).

Apply the Model

Predictor nodes apply a specific model to a data set and append the model predictions.

Visualize

Create interactive scatter plot.

Score the Model

Compute a confusion matrix between real and predicted class values and calculate the related accuracy measures.

Apply decision tree model to test set
Decision Tree Predictor
Random drawing 80% upper port 20% lower port
Table Partitioner
Train to predict class "income"
Decision Tree Learner
Stats and exploratory histograms in View
Statistics
Compute model accuracy
Scorer
Age vs. number-hours
Scatter Plot
CSV Reader

Nodes

Extensions

Links