Icon

KNIME_​Final_​Project2

<p>Predictive Modeling</p>

Predictive Modeling

Business Understanding

Data Understanding

Data Exploration


Data Visualization

Histograms - distribution and dispersion of each numerical variable

Box Plots - variability and potential outliers within the dayaset

Scatter Plots - potential relationships between customer behaviour variables and monetary value

Data understanding & visualization

Data Preparation

Column Filter - feature selection

Duplicate Row Filter - data cleaning

Missing value - data cleaning

Normalizer - data transformation

Clustering: K-Means

FINAL CLUSTERING SOLUTION

Clustering: Hierarchical

Data Preparation

Missing values - data cleaning

Outliers - anomalies handling

Feature engineering - creating value

Normalizer - feature scaling

Feature selection - reducing noise

Predictive Modeling

Foundational Aspects of Information

Spring Semester 2025-2026

NOVA Information Management School

Bianca Ribeiro nº 20251345

Maria Mestre nº 20251347

Sara Cerdeira nº20251341

Sofia Teles nº20251322

2000 customers
Excel Reader
Statistics
8000 customers
Excel Reader
Statistics
NO custid
Column Filter
RFM
Data Preparation
Denormalizer
Hierarchical Clustering
RFM
Data Preparation
Box Plots
Scatter Plots
Histograms
Normalizer
Row Splitter
k= 5
k-Means
Number to String
Decision Tree Learner
k= 3
k-Means
Scorer
Histograms
k= 4
k-Means
Decision Tree Predictor

Nodes

Extensions

Links