Icon

How to detect and deal with Outliers

<p><strong>How to detect and deal with Outliers</strong></p><p>Starts with the <strong>Iris dataset</strong>, then applies two main strategies for handling outliers: <strong>IQR-based outlier detection</strong> (to either replace or remove outliers) and <strong>Z-score normalization</strong> (to standardize numeric columns). After normalization, it identifies rows where the absolute z-score is greater than 2, marking them as outliers, and finally <strong>filters out these outlier rows</strong> to keep only the typical data points for further analysis.</p>

How to detect and deal with Outliers

Starts with the Iris dataset, then applies two main strategies for handling outliers: IQR-based outlier detection (to either replace or remove outliers) and Z-score normalization (to standardize numeric columns). After normalization, it identifies rows where the absolute z-score is greater than 2, marking them as outliers, and finally filters out these outlier rows to keep only the typical data points for further analysis.

Outlier replacement strategy:
Replace outliers with missing values

Outlier replacement strategy: Remove outlier rows

Outlier replacement strategy: Remove non-outlier rows

How to detect and deal with Outliers

Starts with the Iris dataset, then applies two main strategies for handling outliers: IQR-based outlier detection (to either replace or remove outliers) and Z-score normalization (to standardize numeric columns). After normalization, it identifies rows where the absolute z-score is greater than 2, marking them as outliers, and finally filters out these outlier rows to keep only the typical data points for further analysis.

Iris Dataset
Example Data Reader
IQR based outlier detection
Numeric Outliers
IQR based outlier detection
Numeric Outliers
IQR based outlier detection
Numeric Outliers
Z-score normalization
Normalizer
|z| > 2
Expression
filter outliers
Row Filter

Nodes

Extensions

Links