
Task 1

Workflow sections (annotation titles):

  Data Visualisation
  Data Sizing (used twice)
  Detecting Missing Data
  Detecting Duplicate Rows
  Detecting Outliers
  Handling Outliers
  Feature Engineering
  Feature Selection
  Categorical Data Encoding
  Feature Scaling
  Visualisation Before Clustering
  Dataset with Normalisation Applied
  Dataset with No Normalisation Applied
  Saving the Datasets
  Saving Dataset with Normalisation Applied
  Saving Dataset with No Normalisation Applied

Workflow steps, in the order listed on the page (annotation, then node type in parentheses):

  1. Read customer.csv (File Reader)
  2. Visualising via pie charts (Interactive Pie chart, legacy)
  3. Visualising via histograms (Interactive Histogram, legacy)
  4. Get statistical summary (Statistics)
  5. Outliers are values beyond 1.5x the IQR. If a row contains an outlier, the row is dropped. (Numeric Outliers)
  6. View dataset after columns renamed (Interactive Table, legacy)
  7. Reset row ID (RowID)
  8. Get statistical summary (Statistics)
  9. Select number of missing data (Column Filter)
  10. No missing data for each column (Interactive Table, legacy)
  11. Renaming columns for clarity (Column Rename, deprecated)
  12. Remove duplicated rows (Duplicate Row Filter)
  13. Get number of columns and rows (Extract Table Dimension)
  14. Combine the number of rows and columns for both tables together (Concatenate)
  15. No duplicated rows (Interactive Table, legacy)
  16. Keep duplicated rows (Duplicate Row Filter)
  17. Get number of columns and rows (Extract Table Dimension)
  18. Display box plot statistics (Interactive Table, legacy)
  19. Visualise outliers (Box Plot, legacy)
  20. Get number of columns and rows (Extract Table Dimension)
  21. After outliers removal: 10241 rows and 12 columns (Interactive Table, legacy)
  22. Get number of columns and rows (Extract Table Dimension)
  23. 10241 rows and 12 columns (Interactive Table, legacy)
  24. View the dataset after categorical data encoding (Interactive Table, legacy)
  25. Display number of outliers removed for each column (Interactive Table, legacy)
  26. View dataset after outliers were removed (Interactive Table, legacy)
  27. Convert all the string columns into numerical (Category to Number)
  28. View the dataset after feature selection (Interactive Table, legacy)
  29. Remove the columns: graduated and family size (Column Filter)
  30. Get number of columns and rows (Extract Table Dimension)
  31. After feature selection: 10238 rows and 10 columns (Interactive Table, legacy)
  32. Node 63 (Normalizer)
  33. Apply dimensionality reduction (PCA, two instances)
  34. Assign colours to Subscribed (Color Manager, two instances)
  35. Place PCA0 and PCA1 as first two columns (Column Resorter, two instances)
  36. 2D plot of PCA0 and PCA1 (Scatter Plot, legacy, two instances)
  37. View the dataset after normalisation (Interactive Table, legacy)
  38. Node 73 (CSV Writer)
  39. Node 74 (CSV Writer)
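The outlier rule noted in the workflow (values beyond 1.5x the IQR, with the affected rows dropped) can be illustrated outside KNIME. The following is a minimal Python sketch, not the Numeric Outliers node's implementation: the quartile method (`statistics.quantiles` with `method="inclusive"`), the helper names, the column names and the sample rows are all assumptions made for illustration, and KNIME's quartile estimate may give slightly different fences.

```python
from statistics import quantiles


def iqr_bounds(values, k=1.5):
    """Return (lower, upper) fences: Q1 - k*IQR and Q3 + k*IQR."""
    # Assumption: inclusive quartiles; other quartile definitions shift the fences slightly.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def drop_outlier_rows(rows, columns, k=1.5):
    """Keep only rows whose value lies inside the fences for every listed column."""
    bounds = {c: iqr_bounds([r[c] for r in rows], k) for c in columns}
    return [
        r for r in rows
        if all(bounds[c][0] <= r[c] <= bounds[c][1] for c in columns)
    ]


# Hypothetical sample rows; these are not taken from customer.csv.
rows = [
    {"age": 25, "work_experience": 1},
    {"age": 31, "work_experience": 3},
    {"age": 38, "work_experience": 4},
    {"age": 42, "work_experience": 2},
    {"age": 47, "work_experience": 5},
    {"age": 120, "work_experience": 3},
]

cleaned = drop_outlier_rows(rows, ["age", "work_experience"])
print(len(rows), "rows before,", len(cleaned), "rows after")
```

In this sample the row with age 120 falls above the upper fence for `age` and is dropped, while every `work_experience` value stays inside its fences. The fences are computed once on the full table before any row is removed, so dropping a row for one column does not change the fences for another.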
