KNIME_project_Industrial_Wastewater_Dataset_CCA23004_AHMED

Loads the wastewater dataset from a CSV file and automatically detects column types.
CSV Reader
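As a rough illustration of what type detection on a CSV file involves, the sketch below reads a small table and picks the narrowest type (integer, floating point, or string) that fits every non-empty cell of a column. The column names and values are hypothetical and are not taken from the wastewater dataset; the actual CSV Reader node may apply different rules.

```python
import csv
import io


def infer_type(values):
    """Return int, float, or str: the narrowest type that parses every non-empty cell."""
    non_empty = [v for v in values if v != ""]
    for caster in (int, float):
        try:
            for v in non_empty:
                caster(v)
            return caster
        except ValueError:
            continue
    return str


def read_typed_csv(handle):
    """Read a CSV into a dict of columns, converting cells and mapping empty cells to None."""
    rows = list(csv.DictReader(handle))
    if not rows:
        return {}
    typed = {}
    for name in rows[0]:
        raw = [(row[name] or "") for row in rows]
        caster = infer_type(raw)
        typed[name] = [caster(v) if v != "" else None for v in raw]
    return typed


# Hypothetical sample standing in for the real file.
sample_csv = io.StringIO(
    "sample_id,pH,turbidity,site\n"
    "1,7.2,12.5,A\n"
    "2,,30.1,B\n"
    "3,6.9,,A\n"
)
table = read_typed_csv(sample_csv)
print(table["pH"])  # [7.2, None, 6.9]
```

Empty cells are kept as `None` here so that a later step can decide how to treat missing values.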
This node merges the original pH column with the pH missing column to create a single unified pH variable for analysis.
Column Merger
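The merge described above can be sketched as a row-wise fallback: take the value from the primary column and use the secondary column only where the primary value is missing. This is a minimal sketch; the function name `merge_columns` and the sample values are hypothetical, and missing cells are represented as `None`.

```python
from typing import List, Optional


def merge_columns(
    primary: List[Optional[float]], secondary: List[Optional[float]]
) -> List[Optional[float]]:
    """Take the primary value unless it is missing (None), else fall back to the secondary."""
    if len(primary) != len(secondary):
        raise ValueError("columns must have the same length")
    return [p if p is not None else s for p, s in zip(primary, secondary)]


# Hypothetical sample values, not taken from the dataset.
ph = [7.1, None, 6.8, None]
ph_missing = [None, 7.4, 6.9, None]
ph_final = merge_columns(ph, ph_missing)
print(ph_final)  # [7.1, 7.4, 6.8, None]
```

A row that is missing in both columns stays missing, so a later imputation step is still needed.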
This node merges the turbidity column with the turbidity missing column to create a single unified turbidity variable for analysis.
Column Merger
This node removes irrelevant and non-numerical columns, keeping only the variables required for data analysis and clustering.
Column Filter
This node handles missing values by replacing them with the mean value of each respective numerical variable.
Missing Value
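Mean imputation can be illustrated in a few lines: for each numerical column, compute the mean of the observed values and substitute it wherever a value is missing. The sketch below assumes the table is a dict of columns with `None` marking missing cells; the function name `impute_mean` and the sample data are hypothetical.

```python
from statistics import mean
from typing import Dict, List, Optional


def impute_mean(table: Dict[str, List[Optional[float]]]) -> Dict[str, List[float]]:
    """Replace None in each column with the mean of that column's observed values."""
    result = {}
    for name, values in table.items():
        observed = [v for v in values if v is not None]
        if not observed:
            raise ValueError(f"column {name!r} has no observed values to average")
        fill = mean(observed)
        result[name] = [fill if v is None else v for v in values]
    return result


# Hypothetical sample values, not taken from the dataset.
sample = {"pH": [7.0, None, 8.0], "turbidity": [10.0, 20.0, None]}
imputed = impute_mean(sample)
print(imputed["pH"])         # [7.0, 7.5, 8.0]
print(imputed["turbidity"])  # [10.0, 20.0, 15.0]
```

Each column is filled with its own mean, so the column averages are unchanged by the imputation.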
This node visualizes the distribution of a selected numerical variable to support exploratory data analysis (EDA).
Histogram
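The counts behind a histogram come from splitting the value range into equal-width bins and tallying how many values fall into each. The sketch below shows that binning step only, without any plotting; the bin count, function name, and sample values are assumptions for illustration and do not reflect the node's configuration.

```python
from typing import List, Tuple


def histogram(values: List[float], bins: int = 5) -> List[Tuple[float, float, int]]:
    """Return (bin_start, bin_end, count) for equal-width bins over the value range."""
    if not values:
        raise ValueError("values must not be empty")
    lo, hi = min(values), max(values)
    if lo == hi:
        # All values identical: a single bin holds everything.
        return [(lo, hi, len(values))]
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value belongs to the last bin rather than a bin of its own.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return [(lo + i * width, lo + (i + 1) * width, counts[i]) for i in range(bins)]


# Hypothetical pH readings, not taken from the dataset.
readings = [6.0, 6.5, 7.0, 7.5, 8.0]
for start, end, count in histogram(readings, bins=2):
    print(f"{start:.1f} to {end:.1f}: {count}")
# 6.0 to 7.0: 2
# 7.0 to 8.0: 3
```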
This component contains a scatter plot visualization colored by cluster, summarizing the results of the k-means clustering model.
Component
This node applies Z-score normalization to standardize numerical variables before clustering analysis.
Normalizer
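Z-score normalization rescales each variable to zero mean and unit standard deviation via z = (x - mean) / std, so that variables measured on different scales contribute comparably to distance-based clustering. The sketch below is a minimal version of that formula. It assumes the population standard deviation; the Normalizer node may use the sample standard deviation instead, which would change the values slightly.

```python
from statistics import mean, pstdev
from typing import List


def z_score(values: List[float]) -> List[float]:
    """Standardize values to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)  # assumption: population standard deviation
    if sigma == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


# Hypothetical values, not taken from the dataset.
z = z_score([2.0, 4.0, 6.0])
print([round(v, 3) for v in z])  # [-1.225, 0.0, 1.225]
```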
This node performs k-means clustering with k = 3 to group the wastewater samples based on their similarities.
k-Means
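For readers who want to see the clustering step outside the workflow, the following is a minimal sketch of k-means (Lloyd's algorithm) with k = 3. It assumes the first k rows are used as initial centroids and that distance is Euclidean; the node's actual initialization and settings are not stated in the workflow description, and the sample points are hypothetical standardized values.

```python
from typing import List, Sequence, Tuple

Point = Sequence[float]


def squared_distance(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def k_means(
    points: List[Point], k: int = 3, max_iter: int = 100
) -> Tuple[List[int], List[List[float]]]:
    """Cluster points with Lloyd's algorithm; returns (labels, centroids)."""
    if len(points) < k:
        raise ValueError("need at least k points")
    # Assumption: initialize centroids from the first k rows.
    centroids = [list(p) for p in points[:k]]
    labels = [-1] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical standardized samples forming three well-separated groups.
samples = [
    (-1.0, -1.0), (1.0, 1.0), (-1.0, 1.0),
    (-0.9, -1.1), (1.1, 0.9), (-1.1, 0.9),
]
labels, centroids = k_means(samples, k=3)
print(labels)  # [0, 1, 2, 0, 1, 2]
```

The result of k-means depends on the initial centroids, so a different initialization can produce different cluster labels or, on less clearly separated data, a different grouping.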
