For each selected column (parameter), a reference population is used to define n bins based on the quantiles of its data. All data is then grouped by the aggregation label (e.g. well), and the values of each group are assigned to these bins. The percentage of a group's data falling into each bin is calculated and used to estimate a z-score. The result contains a z-score, a percentage, and an absolute count for each bin of each aggregation group and each parameter. This helps to detect minor distribution changes that would not be caught by a mean or median per aggregation group of the object data.
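The procedure can be illustrated with a minimal Python sketch. This is not the node's source code: the function names (`bin_edges`, `bin_counts`, `binning_analysis`) are hypothetical, and the z-score formula is an assumption, since the description does not state how it is computed. Here each group's bin percentage is compared with the mean and standard deviation of that bin's percentages across the reference groups.

```python
from bisect import bisect_left
from collections import defaultdict
from statistics import mean, quantiles, stdev


def bin_edges(reference_values, n_bins):
    """Return the n_bins - 1 interior quantile cut points of the reference data."""
    return quantiles(reference_values, n=n_bins, method="inclusive")


def bin_counts(values, edges):
    """Count values per bin; a value equal to an edge falls into the lower bin."""
    counts = [0] * (len(edges) + 1)
    for v in values:
        counts[bisect_left(edges, v)] += 1
    return counts


def binning_analysis(rows, reference_groups, n_bins):
    """rows: iterable of (group_label, value) pairs for one parameter.

    Returns {(group_label, bin_index): {"count", "percent", "z"}}.
    """
    grouped = defaultdict(list)
    for label, value in rows:
        grouped[label].append(value)

    # Bins are defined only from the pooled reference population.
    reference_values = [v for g in reference_groups for v in grouped[g]]
    edges = bin_edges(reference_values, n_bins)

    counts = {g: bin_counts(vals, edges) for g, vals in grouped.items()}
    percents = {
        g: [100.0 * c / len(grouped[g]) for c in cs] for g, cs in counts.items()
    }

    result = {}
    for b in range(n_bins):
        # Assumption: z-score relative to the spread across reference groups.
        ref = [percents[g][b] for g in reference_groups]
        mu = mean(ref)
        sd = stdev(ref) if len(ref) > 1 else 0.0
        for g in grouped:
            z = (percents[g][b] - mu) / sd if sd > 0 else float("nan")
            result[(g, b)] = {
                "count": counts[g][b],
                "percent": percents[g][b],
                "z": z,
            }
    return result


# Example: wells A and B form the reference population, well C is a sample.
rows = (
    [("A", v) for v in (1, 2, 3, 4)]
    + [("B", v) for v in (1, 2, 2, 3)]
    + [("C", v) for v in (4, 4, 4, 4)]
)
result = binning_analysis(rows, reference_groups=["A", "B"], n_bins=2)
```

In this toy data set all values of well C land in the upper bin, so its percentage deviates strongly from the reference wells even though a mean alone would not say how the distribution changed.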
To use this node in KNIME, install the extension KNIME HCS Tools from the below update site following our NodePit Product and Node Installation Guide: