For each chosen column (parameter) a reference population is used to estimate n different bins. The bins are based on the quantiles of the data. Then all data is grouped by the aggregation label (e.g. well) and the data of each group is applied to the bins. The percentage of data falling into a certain bin is calculated and used to estimate a z-score. The result contains a z-score, a percentage and an absolute count for each bin of each aggregation group and each parameter. It should help to detect minor distribution changes which would not be caught with a mean or median per aggregation group of the object data.
You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension KNIME HCS Tools from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com, follow @NodePit on Twitter or botsin.space/@nodepit on Mastodon.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.