This component computes the Kullback-Leibler divergence to output a measure of dissimilarity between two distribution of the same variable, which come from two different datasets.
It could be very useful to identify shift inside data which can potentially lead to a model drift, downgrading the predictive power.
Column names and types must be identical between the two tables.
Continous predictors are binned into classes to make computation of the metric easier.
A value close to 0 means that the two variables have the same distribution.
As the value increases, the dissimilarity is higher.
Columns must have the same name and the same data type.
To use this component in KNIME, download it from the below URL and open it in KNIME:
Download ComponentDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!