Discover relationship between two categorical features
============================================
Draws mosaic plot between two categorical variables. Input variable values (labels) are abbreviated to four letters in the graph. Data types of the two columns must be of string type.
Output plot indicates both the p-value of chi-sqaure test and Pearson Residuals. Null hypothesis is that there is no relationship between the features and both are independent. Value of p less than 0.05, as a rule of thumb, indicates relationship .
Pearson residuals = (obs - exp) / sqrt(exp)
Pearson residuals are calculated for each cell. As a rule of thumb, a cell with pearson residual of 3 or greater contributes more to relationship. Intensity of colour in each cell also indicates the extent observed values deviate from expected values.
Note that mosaic() splits the data in the order in which the variables are provided: first on Ist categorical feature then on IInd categorical feature.
For more about mosaic plots please see Wikipedia: https://en.wikipedia.org/wiki/Mosaic_plot . For more about pearson's residuals please see: https://www.statology.org/pearson-residuals/ . For a more thorough overview, please see: https://www.datavis.ca/courses/VCD/vcd-tutorial.pdf. The component needs R's vcd, package.
To use this component in KNIME, download it from the below URL and open it in KNIME:
Download ComponentDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.