For feature selection/ elemination purposes, this component calculates IV(Information Values) for optimal categories of variables. This component also calculates WOE (Weight of Evidence) of categorized variables.
Step By Step Guide:
1- Initially, to run this component one should install Python Integration extensions.
2- For obtain a better Python node performance, pyarrow library should be installed.
3- Having installed pyarrow library, select serialization library as Apache Arrow under preferences. This option makes a huge difference as performance compared to Flatbuffers Column Serialization.
4- Then, specify desired IV threshold, target (label) and its bad category from dialog window. Target should be a string form to run this component.
To use this component in KNIME, download it from the below URL and open it in KNIME:
Download ComponentDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.