This node helps discover multiple categorical columns in the dataset in one go among numeric columns. Many a time data is annonymized with a large number of numerical columns, some of which are, in fact, nominal. In this component, you specify the maximum number of distinct values for a numeric column. If distinct values are equal to or less than that specified, the column would be transformed to string column else not. The outputs of component are possible categorical columns and the rest of dataframe
The component uses 'Python Script' node to perform this function. It needs 'pandas' library.
To use this component in KNIME, download it from the below URL and open it in KNIME:
Download ComponentDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!