This workflow is based on the adult.csv data set. Try it out to:
1. Remove duplicates
- keep the first or last appearance of the duplicates
- keep, from each group of duplicates, the row with the maximum or minimum value in a chosen column
2. Flag duplicates
- add a column that flags rows as unique, duplicate or chosen
- add a column that displays the RowID of the (representative) chosen row for each duplicate
- add both of the flag columns described above
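For readers who want to compare the two modes outside KNIME, the same ideas can be sketched in pandas. This is only an illustrative analogue, not how the workflow or its nodes are implemented. The tiny table stands in for adult.csv, and the column names, the choice of key columns, and the names duplicate_status and chosen_row_id are all assumptions made for the example. In this sketch, unique rows get their own RowID as chosen_row_id, which may differ from what KNIME shows.

```python
import pandas as pd

# Hypothetical miniature stand-in for adult.csv (values and column names are assumptions).
df = pd.DataFrame(
    {
        "age": [39, 39, 50, 39, 28],
        "education": ["Bachelors", "Bachelors", "Masters", "Bachelors", "HS-grad"],
        "hours_per_week": [40, 45, 13, 38, 40],
    },
    index=["Row0", "Row1", "Row2", "Row3", "Row4"],
)

# Columns that decide whether two rows count as duplicates (an assumption).
key_cols = ["age", "education"]

# 1. Remove duplicates
# Keep the first or the last appearance within each duplicate group.
keep_first = df.drop_duplicates(subset=key_cols, keep="first")
keep_last = df.drop_duplicates(subset=key_cols, keep="last")

# Keep the row with the maximum or minimum hours_per_week per duplicate group.
groups = df.groupby(key_cols)["hours_per_week"]
keep_max = df.loc[groups.idxmax()].sort_index()
keep_min = df.loc[groups.idxmin()].sort_index()

# 2. Flag duplicates
in_group = df.duplicated(subset=key_cols, keep=False)     # every member of a duplicate group
not_first = df.duplicated(subset=key_cols, keep="first")  # every member except the first one

flagged = df.copy()
flagged["duplicate_status"] = "unique"
flagged.loc[in_group & ~not_first, "duplicate_status"] = "chosen"
flagged.loc[not_first, "duplicate_status"] = "duplicate"

# RowID of the chosen (here: first) row of each group, attached to every row.
with_ids = df.rename_axis("row_id").reset_index()
chosen_ids = with_ids.groupby(key_cols)["row_id"].transform("first")
flagged["chosen_row_id"] = chosen_ids.to_numpy()

print(flagged)
```

Here the first row of each group is treated as the chosen one. To flag against the maximum or minimum row instead, the index of keep_max or keep_min could be used to decide which rows are marked as chosen.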
To use this workflow, download it and open it in KNIME.