Icon

08_​Filtering_​Duplicates

Duplicate Row Filter
This workflow shows how to remove or flag duplicates in the data set and the different options todefine which row to keep using the Duplicate Row Filter node.For more detailed information see the workflow metadata. Find it here: View -> Description Simply removingduplicates from dataset Flagging Duplicates Read the adult.csvdataAdd column flag if row isunique, duplicate orchosenAdd column to identify the ROWIDof chosen row to each duplicate rowkeep the first appearanceof the duplicateskeep the last appearanceof the duplicateskeep the rowwith minimum agekeep the rowwith maximum ageAdd both flag types File Reader DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter This workflow shows how to remove or flag duplicates in the data set and the different options todefine which row to keep using the Duplicate Row Filter node.For more detailed information see the workflow metadata. Find it here: View -> Description Simply removingduplicates from dataset Flagging Duplicates Read the adult.csvdataAdd column flag if row isunique, duplicate orchosenAdd column to identify the ROWIDof chosen row to each duplicate rowkeep the first appearanceof the duplicateskeep the last appearanceof the duplicateskeep the rowwith minimum agekeep the rowwith maximum ageAdd both flag types File Reader DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter DuplicateRow Filter

Nodes

Extensions

Links