Filtering Redundant Folder References

You are reorganizing a data warehouse at your company, working with a filesystem that automatically creates parent folders when you give it a reference to a child folder. For example, if you ask the filesystem to create “folder1/folder2” and neither folder1 nor folder2 exists, it creates both, with folder2 inside folder1, without raising an error. Given a list of folders, you want to keep only the longest unique child folders. For efficiency, references to parent folders that would be generated anyway should be filtered out.

Here's an example of an initial list of folders:

- folder1/folder3
- folder1/folder3/folder22
- folder1/folder3/folder22/folder47

After executing your workflow, the list above should contain only the reference to folder1/folder3/folder22/folder47.
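Outside of KNIME, the same filtering rule can be sketched in a few lines of Python. This is a minimal illustration, not the workflow's implementation: the function name `filter_redundant_folders` is hypothetical, and it assumes paths use "/" as the separator and compare as exact strings.

```python
def filter_redundant_folders(folders):
    """Keep only folders that are not a parent of another folder in the list.

    Assumption: paths are "/"-separated and compared as exact strings.
    """
    # Deduplicate while preserving the input order.
    paths = []
    seen = set()
    for folder in folders:
        path = folder.strip("/")
        if path and path not in seen:
            seen.add(path)
            paths.append(path)

    # Collect every proper ancestor of every path,
    # e.g. "a/b/c" contributes "a" and "a/b".
    ancestors = set()
    for path in paths:
        parts = path.split("/")
        for depth in range(1, len(parts)):
            ancestors.add("/".join(parts[:depth]))

    # A folder is redundant if creating some other folder creates it too.
    return [path for path in paths if path not in ancestors]


example = [
    "folder1/folder3",
    "folder1/folder3/folder22",
    "folder1/folder3/folder22/folder47",
]
result = filter_redundant_folders(example)
```

Comparing whole path segments rather than raw string prefixes avoids treating "folder1" as a parent of "folder10/x".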

URL: Just KNIME It! https://www.knime.com/just-knime-it

Workflow diagram (node labels in parentheses): Table Reader (folders) → String Manipulation (count subfolders) → String Manipulation (check folder string) → Cross Joiner (pairs) → Rule-based Row Filter (subfolder count check) → String Manipulation (has subfolder boolean) → GroupBy (max has subfolder) → Row Filter (max has subfolder is false)
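The node names in the workflow (a Cross Joiner, a per-pair check, a GroupBy taking a maximum, and a final Row Filter) suggest a pairwise comparison of every folder against every other folder. The sketch below is one possible reading of that idea in Python. The expressions configured inside the KNIME nodes are not shown on this page, so the prefix test and the function name `filter_by_pairwise_check` are assumptions.

```python
from itertools import product


def filter_by_pairwise_check(folders):
    """Pairwise variant: flag each folder that has a subfolder in the list."""
    # Deduplicate while preserving order.
    unique = list(dict.fromkeys(folders))

    # Plays the role of the GroupBy "max" per folder: the flag becomes True
    # as soon as any pair shows the folder has a subfolder.
    has_subfolder = {folder: False for folder in unique}

    # Cross join: every folder paired with every folder.
    for parent, child in product(unique, unique):
        # Assumed check: child lies inside parent if it starts with "parent/".
        if child.startswith(parent + "/"):
            has_subfolder[parent] = True

    # Final filter: keep rows where the flag is False.
    return [folder for folder in unique if not has_subfolder[folder]]


pairwise_result = filter_by_pairwise_check([
    "folder1/folder3",
    "folder1/folder3/folder22",
    "folder1/folder3/folder22/folder47",
])
```

This version compares all pairs, so its cost grows quadratically with the number of folders, which mirrors what a cross join does.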
