Icon

GroupBy-alternative with regex join component 2

The magic pattern that turns a single column into multiple columns based ongroupings Lower flow show-cases an alternative solution which replaces theunintuitive complexity that achieves the regex-matching with a single "JoinRegexp_Like component Make regex patternEverything in place,the rows are groupedby unique group idReady for the magic?Top is the lines wherethe marker matched.Bottom is lines where marker didn't matchThe top port will contain NO DUPLICATESThe lower port needs to haveDUPLICATES REMOVEDWe need an idthat we can sort onto keep things in theright orderafter we've done some cleaning up. We alsoneed to spot the duplicates created by the cross joiner weare about to useRemove duplicates(keep first)One big family againOne happy familyagainUse RIGHT JOIN Remove rows containingkeywords. They should only bein the top, and will be here only wherecrossjoiner put them against the "wrong" keywordClose your eyes...Slight of hand...Now open your eyes...Pull the curtain roundWhich of my peoplegot on withyour people?cumulatively renumber the groupmarkersDiscard anyinitial"unloved" rowsat the beginningRemove unwantedcolumnsKeywordsMy people will talkto your people..!Keywords COMPONENTAllows join on regular expressionLeft Join + Actual Joinoutput on same portto bring together full setof rowsClose your eyes...Slight of hand...Discard anyinitial"unloved" rowsat the beginningRemove unwantedcolumnsNow open your eyes...Pull the curtain roundOne happy familyagainFill Grouping IDs downwardsthis does what the moving aggregation would havedone when filling in the group numbers.But this time we used Math Formulato provide the group numbering, so we need todo this part ourselvesCounter Generationplus rename(saves on a node!)Node 12942Group the columns for eachkeyword COMPONENTAllows join on regular expressionrejoin to original categories to group separate rows togetherGroup rows by keywordNode 12956Unpack what was previously grouped uprowsUnpack whatwas previously grouped upcolumnssplit offthe "heading"(keyword)Replace columnnameswith the keywords(nice to have :-) )String Manipulation Column Rename Row Splitter Counter Generation DuplicateRow Filter Concatenate Sorter Joiner GroupBy Transpose Ungroup Column Filter Rule Engine Moving Aggregation Row Filter Column Filter Extract Data Tablefrom Delimited Text Cross Joiner Extract Data Tablefrom Delimited Text Join Regexp_Like Joiner GroupBy Transpose Row Filter Column Filter Ungroup Column Filter Sorter Missing Value Math Formula Transpose Column Aggregator Join Regexp_Like GroupBy Transpose Ungroup Ungroup Column Splitter Insert ColumnHeader The magic pattern that turns a single column into multiple columns based ongroupings Lower flow show-cases an alternative solution which replaces theunintuitive complexity that achieves the regex-matching with a single "JoinRegexp_Like component Make regex patternEverything in place,the rows are groupedby unique group idReady for the magic?Top is the lines wherethe marker matched.Bottom is lines where marker didn't matchThe top port will contain NO DUPLICATESThe lower port needs to haveDUPLICATES REMOVEDWe need an idthat we can sort onto keep things in theright orderafter we've done some cleaning up. We alsoneed to spot the duplicates created by the cross joiner weare about to useRemove duplicates(keep first)One big family againOne happy familyagainUse RIGHT JOIN Remove rows containingkeywords. They should only bein the top, and will be here only wherecrossjoiner put them against the "wrong" keywordClose your eyes...Slight of hand...Now open your eyes...Pull the curtain roundWhich of my peoplegot on withyour people?cumulatively renumber the groupmarkersDiscard anyinitial"unloved" rowsat the beginningRemove unwantedcolumnsKeywordsMy people will talkto your people..!Keywords COMPONENTAllows join on regular expressionLeft Join + Actual Joinoutput on same portto bring together full setof rowsClose your eyes...Slight of hand...Discard anyinitial"unloved" rowsat the beginningRemove unwantedcolumnsNow open your eyes...Pull the curtain roundOne happy familyagainFill Grouping IDs downwardsthis does what the moving aggregation would havedone when filling in the group numbers.But this time we used Math Formulato provide the group numbering, so we need todo this part ourselvesCounter Generationplus rename(saves on a node!)Node 12942Group the columns for eachkeyword COMPONENTAllows join on regular expressionrejoin to original categories to group separate rows togetherGroup rows by keywordNode 12956Unpack what was previously grouped uprowsUnpack whatwas previously grouped upcolumnssplit offthe "heading"(keyword)Replace columnnameswith the keywords(nice to have :-) )String Manipulation Column Rename Row Splitter Counter Generation DuplicateRow Filter Concatenate Sorter Joiner GroupBy Transpose Ungroup Column Filter Rule Engine Moving Aggregation Row Filter Column Filter Extract Data Tablefrom Delimited Text Cross Joiner Extract Data Tablefrom Delimited Text Join Regexp_Like Joiner GroupBy Transpose Row Filter Column Filter Ungroup Column Filter Sorter Missing Value Math Formula Transpose Column Aggregator Join Regexp_Like GroupBy Transpose Ungroup Ungroup Column Splitter Insert ColumnHeader

Nodes

Extensions

Links