Icon

Remove copyright text - loop

Adjust GroupBy config to ensure that ithas same number of aggregations asthere are Regex Patterns in the TableCreator Simplified (but doesn't catch as many "multiple ©" substrings Simplifiedwith paladian Regex Extracter node(but also doesn't catch as many "multiple ©" substrings Simplified With loop running tha pattern 2 times to catch "multiple ©" substrings adding a second round of paladianRegex Extracter node, removes the rowsthat are not matching andy regex. Node 1remove copyright message(my alternative)remove copyright message(original with \\ added)but skips some leading letters of following textremove copyright message(alternative endings)List of regex patternsPlace in requiredorder of precedence.**** If a new pattern isadded, adjust the GroupBy node in the upper workflow ****Find values forcurrent regex patternRemoved matched strings from abstractLoop throughregex patternsrepeat eachpatternCreate new column nameRename each columnas MatchString_nPull all the matches back together as each iteration willhave created new rows for each piece of original textTidy upProvide a unique sortable keyBack intooriginal orderMake into a single patternFind values forcurrent regex patternAs a string variableNode 37Tidy upRemoved matched strings from abstractFind values forcurrent regex patternTidy upRemoved matched strings from abstractFind values forcurrent regex patternNode 44Tidy upFind values forcurrent regex patternNode 47Removed matched strings from abstractNode 50Removed matched strings from abstractrename abstract columto re-enter loopTidy upFind values forcurrent regex patterncopy abstract columnto keep abstract_originalStart loopReturn data for second round of data cleaningrename abstract columsto abstractand abstract_cleaned Table Creator String Manipulation String Manipulation String Manipulation Table Creator Regex Split Column Expressions Table Row ToVariable Loop Start Loop End Java Edit Variable Column Rename GroupBy Column Filter Counter Generation Sorter GroupBy Regex Split Table Rowto Variable Column Rename(Regex) Column Filter Column Expressions Regex Extractor Column Filter Column Expressions Regex Extractor Column Rename Column Filter Regex Split Column Rename(Regex) Column Expressions Column Rename(Regex) Column Expressions Column Rename Column Filter Regex Split String Manipulation RecursiveLoop Start Recursive Loop End Column Rename Adjust GroupBy config to ensure that ithas same number of aggregations asthere are Regex Patterns in the TableCreator Simplified (but doesn't catch as many "multiple ©" substrings Simplifiedwith paladian Regex Extracter node(but also doesn't catch as many "multiple ©" substrings Simplified With loop running tha pattern 2 times to catch "multiple ©" substrings adding a second round of paladianRegex Extracter node, removes the rowsthat are not matching andy regex. Node 1remove copyright message(my alternative)remove copyright message(original with \\ added)but skips some leading letters of following textremove copyright message(alternative endings)List of regex patternsPlace in requiredorder of precedence.**** If a new pattern isadded, adjust the GroupBy node in the upper workflow ****Find values forcurrent regex patternRemoved matched strings from abstractLoop throughregex patternsrepeat eachpatternCreate new column nameRename each columnas MatchString_nPull all the matches back together as each iteration willhave created new rows for each piece of original textTidy upProvide a unique sortable keyBack intooriginal orderMake into a single patternFind values forcurrent regex patternAs a string variableNode 37Tidy upRemoved matched strings from abstractFind values forcurrent regex patternTidy upRemoved matched strings from abstractFind values forcurrent regex patternNode 44Tidy upFind values forcurrent regex patternNode 47Removed matched strings from abstractNode 50Removed matched strings from abstractrename abstract columto re-enter loopTidy upFind values forcurrent regex patterncopy abstract columnto keep abstract_originalStart loopReturn data for second round of data cleaningrename abstract columsto abstractand abstract_cleanedTable Creator String Manipulation String Manipulation String Manipulation Table Creator Regex Split Column Expressions Table Row ToVariable Loop Start Loop End Java Edit Variable Column Rename GroupBy Column Filter Counter Generation Sorter GroupBy Regex Split Table Rowto Variable Column Rename(Regex) Column Filter Column Expressions Regex Extractor Column Filter Column Expressions Regex Extractor Column Rename Column Filter Regex Split Column Rename(Regex) Column Expressions Column Rename(Regex) Column Expressions Column Rename Column Filter Regex Split String Manipulation RecursiveLoop Start Recursive Loop End Column Rename

Nodes

Extensions

Links