20210512 Pikairos Simple question about Term Frequency

I have a table with a single column of text called Source. I have another table with a text column called Terms and a text column called Category.

I want to create a table that shows each Term, its Category, and the number of times that Term exists in the Source table.

There seem to be a lot of ways to approach this. Any ideas of best practice?
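One point worth settling first is what "number of times" means: the total occurrences of a Term as a word, or the number of Source rows that contain it. Below is a minimal Python sketch of both readings, outside KNIME. The sample rows, the `tokenize` rule (lowercase alphabetic words, exact match only) and the function names are assumptions for illustration, not part of the original tables.

```python
import re
from collections import Counter

# Hypothetical sample data standing in for the Source and Terms/Category tables.
source = [
    "The cat sat on the mat with another cat",
    "A dog chased the cat",
    "Dogs and cats",
]
terms = [("cat", "animal"), ("dog", "animal"), ("mat", "object")]


def tokenize(text):
    """Lowercase the text and return its alphabetic words (an assumed rule)."""
    return re.findall(r"[a-z]+", text.lower())


def count_terms(source_rows, term_rows):
    """Return (term, category, total occurrences, rows containing the term)."""
    total = Counter()      # every occurrence of every word
    rows_with = Counter()  # each word counted at most once per row
    for row in source_rows:
        words = tokenize(row)
        total.update(words)
        rows_with.update(set(words))
    return [(t, c, total[t.lower()], rows_with[t.lower()]) for t, c in term_rows]


result = count_terms(source, terms)
for line in result:
    print(line)
```

With exact word matching, "cats" and "Dogs" in the third row are not counted for "cat" and "dog"; whether they should be is a design choice (stemming or substring matching) that the question leaves open.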

Workflow nodes and their annotations, in order:

Table Creator: List of HTTP addresses
HTTP Retriever: Retrieve information from HTTP addresses
Feed Parser: Parse the information you need to process
Table Creator: Your dictionary
Markup Tag Filter: Remove the Markup from HTML information
Cell Splitter: Split the document into words and store the result as a collection of words
Ungroup: Ungroup the collections per document
Column Splitter: Keep only the collection of words to process it
RowID: Set the Index in the RowID for later use
GroupBy: Group the words per document and word
Constant Value Column: Create a column here to store the Frequencies
Joiner: Join with your dictionary
Interactive Table (local): The Frequencies per document by Classical Nodes
Strings To Document: Convert your information into a document format
Bag Of Words Creator: Work out Words per document
TF: Work out Words (Terms) and their frequencies per document
Joiner: Join the initial documents with their counts of unique words
String Manipulation: Create an Index of documents
Term To String: Convert Terms into String type
Joiner: Join with your dictionary
Joiner: Join the initial documents with their counts of unique words
Column Rename: Dummy node just for long connection
Joiner: Results comparison between methods (Classical against Bag of Words)
String Manipulation: Concatenate here the Title & document to process the content of both at the same time
Interactive Table (local): The Frequencies per document based on BAG of WORDS CREATOR
Interactive Table (local): Results comparison between methods
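The two routes in the workflow (classical nodes against Bag of Words) can be pictured with a small Python sketch. This is only an illustration of the idea under stated assumptions: the documents, the dictionary, the tokenizer and the function names are hypothetical, counts are absolute, and nothing here reproduces the internals of the KNIME nodes.

```python
import re
from collections import Counter, defaultdict

# Hypothetical documents (index -> text) and dictionary (term -> category).
documents = {
    "doc0": "KNIME nodes and more KNIME nodes",
    "doc1": "A workflow is built from nodes",
}
dictionary = {"knime": "product", "nodes": "concept", "workflow": "concept"}


def split_words(text):
    """Assumed tokenizer: lowercase runs of letters."""
    return re.findall(r"[a-z]+", text.lower())


def classical(docs, terms):
    """One (document, word) row per word, group and count, then join."""
    rows = [(doc_id, w) for doc_id, text in docs.items() for w in split_words(text)]
    grouped = defaultdict(int)
    for key in rows:
        grouped[key] += 1  # each row contributes a constant value of 1
    return sorted((d, w, terms[w], n) for (d, w), n in grouped.items() if w in terms)


def bag_of_words(docs, terms):
    """One bag (unique words with counts) per document, then join."""
    out = []
    for doc_id, text in docs.items():
        bag = Counter(split_words(text))
        out.extend((doc_id, w, terms[w], n) for w, n in bag.items() if w in terms)
    return sorted(out)


classical_result = classical(documents, dictionary)
bow_result = bag_of_words(documents, dictionary)
print(classical_result)
print(classical_result == bow_result)
```

Both functions return rows of (document, term, category, count), so comparing the two lists plays the role of the final comparison table in the workflow.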
