Just KNIME It _ Challenge 003

Data Gather and Clean-Up: Challenge 3: CDC Cancer Data, Processing and Answers

CSV Reader: CDC_cancer_2017.csv
Excel Reader: population2017.xlsx
String Manipulation: Clean Up State Names
Missing Value: Remove Row * ('Cancer Sites', 'States', 'Sex', 'Sex Code', 'Count')
GroupBy: Count by Cancer Sites
Rank: Rank from Top Cancer Types
Rule Engine: $rank$ <= 5 => "Top 5"; TRUE => "Other"
Interactive Histogram (local): Histogram Chart Showing Top 5 'Cancer Site' by Sex
Color Manager: Female: GREEN; Male: RED
Row Filter: Exclude: All invasive Cancer Sites Combined ...
Joiner: INNER JOIN
Row Filter: Top 5 Cancer Sites: RANKED Female and Male
Rule Engine: Unified 'Male and Female Breast'
Rule Engine: Unified: 'Cancer Sites Code'
GroupBy: Sum Counts by State
Math Formula: Cancer Incidence in cases/100,000
Rank: Ranked Cancer Incidence in cases / 100,000 inhabitants
Row Splitter: Upper: (1) top-5 most frequent 'female'; Lower: (2) top-5 most frequent 'male'
Rule Engine: (3) $rank$ <= 1 => " Highest"; TRUE => "Other"

Note: I excluded the 'All invasive Cancer Sites Combined ...' rows because I interpreted that they may refer to late diagnosis, so they don't necessarily represent a 'Site' itself; furthermore, the very high sampling in this category induces bias in the final analysis.
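For readers who want to follow the counting, ranking and incidence steps outside KNIME, here is a minimal Python sketch of the same idea. It is not the workflow itself: the sample rows, the population figures, and the helper names (`count_by`, `rank_labels`, `incidence_per_100k`) are all hypothetical, and ties are broken by input order, which may differ from the mode configured in the Rank node.

```python
from collections import Counter

# Hypothetical rows standing in for the cleaned CDC table (values are made up).
rows = [
    {"state": "A", "site": "Lung", "sex": "Female", "count": 30},
    {"state": "A", "site": "Breast", "sex": "Female", "count": 20},
    {"state": "A", "site": "Lung", "sex": "Male", "count": 10},
    {"state": "B", "site": "Lung", "sex": "Male", "count": 25},
    {"state": "B", "site": "Prostate", "sex": "Male", "count": 15},
    {"state": "C", "site": "Colon", "sex": "Female", "count": 5},
]

# Hypothetical population lookup; state "C" is deliberately missing.
population = {"A": 100_000, "B": 50_000}


def count_by(records, field):
    """Sum 'count' per distinct value of `field` (the GroupBy step)."""
    totals = Counter()
    for record in records:
        totals[record[field]] += record["count"]
    return dict(totals)


def rank_labels(totals, top_n, top_label):
    """Rank keys by descending total; label rank <= top_n, else "Other"."""
    ordered = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return {
        key: (rank, top_label if rank <= top_n else "Other")
        for rank, (key, _) in enumerate(ordered, start=1)
    }


def incidence_per_100k(records, populations):
    """Cases per 100,000 inhabitants; states without a population are dropped,
    mirroring an inner join."""
    by_state = count_by(records, "state")
    return {
        state: cases * 100_000 / populations[state]
        for state, cases in by_state.items()
        if state in populations
    }


site_ranks = rank_labels(count_by(rows, "site"), top_n=5, top_label="Top 5")
incidence = incidence_per_100k(rows, population)
state_ranks = rank_labels(incidence, top_n=1, top_label="Highest")

for state, (rank, label) in sorted(state_ranks.items(), key=lambda kv: kv[1][0]):
    print(state, rank, label, incidence[state])
```

The incidence is computed as cases times 100,000 divided by population, so states are compared on a per-capita basis rather than on raw counts.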
