
Challenge Collection

CDC Cancer Data - Solution

You received the 2017 cancer data from the CDC for inspection, and your goal is to answer the following questions: (1) What are the top-5 most frequent cancer types occurring in females? (2) What are the top-5 most frequent cancer types occurring in males? (3) Which US state has the highest cancer incidence rate (that is, the highest number of cancer cases normalized by the size of its population)?

Challenge 1: CDC Cancer Data
You received the 2017 cancer data from the CDC for inspection, and your goal is to answer the following questions: (1) What are the top-5 most frequent cancer types occurring in females? (2) What are the top-5 most frequent cancer types occurring in males? (3) Which US state has the highest cancer incidence rate (that is, the highest number of cancer cases normalized by the size of its population)?

Challenge 2: Eating Out
You are interning for a travel agency that wants to know about their top spenders' eating habits. As a trial run, you are given a tiny dataset with 6647 rows containing information on 1011 unique citizens traveling with different purposes. In this challenge you will: 1. Find the top 10 participants spending the highest amount of money on eating. 2. Find out whether the people who spend the most money on eating are the same people who spend the most time eating. Note: Sometimes the ending balance is more than the starting balance. Assume this was a mistake when calculating money spent. To calculate time difference, you can use the Date&Time Difference Node. Don't forget to convert the Strings before.

Challenge 3: Rare Blood Types
You have a dataset containing information on US citizens who donated blood in the last year, including addresses and blood types. The O- blood type, also known as "universal donor", is one of the most valuable blood types in the world because it can be transfused to nearly any person. Your goal here is to help a group of researchers find the number of citizens with O- blood type per US state. Unfortunately, the address column comes in a single line, so to extract the state information you will have to perform some data wrangling. They also asked you to create a choropleth map of the US to visualize the results.

Challenge 4: Days With Price Changes
You are using KNIME to monitor the daily price of a product online. After using the Line Plot node to visualize the daily prices you have already gathered, you notice that they are often constant for a certain number of days before changing again. You want to create a new column in the price data you have at hand, named "Change", such that its value is 1 if a daily price changed with respect to the previous day, or 0 if it remained unchanged. For the first daily price in the data, the "Change" value should be 1.
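The "Change" rule from Challenge 4 can be sketched outside KNIME as a short Python function. This is only an illustration of the rule as stated, not the KNIME solution; the function name `mark_changes` and the sample prices are made up for the example.

```python
def mark_changes(prices):
    """Return one 0/1 flag per price: 1 if it differs from the previous day's price.

    The first price always gets 1, as the challenge text requires.
    """
    changes = []
    previous = None
    for index, price in enumerate(prices):
        if index == 0 or price != previous:
            changes.append(1)
        else:
            changes.append(0)
        previous = price
    return changes


# Made-up daily prices, for illustration only.
daily_prices = [9.99, 9.99, 10.49, 10.49, 10.49, 9.99]
print(mark_changes(daily_prices))  # prints [1, 0, 1, 0, 0, 1]
```

Comparing each value with the one before it is the same idea as lagging the price column by one row and comparing the two columns.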
CSV Reader
Filtered to females
Row Filter
State with the highest cancer rate
Top k Row Filter
All cancer types listed
GroupBy
Sorted in descending order by the most frequent cancer type
Sorter
Top 5 cancer types in females
Top k Row Filter
CSV Reader
Sorted in descending order by the most frequent cancer type
Sorter
Filtered to males
Row Filter
Sorted by state
GroupBy
All cancer types listed
GroupBy
Population of the US states
Excel Reader
Top 5 cancer types in males
Top k Row Filter
Cancer data
CSV Reader
Row Filter
Renamed the column "sum Count"
Column Renamer
Joining the cancer and population data
Joiner
Removed the dot before the state names
String Manipulation
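
For readers who want to follow the logic outside KNIME, the node chain above can be approximated in plain Python. This is a rough sketch, not the workflow itself: the row layout, the column order, the function names, and all numbers are invented for illustration, and it assumes that the population table is the one whose state names carry the leading dot.

```python
from collections import defaultdict

# Hypothetical rows: (state, sex, cancer type, case count). Not real CDC figures.
cancer_rows = [
    ("Alabama", "Female", "Breast", 40),
    ("Alabama", "Female", "Lung", 25),
    ("Alabama", "Male", "Prostate", 35),
    ("Alaska", "Female", "Breast", 10),
    ("Alaska", "Male", "Prostate", 8),
    ("Alaska", "Male", "Lung", 12),
]

# Hypothetical population table; state names are assumed to start with a dot.
population_rows = [(".Alabama", 1000), (".Alaska", 200)]


def top_k_types(rows, sex, k=5):
    """Mimic Row Filter -> GroupBy (sum) -> Sorter (descending) -> Top k Row Filter."""
    totals = defaultdict(int)
    for _state, row_sex, cancer_type, count in rows:
        if row_sex == sex:
            totals[cancer_type] += count
    ranked = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return ranked[:k]


def highest_rate_state(rows, populations):
    """Mimic GroupBy state -> strip leading dot -> Joiner -> rate -> top 1."""
    cases = defaultdict(int)
    for state, _sex, _cancer_type, count in rows:
        cases[state] += count
    # Remove the leading dot so both tables share the same state key.
    population = {name.lstrip("."): size for name, size in populations}
    # Inner join: only states present in both tables get a rate.
    rates = {s: cases[s] / population[s] for s in cases if s in population}
    return max(rates.items(), key=lambda item: item[1])


print(top_k_types(cancer_rows, "Female"))
print(top_k_types(cancer_rows, "Male"))
print(highest_rate_state(cancer_rows, population_rows))
```

Dividing the summed case count by the population is what makes states of different sizes comparable; ranking by raw counts alone would mostly reflect population size.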
