Challenge Collection

CDC Cancer Data - Solution

You received the 2017 cancer data from the CDC for inspection, and your goal is to answer the following questions: (1) What are the top-5 most frequent cancer types occurring in females? (2) What are the top-5 most frequent cancer types occurring in males? (3) Which US state has the highest cancer incidence rate (that is, the highest number of cancer cases normalized by the size of its population)?
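Outside of KNIME, the logic behind the three questions can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the record layout `(state, sex, cancer_type, cases)`, the function names, and all numbers below are made up for the example and are not taken from the CDC data or from the workflow itself.

```python
from collections import Counter, defaultdict

# Toy records, NOT real CDC figures: (state, sex, cancer_type, cases).
records = [
    ("KY", "Female", "Breast", 120),
    ("KY", "Female", "Lung", 80),
    ("KY", "Male", "Prostate", 100),
    ("KY", "Male", "Lung", 90),
    ("UT", "Female", "Breast", 150),
    ("UT", "Female", "Lung", 40),
    ("UT", "Male", "Prostate", 130),
    ("UT", "Male", "Lung", 50),
]

# Toy population sizes per state, also invented for the example.
population = {"KY": 100_000, "UT": 200_000}


def top_types(records, sex, n=5):
    """Return the n most frequent cancer types for one sex as (type, cases) pairs."""
    counts = Counter()
    for _state, rec_sex, cancer_type, cases in records:
        if rec_sex == sex:
            counts[cancer_type] += cases
    return counts.most_common(n)


def incidence_rates(records, population):
    """Return total cases per state divided by that state's population."""
    totals = defaultdict(int)
    for state, _sex, _cancer_type, cases in records:
        totals[state] += cases
    return {state: total / population[state] for state, total in totals.items()}


def highest_incidence_state(records, population):
    """Return the state whose normalized case count is largest."""
    rates = incidence_rates(records, population)
    return max(rates, key=rates.get)


print(top_types(records, "Female"))
print(top_types(records, "Male"))
print(highest_incidence_state(records, population))
```

Normalizing by population is what separates question (3) from a plain count: in the toy data both states have a similar number of cases, but the smaller state ends up with the higher rate.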

Challenge 1: CDC Cancer Data

You received the 2017 cancer data from the CDC for inspection, and your goal is to answer the following questions: (1) What are the top-5 most frequent cancer types occurring in females? (2) What are the top-5 most frequent cancer types occurring in males? (3) Which US state has the highest cancer incidence rate (that is, the highest number of cancer cases normalized by the size of its population)?

Challenge 2: Eating Out

You are interning for a travel agency that wants to know about their top spenders' eating habits. As a trial run, you are given a tiny dataset with 6647 rows containing information on 1011 unique citizens traveling with different purposes. In this challenge you will:

  1. Find the top 10 participants spending the highest amount of money on eating.
  2. Find out whether the people who spend the most money on eating are the same people who spend the most time eating.

Note: Sometimes the ending balance is more than the starting balance. Assume this was a mistake when calculating money spent. To calculate time difference, you can use the Date&Time Difference Node. Don't forget to convert the Strings before.

Challenge 3: Rare Blood Types

You have a dataset containing information on US citizens who donated blood in the last year, including addresses and blood types. The O- blood type, also known as "universal donor", is one of the most valuable blood types in the world because it can be transfused to nearly any person. Your goal here is to help a group of researchers find the number of citizens with O- blood type per US state. Unfortunately, the address column comes in a single line, so to extract the state information you will have to perform some data wrangling. They also asked you to create a choropleth map of the US to visualize the results.

Challenge 4: Days With Price Changes

You are using KNIME to monitor the daily price of a product online. After using the Line Plot node to visualize the daily prices you have already gathered, you notice that they are often constant for a certain number of days before changing again. You want to create a new column in the price data you have at hand, named "Change", such that its value is 1 if a daily price changed with respect to the previous day, or 0 if it remained unchanged. For the first daily price in the data, the "Change" value should be 1.
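The rule for the "Change" column in the price-change challenge can be written down as a short Python sketch. It is not the KNIME solution, only the same logic in plain code; the function name `change_flags` and the sample prices are invented for illustration.

```python
def change_flags(prices):
    """Return 1 where a price differs from the previous day's price, else 0.

    The first price always gets 1, as the challenge requires.
    """
    flags = []
    previous = None
    for index, price in enumerate(prices):
        if index == 0 or price != previous:
            flags.append(1)
        else:
            flags.append(0)
        previous = price
    return flags


# Made-up daily prices, one value per day.
daily_prices = [10, 10, 12, 12, 12, 9]
print(change_flags(daily_prices))  # [1, 0, 1, 0, 0, 1]
```

The explicit check for the first index is what guarantees the initial 1; comparing only against the previous value would depend on whatever placeholder is used for the missing previous day.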
