Icon

Manipulating Data Practice

There has been no title set for this workflow's metadata.

There has been no description set for this workflow's metadata.

Manipulating Data PracticeYour goal in this practice problem is to convert the input data set into the output data set using the nodes that we have covered in this andprevious lessons. Note that not all nodes that we have covered will necessarily be used in the practice problem.There are two versions of this practice problem:1) You are only given the input and output data sets. You must figure out which nodes to use, how to configure them, and how to organize yourworkflow in order to generate the desired output.2) In addition to the input and output data sets, we have provided an outline of which nodes to use at which steps to guide you along. Note thatthis is only one possible solution, as there are many ways a problem can be solved in KNIME.Using the NPS Visitation Data file that we have used throughout this lesson, you are to create an output with the following rules:• Filter out any columns that have the word "Total" in their name.• We only want to report on data for National Parks within the Intermountain region since 2010.• We must report on the number of backcountry visits for each park by year.• Reshape the data so parks and years are each only one column.• Finally, rename and reorder fields to match the desired output.BONUS:• Determine which park and year had the highest number of backcountry visits. How many backcountry visits did that park record? INPUT DATA OUTPUT DATA INPUT DATA COLUMN FILTER RULE-BASED ROWFILTERormultiple ROW FILTERS PIVOTING UNPIVOTING TABLE MANIPULATOR SORTER DELETE TO REVEAL STEP-BY-STEP HINTS NPS Visitation DataSolution Table Reader CSV Reader Manipulating Data PracticeYour goal in this practice problem is to convert the input data set into the output data set using the nodes that we have covered in this andprevious lessons. Note that not all nodes that we have covered will necessarily be used in the practice problem.There are two versions of this practice problem:1) You are only given the input and output data sets. You must figure out which nodes to use, how to configure them, and how to organize yourworkflow in order to generate the desired output.2) In addition to the input and output data sets, we have provided an outline of which nodes to use at which steps to guide you along. Note thatthis is only one possible solution, as there are many ways a problem can be solved in KNIME.Using the NPS Visitation Data file that we have used throughout this lesson, you are to create an output with the following rules:• Filter out any columns that have the word "Total" in their name.• We only want to report on data for National Parks within the Intermountain region since 2010.• We must report on the number of backcountry visits for each park by year.• Reshape the data so parks and years are each only one column.• Finally, rename and reorder fields to match the desired output.BONUS:• Determine which park and year had the highest number of backcountry visits. How many backcountry visits did that park record? INPUT DATA OUTPUT DATA INPUT DATA COLUMN FILTER RULE-BASED ROWFILTERormultiple ROW FILTERS PIVOTING UNPIVOTING TABLE MANIPULATOR SORTER DELETE TO REVEAL STEP-BY-STEP HINTS NPS Visitation DataSolution Table Reader CSV Reader

Nodes

Extensions

Links