Icon

Manipulating Data Solution

There has been no title set for this workflow's metadata.

There has been no description set for this workflow's metadata.

Manipulating Data SolutionYour goal in this practice problem is to convert the input data set into the output data set using the nodes that we have covered in this andprevious lessons. Note that not all nodes that we have covered will necessarily be used in the practice problem.There are two versions of this practice problem:1) You are only given the input and output data sets. You must figure out which nodes to use, how to configure them, and how to organize yourworkflow in order to generate the desired output.2) In addition to the input and output data sets, we have provided an outline of which nodes to use at which steps to guide you along. Note thatthis is only one possible solution, as there are many ways a problem can be solved in KNIME.Using the NPS Visitation Data file that we have used throughout this lesson, you are to create an output with the following rules:• Filter out any columns that have the word "Total" in their name.• We only want to report on data for National Parks within the Intermountain region since 2010.• We must report on the number of backcountry visits for each park by year.• Reshape the data so parks and years are each only one column.• Finally, rename and reorder fields to match the desired output. INPUT DATA COLUMN FILTER RULE-BASED ROWFILTER PIVOTING UNPIVOTING TABLE MANIPULATING BONUS: SORTER Filter for:Intermountain regionNational ParksYear >= 2010Filter out fieldsthat contain theword "Total"Pivot to aggregateeach park's backcountryvisitation by yearUnpivot to place yearvalues into one columnRename and reorder fields tomatch outputSort in descendingorder byBackcountry VisitsNPS Visitation Data Rule-basedRow Filter Column Filter Pivot Unpivot Table Manipulator Sorter Table Reader Manipulating Data SolutionYour goal in this practice problem is to convert the input data set into the output data set using the nodes that we have covered in this andprevious lessons. Note that not all nodes that we have covered will necessarily be used in the practice problem.There are two versions of this practice problem:1) You are only given the input and output data sets. You must figure out which nodes to use, how to configure them, and how to organize yourworkflow in order to generate the desired output.2) In addition to the input and output data sets, we have provided an outline of which nodes to use at which steps to guide you along. Note thatthis is only one possible solution, as there are many ways a problem can be solved in KNIME.Using the NPS Visitation Data file that we have used throughout this lesson, you are to create an output with the following rules:• Filter out any columns that have the word "Total" in their name.• We only want to report on data for National Parks within the Intermountain region since 2010.• We must report on the number of backcountry visits for each park by year.• Reshape the data so parks and years are each only one column.• Finally, rename and reorder fields to match the desired output. INPUT DATA COLUMN FILTER RULE-BASED ROWFILTER PIVOTING UNPIVOTING TABLE MANIPULATING BONUS: SORTER Filter for:Intermountain regionNational ParksYear >= 2010Filter out fieldsthat contain theword "Total"Pivot to aggregateeach park's backcountryvisitation by yearUnpivot to place yearvalues into one columnRename and reorder fields tomatch outputSort in descendingorder byBackcountry VisitsNPS Visitation Data Rule-basedRow Filter Column Filter Pivot Unpivot Table Manipulator Sorter Table Reader

Nodes

Extensions

Links