Icon

02_​Hive_​WritingToDB

02 Hive WritingtoDB Exercise
Exercise: 02_Hive_WritingToDB This workflow implements a missing value strategy and writes the results back into Hive.The workflow already reads data from Hive in the Local Big Data Environment; selects table ss13pme; isolates rows with missing cow valueand rows with not missing cow values; imports the data into KNIME; creates a model to predict values for cow; uses cow predictions to overridemissing income values; rebuilds the datasets with the predicted cow values instead of the missing values. Now:1. Write the results back onto a new table named "newTable" in Hive using using the HDFS Connection of the Local Big Data Environment, alongwith a combination of the DB Table Creator & DB Loader nodes.Make sure you have executed the /2_Hadoop/2_Exercises/00_Setup_Hive_Table workflow during your current KNIME session before runningthis workflow. removePUMA* &PWGTP*COW is NOT NULLCOW is NULLremove COWCOW to stringappend predictedCOW columnimport all rows where COW is NOT NULLpredict COWimport all rows whereCOW is NULLselect * from ss13pmeConnect to Local Big DataEnvironment DB Column Filter DB Row Filter DB Row Filter DB Column Filter Number To String Decision TreePredictor DB Reader DecisionTree Learner DB Reader DB Table Selector Concatenate Create Local BigData Environment Exercise: 02_Hive_WritingToDB This workflow implements a missing value strategy and writes the results back into Hive.The workflow already reads data from Hive in the Local Big Data Environment; selects table ss13pme; isolates rows with missing cow valueand rows with not missing cow values; imports the data into KNIME; creates a model to predict values for cow; uses cow predictions to overridemissing income values; rebuilds the datasets with the predicted cow values instead of the missing values. Now:1. Write the results back onto a new table named "newTable" in Hive using using the HDFS Connection of the Local Big Data Environment, alongwith a combination of the DB Table Creator & DB Loader nodes.Make sure you have executed the /2_Hadoop/2_Exercises/00_Setup_Hive_Table workflow during your current KNIME session before runningthis workflow. removePUMA* &PWGTP*COW is NOT NULLCOW is NULLremove COWCOW to stringappend predictedCOW columnimport all rows where COW is NOT NULLpredict COWimport all rows whereCOW is NULLselect * from ss13pmeConnect to Local Big DataEnvironmentDB Column Filter DB Row Filter DB Row Filter DB Column Filter Number To String Decision TreePredictor DB Reader DecisionTree Learner DB Reader DB Table Selector Concatenate Create Local BigData Environment

Nodes

Extensions

Links