Icon

08_​Connecting_​to_​Amazon_​EMR

Connecting to Amazon EMR
This workflow shows how to run a Spark job on an AWS EMR cluster via Apache Livy. It also demonstrates how to use Amazon Athena to query dataset located on an Amazon S3 bucket. Configure connections to S3 and EMR.Please enter your credentials here! Perform simple random forest on the Spark cluster. Connecting to Amazon Athena Select the created tableRead dataset fromAWS Registry of Open DataPartition intotrain and test setTrain a basicRF modelCalculate the prediction scoresVisualizationCreate a tableConnect toAmazon S3Create Spark contextvia LivyGive your Amazoncredentials hereConnect to Athena DB Table Selector CSV to Spark Spark Partitioning Spark Random ForestLearner (Regression) Spark Predictor(Regression) Spark NumericScorer Spark to Table Line Plot DB SQL Executor Amazon S3Connection Create SparkContext (Livy) AmazonAuthentication Amazon AthenaConnector This workflow shows how to run a Spark job on an AWS EMR cluster via Apache Livy. It also demonstrates how to use Amazon Athena to query dataset located on an Amazon S3 bucket. Configure connections to S3 and EMR.Please enter your credentials here! Perform simple random forest on the Spark cluster. Connecting to Amazon Athena Select the created tableRead dataset fromAWS Registry of Open DataPartition intotrain and test setTrain a basicRF modelCalculate the prediction scoresVisualizationCreate a tableConnect toAmazon S3Create Spark contextvia LivyGive your Amazoncredentials hereConnect to AthenaDB Table Selector CSV to Spark Spark Partitioning Spark Random ForestLearner (Regression) Spark Predictor(Regression) Spark NumericScorer Spark to Table Line Plot DB SQL Executor Amazon S3Connection Create SparkContext (Livy) AmazonAuthentication Amazon AthenaConnector

Nodes

Extensions

Links