This workflow demonstrates the usage of the Create Databricks Environment node which allows you to connect to a Databricks Cluster from within KNIME Analystics Platform.
The node provides three output ports that allow you to utilize the existing DB nodes to interact wtih the Databricks DB, the file handling nodes to work with the Databricks File System, and the Spark nodes to visually assemble Spark analytics flows. All of these nodes allow you to push down the data processing into the Databricks cluster.
URL: Databricks on Amazon AWS https://databricks.com/aws
URL: Databricks on Microsoft Azure https://databricks.com/product/azure
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!