Icon

Python - PySpark and MLlib RandomForest Classifier (with vtreat)

Spark MLlib in a KNIME Python node - also using vtreat to prepare the data

Demonstrates how to start a spark session within a Python node (the performance will very much depend on your machine)

URL: Medium: Data preparation for Machine Learning with KNIME and the Python “vtreat” package https://medium.com/p/efcaf58fa783
URL: Medium: KNIME and Python — Setting up and managing Conda environments https://medium.com/p/2ac217792539
URL: HUB: Python - PySpark and MLlib RandomForest Classifier (with vtreat) https://hub.knime.com/-/spaces/-/~prZ9x1tPyiYVM0aj/current-state/

Nodes

Extensions

Links