04_Dimensionality_Reduction_exercise

There has been no title set for this workflow's metadata.

Dimensionality Reduction - exercise

Introduction to Machine Learning Algorithms course - Session 4
Exercise 4
Apply the following dimensionality reduction techniques to the data:
- Filter out columns with a low variance
- Filter out one of two columns with a high linear correlation
- Replace numeric columns with principal components
- Filter out columns which are not important in predicting the target column

URL: Ames Housing Dataset on kaggle https://www.kaggle.com/prevek18/ames-housing-dataset
URL: Description of the Ames Iowa Housing Data https://rdrr.io/cran/AmesHousing/man/ames_raw.html
URL: Seven Techniques for Data Dimensionality Reduction https://www.knime.com/blog/seven-techniques-for-data-dimensionality-reduction
URL: 3 New Techniques for Data-Dimensionality Reduction in Machine Learning https://thenewstack.io/3-new-techniques-for-data-dimensionality-reduction-in-machine-learning/
URL: Slides (Introduction to ML Algorithms course) https://www.knime.com/form/material-download-registration

Nodes

Extensions

Download

To use this workflow in KNIME, download it from the below URL and open it in KNIME:

Download Workflow

Created by: Emilio Silvestri

Created at: 2019-11-29

On NodePit since: 2024-06-15

Last update: 2025-07-02

Created with KNIME version: v5.1.2

Tags: dimensionality reductiondata manipulationpreprocessingPCAfeature importanceeducation

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!

04_​Dimensionality_​Reduction_​exercise

Nodes

Extensions

Links

Download

04_Dimensionality_Reduction_exercise