Correspondence Analysis with R

Correspondence analysis (CA) is a multivariate statistical technique that reveals the relative relationships between and within two groups of variables, based on data given in a contingency table.

This component applies correspondence analysis to two string columns in the input dataset.

It automatically computes the contingency table. A third column to aggregate and an aggregation function must be specified in the configuration dialog in order to pivot the dataset.
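
The pivot step can be pictured with a small sketch. It is not the component's internal code: the function name `contingency_table`, the record layout and the sample column names (`region`, `product`, `units`) are assumptions made for illustration, and filling unseen combinations with 0 is likewise an assumption. The aggregation names follow the ones listed in the options (count, sum, unique count, mean).

```python
from collections import defaultdict


def contingency_table(records, row_key, col_key, value_key, agg="sum"):
    """Pivot records into a nested dict {row: {col: aggregated value}}."""
    # Collect the values of the aggregated column for every (row, column) pair.
    cells = defaultdict(list)
    for rec in records:
        cells[(rec[row_key], rec[col_key])].append(rec[value_key])

    aggregators = {
        "count": len,
        "sum": sum,
        "unique count": lambda vals: len(set(vals)),
        "mean": lambda vals: sum(vals) / len(vals),
    }
    aggregate = aggregators[agg]

    rows = sorted({r for r, _ in cells})
    cols = sorted({c for _, c in cells})
    # Assumption: combinations that never occur in the data are filled with 0.
    return {
        r: {c: aggregate(cells[(r, c)]) if (r, c) in cells else 0 for c in cols}
        for r in rows
    }


# Hypothetical sample data: two string columns plus one column to aggregate.
records = [
    {"region": "North", "product": "A", "units": 10},
    {"region": "North", "product": "A", "units": 5},
    {"region": "North", "product": "B", "units": 3},
    {"region": "South", "product": "A", "units": 2},
    {"region": "South", "product": "B", "units": 8},
]
table = contingency_table(records, "region", "product", "units", agg="sum")
```

Here `region` plays the role of the row dimension, `product` the column dimension, and `units` the column to aggregate; switching `agg` to `"count"` gives plain co-occurrence frequencies instead of sums.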

A complete report with graphics is generated in the component dialog.

This component uses the "FactoMineR", "factoextra" and "ggplot2" R libraries.
You need R to be correctly configured to use this component.

Options

Row Dimension
Row dimension of the contingency table
Column dimension
Column dimension of the contingency table
Column to aggregate
Select the column to summarize (count, sum, unique count, mean)
Number of dimensions to keep
Number of eigenvectors (dimensions) to extract
Aggregation function
Function used to aggregate the data
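
As a rough guide for choosing the number of dimensions, the following sketch computes two standard quantities from a contingency table: the total inertia (the chi-square statistic divided by the grand total) and the largest number of dimensions correspondence analysis can produce, min(rows, columns) - 1. It is an illustration in plain Python, not the FactoMineR computation the component runs; the function names and the sample counts are hypothetical, and the table is assumed to have no all-zero row or column.

```python
def total_inertia(table):
    """Total inertia of a contingency table: chi-square statistic / grand total."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected under independence of rows and columns.
            # Assumption: no row or column total is zero.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2 / n


def max_dimensions(table):
    """Upper bound on the number of dimensions that can be extracted."""
    return min(len(table), len(table[0])) - 1


# Hypothetical 2 x 2 table of counts (rows x columns).
counts = [[15, 3], [2, 8]]
inertia = total_inertia(counts)
dims = max_dimensions(counts)
```

Requesting more dimensions than `max_dimensions` allows cannot add information, since the total inertia is split across at most that many eigenvalues.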

Input Ports

The input table needs two string columns and a third column to aggregate.

Output Ports

A summary of all statistics necessary to validate the row profiles
A summary of all statistics necessary to validate the column profiles
