Linear Regression Learner

Computes a multiple linear regression model. Select a target column that represents the response variable and a set of columns that represent the independent variables.

Options

Target
Select the numeric target column (response variable) the regression should predict.
Values
Choose the independent variables to include in the regression model. Nominal columns are converted into dummy variables automatically, as described in Categorical variables in regression.
Predefined offset value
Provide a user-defined intercept instead of estimating the constant term. When enabled, the specified offset is used as intercept and the model does not estimate a constant.
Offset value
Enter the constant term used when a predefined offset is applied to the regression model.
Missing values in input data
Decide whether rows containing missing values should be ignored or cause the node execution to abort.
  • Ignore rows with missing values.: Skips rows containing missing values so the regression model is built only on complete rows.
  • Fail on observing missing values.: Stops execution with an error if missing values occur in the input data.
First row
Specify the index of the first row that is available for the scatter plot view. Counting starts at 1.
Row count
Define how many rows should be provided to the scatter plot view.

Input Ports

Icon
Table containing the data used for the regression.

Output Ports

Icon
The computed regression model.
Icon
Coefficients and statistics of the linear regression model.

Views

Linear Regression Result View
Displays the estimated coefficients and error statistics.
Linear Regression Scatterplot View
Displays the input data along with the regression line in a scatterplot. The y-coordinate is fixed to the response column (the column that has been approximated) while the x-column can be chosen among the independent variables with numerical values. Note: If you have multiple input variables, this view is only an approximation. It will fix the value of each variable that is not shown in the view to its mean. Thus, this view generally only makes sense if you only have a few input variables.

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.