This node is currently not available in KNIME v5.4 — instead we’re showing this page for KNIME v4.7. You can use the version menu in the title bar to permanently switch your preferred version. This will also show the link to the update site.

TreeSHAP Gradient Boosted Trees

SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions. While SHAP can explain the output of any machine learning model, Lundberg and his collaborators have developed a high-speed exact algorithm for tree ensemble methods [1], [2].

Usage

The Tree SHAP Gradient Boosted Trees Predictor is used as a substitute to the Gradient Boosted Trees Predictor. Simply replace every Gradient Boosted Trees Predictor with this node to get started. If you are using a different tree based method, consider the other nodes in this package.

Interpretation

The beautiful thing about SHAP values is the intuitive interpretation. Every model has an expected output, the average prediction. The model prediction for a data row is the expected output plus the summation of SHAP values. This leads to intuitive explanations, for example in predictive maintenance "The high production output over the last three months contributed +20% probability that the machine breaks down in the next month.".

Enterprise Support

If you need help integrating explainable machine learning methods in your company, please contact me at morriskurz@gmail.com

Credits

All credits to the original research and development of the C++ and Python code go to Lundberg and his collaborators.

Options

Change prediction column name: Select if you want to change the name of the column containing the prediction.
Prediction column name: The name of the column that will contain the prediction of the tree ensemble model
Append overall prediction confidence: The confidence of the predicted class. It is the maximum of all confidence values (which can be appended separately).
Append individual class probabilities: For each class the prediction confidence. It's the number of trees predicting to the current class (as per column name) divided by the total number of trees.
Suffix for probability columns: Here a suffix for the names of the class probability columns can be entered.
Show explanation: Activate this to compute the SHAP values. If this box is unchecked, the node is equivalent to a simple predictor node.
Compute interactions: Computes the Shapley interaction values exactly. WARNING: Computationally expensive. The runtime increases by 2 * #features compared to the SHAP values without interactions.
Positive class: Select the value from the class column that stands for the "positive" class. In most use cases, the positive class corresponds to the class of interest. For example: In churn prediction, the positive class could be the customers who will cancel the subscription. The node will automatically select the first possible option when the node is not configured.

Input Ports

: The output of the Gradient Boosted Trees Learner.
: Data to be predicted and explained.

Output Ports

: The input data along with prediction columns and corresponding SHAP values.

Popular Predecessors

Popular Successors

Views

This node has no views

Workflows

No workflows found

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension TreeSHAP - Explainable Machine Learning in KNIME from the below update site following our NodePit Product and Node Installation Guide:

v4.7

A zipped version of the software site can be downloaded here.

Plugin provider: Morris Kurz, morriskurz@gmail.com

Plugin version: 1.0.0.v202108120940

On NodePit since: 2022-12-06

Last update: 2025-06-26

Tags: Streamable

KNIME versions: From v4.1 to v4.7

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!