Giskard LLM Scanner

This node provides an open-source framework for detecting potential vulnerabilites in the GenAI model contained in the provided workflow. It evaluates the workflow by combining heuristics-based and LLM-assisted detectors. Giskard uses the provided LLM for the evaluation but applies different model parameters for some of the detectors. The viability of the LLM-assisted detectors can be improved by providing an optional input table with common example prompts for the workflow.

The node uses detectors for the following vulnerabilities:

  • Hallucination and Misinformation: Detects if the workflow is prone to generate fabricated or false information.
  • Harmful Content: Detects if the workflow is prone to produce content that is unethical, illegal or otherwise harmful.
  • Prompt Injection: Detects if the workflow's behavior can be altered via a variety of prompt injection techniques.
  • Robustness: Detects if the workflow is sensitive to small perturbations in the input that result in inconsistent responses.
  • Stereotypes: Detects stereotype-based discrimination in the workflow responses.
  • Information Disclosure: Attempts to cause the workflow to disclose sensitive information such as secrets or personally identifiable information. Might produce false-positives if the workflow is required to output information that can be considered sensitive such as contact information for a business.
  • Output Formatting: Checks that the workflow output is consistent with the format requirements indicated in the model description, if such instructions are provided.

This node does not utilize Giskard's LLMCharsInjectionDetector. For more details on LLM vulnerabilities, refer to the Giskard documentation

In order to perform tasks with LLM-assisted detectors, Giskard sends the following information to the language model provider:

  • Data provided in your dataset
  • Text generated by your model
  • Model name and description

Note that this does not apply if a self-hosted model is used.

More information on Giskard can be found in the documentation.

Note: If you use the Credentials Configuration node and do not select the "Save password in configuration (weakly encrypted)" option for passing the API key for the LLM connector node, the Credentials Configuration node will need to be reconfigured upon reopening the workflow, as the credentials flow variable was not saved and will therefore not be available to downstream nodes.

Options

Workflow name

A descriptive name for the workflow under evaluation. Used to generate domain-specific probes and included in the generated report.

Workflow decription

A more detailed description of the evaluated workflow that explains its purpose. Used to generate domain-specific probes.

Feature columns

The columns used as features by the generative workflow. Feature columns must be of type string. These columns must exist in the dataset if one is provided.

Response column

The column in the output table of the workflow that represents the LLM responses.

Ignore detection errors

If checked, execution will not stop when detection errors are encountered. Failed detectors will be ignored when creating the result.

Input Ports

Icon

The large language model or chat model used to analyze the workflow.

Icon

The generative workflow to analyze with Giskard.

Icon

Dataset that is used to enhance the LLM-assisted detectors.

Output Ports

Icon

The Giskard scan report as table.

Popular Predecessors

  • No recommendations found

Popular Successors

  • No recommendations found

Views

Giskard report
The Giskard scan report as HTML.

Workflows

  • No workflows found

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.