Azure OpenAI LLM Selector

This node establishes a connection with a Large Language Model (LLM) deployed on Azure OpenAI. After successfully authenticating with the Azure OpenAI Authenticator node, enter the deployment name of the model you want to use. You can find the deployed models in Azure AI Studio under 'Management - Deployments'.

Note: Chat models are LLMs that have been fine-tuned for chat-based use cases. These models can nevertheless be used in other applications as well.

Note: If you pass the API key with the Credentials Configuration node and do not select its "Save password in configuration (weakly encrypted)" option, the credentials flow variable is not saved with the workflow. The Credentials Configuration node must then be reconfigured each time the workflow is reopened, because the credentials will otherwise not be available to downstream nodes.

Options

Azure Deployment

Deployment name: The name of the deployed model to use. You can find the deployed models in Azure AI Studio.
The model is a reasoning model: Enable this option if the deployed model is a reasoning model (e.g. o3, o4-mini and gpt-5). Reasoning models use fixed settings for certain parameters, "temperature=1.0" and "top_p=1.0", and do not support custom values for either of them. They also do not support the standard 'Maximum response length (tokens)' setting unless this option is enabled; once it is selected, the maximum token parameter works as intended. To verify whether a model is categorized as a reasoning model, please refer to the OpenAI Docs.
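The effect of the reasoning-model option can be pictured with a small sketch. How the node handles this internally is not stated here, so the helper below is hypothetical; it assumes the OpenAI chat completions convention in which reasoning models take "max_completion_tokens" instead of "max_tokens", and it simply omits the sampling parameters that reasoning models keep fixed.

```python
def build_request_params(deployment_name, max_tokens, temperature, top_p,
                         is_reasoning_model):
    """Hypothetical helper: assemble keyword arguments for a chat request."""
    params = {"model": deployment_name}
    if is_reasoning_model:
        # Reasoning models use fixed sampling settings, so temperature and
        # top_p are left out. Assumption: the token limit is passed under
        # the name "max_completion_tokens".
        params["max_completion_tokens"] = max_tokens
    else:
        params["max_tokens"] = max_tokens
        params["temperature"] = temperature
        params["top_p"] = top_p
    return params


# "my-deployment" is a placeholder deployment name.
reasoning = build_request_params("my-deployment", 200, 0.2, 0.9, True)
standard = build_request_params("my-deployment", 200, 0.2, 0.9, False)
```

With the option enabled, the custom Temperature and Top-p values never reach the request, which matches the note that these settings are ignored for reasoning models.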

Model Parameters

Maximum response length (token)

The maximum number of tokens to generate.

This value, plus the token count of your prompt, cannot exceed the model's context length.
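The constraint above is plain arithmetic, which the following sketch spells out. The function names and the example numbers (a context length of 8192 tokens and a prompt of 7000 tokens) are made up for illustration; look up the real context length of your deployed model.

```python
def fits_in_context(context_length, prompt_tokens, max_response_tokens):
    """True if prompt plus maximum response stays within the context length."""
    return prompt_tokens + max_response_tokens <= context_length


def largest_allowed_response(context_length, prompt_tokens):
    """Largest 'Maximum response length' that still fits, never below zero."""
    return max(context_length - prompt_tokens, 0)


# Hypothetical numbers for illustration only.
context_length = 8192
prompt_tokens = 7000
print(largest_allowed_response(context_length, prompt_tokens))
print(fits_in_context(context_length, prompt_tokens, 2000))
```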

Temperature

Sampling temperature to use, between 0.0 and 2.0.

Higher values will lead to less deterministic answers.

Try 0.9 for more creative applications, and 0 for ones with a well-defined answer. It is generally recommended to alter this or Top-p, but not both.

Note: This setting is ignored for reasoning models like GPT-5 or o-series models.
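To see why higher temperatures give less deterministic answers, here is a minimal sketch of temperature scaling applied to a toy list of token scores (logits). It illustrates the general technique only; the scores are invented, and treating a temperature of 0 as "always pick the top token" is an assumption about how that edge case is commonly handled.

```python
import math


def apply_temperature(logits, temperature):
    """Turn raw token scores into probabilities at a given temperature."""
    if temperature <= 0:
        # Assumed edge case: all probability goes to the highest-scoring token.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    highest = max(scaled)  # subtracted for numerical stability
    exps = [math.exp(x - highest) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


toy_logits = [2.0, 1.0, 0.0]  # invented scores for three candidate tokens
low = apply_temperature(toy_logits, 0.2)   # sharply peaked on the first token
high = apply_temperature(toy_logits, 2.0)  # flatter, more varied sampling
```

At a low temperature nearly all probability sits on the top-scoring token; at a high temperature the distribution flattens, so repeated runs are more likely to pick different tokens.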

Seed

Set the seed parameter to any integer of your choice to have (mostly) deterministic outputs. The default value of 0 means that no seed is specified.

If the seed and other model parameters are the same for each request, then responses will be mostly identical. There is a chance that responses will differ, due to the inherent non-determinism of OpenAI models.
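The rule that the default value of 0 means "no seed" can be sketched as follows. The helper name is hypothetical, and the assumption that the seed is forwarded as a request argument called "seed" follows the OpenAI API rather than anything stated about the node's internals.

```python
def seed_argument(seed):
    """Hypothetical helper: 0 means no seed is sent with the request."""
    if seed == 0:
        return {}
    return {"seed": seed}


print(seed_argument(0))
print(seed_argument(42))
```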

Please note that this feature is in beta and is currently supported only for gpt-4-1106-preview and gpt-3.5-turbo-1106 [1].

[1] OpenAI Cookbook

Number of concurrent requests

Maximum number of concurrent requests to LLMs that can be made, whether through API calls or to an inference server. Exceeding this limit may result in temporary restrictions on your access.

It is important to plan your usage according to the model provider's rate limits, and keep in mind that both software and hardware constraints can impact performance.

For OpenAI, please refer to the Limits page for the rate limits available to you.
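Capping the number of concurrent requests is commonly done with a bounded worker pool. The sketch below shows that general pattern with Python's standard library; it is not the node's implementation, and "send_request" stands for any hypothetical function that sends one prompt to the model and returns the answer.

```python
from concurrent.futures import ThreadPoolExecutor


def run_with_limit(prompts, send_request, max_concurrent):
    """Send all prompts, with at most max_concurrent requests in flight."""
    # The pool never runs more than max_concurrent workers at once,
    # and map() returns the results in the order of the prompts.
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        return list(pool.map(send_request, prompts))


def echo_request(prompt):
    # Stand-in for a real model call, used only for demonstration.
    return prompt.upper()


answers = run_with_limit(["first", "second", "third"], echo_request, 2)
```

Choosing a limit below the provider's rate limit leaves headroom for other workflows that share the same deployment.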

Top-p sampling

An alternative to sampling with temperature, where the model considers only the tokens (words) that make up the top_p probability mass. For example, 0.1 means that only the tokens comprising the top 10% probability mass are considered.
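A minimal sketch of top-p (nucleus) filtering makes the idea concrete: tokens are ranked by probability and kept until their cumulative mass reaches top_p, then the kept probabilities are renormalized. The token names and probabilities below are invented, and real implementations may differ in details such as tie handling.

```python
def top_p_filter(probs, top_p):
    """Keep the most likely tokens whose cumulative probability reaches top_p."""
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)
    kept = {}
    cumulative = 0.0
    for token, p in ranked:
        kept[token] = p
        cumulative += p
        if cumulative >= top_p:
            break
    total = sum(kept.values())
    # Renormalize so the kept probabilities sum to 1 again.
    return {token: p / total for token, p in kept.items()}


toy_probs = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}  # invented values
narrow = top_p_filter(toy_probs, 0.1)  # only the single most likely token
wider = top_p_filter(toy_probs, 0.7)   # the two most likely tokens
```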

Input Ports

Validated authentication for Azure OpenAI.

Output Ports

Configured Azure OpenAI Large Language Model.

Views

This node has no views

Installation

To use this node in KNIME, install the extension KNIME Python Extension Development (Labs) from the below update site following our NodePit Product and Node Installation Guide:

v5.8

A zipped version of the software site can be downloaded here.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.8.0.v202510031553

On NodePit since: 2025-10-17

Last update: 2025-10-26

Tags: Modern UI

KNIME versions: Since v5.2