Table Validator (Reference)

This node validates the structure and content of a table against the structure and content of a second table that serves as the reference. Connect the table to validate to the first input port and the reference table to the second input port. In contrast to the Table Validator node, the validation checks are applied on the basis of all columns of the reference table and cannot be adjusted per column.

The node ensures that the structure of the result table is mostly identical to the reference specification. It does so by re-sorting columns, inserting missing columns (filled with missing values) and optionally removing additional columns. You can also choose whether columns are required and whether the data type or the domain should be checked or converted.

If the validation succeeds, the data is output at the first port (potentially renamed, sorted according to the reference specification and with converted types). If the validation fails, either the node fails, or the first port is inactive and the second port contains a table that lists all conflicts. All options below that are marked with Data also force a traversal of the input data.

Options

Column name matching
Controls what counts as a column name match between the input table and the reference table. If 'case insensitive' is chosen, the node still tries to find an exactly (case-sensitively) matching column name first, and only then falls back to case-insensitive matching.
  • Case sensitive: Column names must match exactly
  • Case insensitive: Columns whose names differ only in letter case are also considered a match and are validated according to this configuration.
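The lookup order described above (exact match first, case-insensitive fallback second) can be sketched in a few lines of Python. This is only an illustration of the rule, not the node's implementation; the function name `match_column` and its parameters are hypothetical.

```python
def match_column(reference_name, input_names, case_sensitive=True):
    """Return the input column name that matches reference_name, or None.

    Hypothetical helper: illustrates the matching order only.
    """
    # An exact (case-sensitive) match always wins, in both modes.
    if reference_name in input_names:
        return reference_name
    if case_sensitive:
        return None
    # Fallback: the first input column whose name matches ignoring case.
    target = reference_name.casefold()
    for name in input_names:
        if name.casefold() == target:
            return name
    return None
```

With input columns `["age", "Age"]` and reference column `"Age"`, the exact match `"Age"` is chosen even in case-insensitive mode; with `["AGE", "age"]`, the first candidate `"AGE"` is chosen.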
If a column is missing in the table
Ensures that the reference columns exist in the input table. If case insensitive name matching is selected, the first matching column will satisfy this condition.
  • Ignore: Ignore missing columns and do nothing
  • Fail validation: Fails the validation if columns don't exist
  • Insert column with missing values: Inserts missing columns and fills them with missing values
If there is an additional column in the table
Specifies how to handle columns that are present in the table to validate but not in the reference table. Additional columns can cause the validation to fail, be removed, or be moved to the end of the table.
  • Fail validation: Additional columns will cause the validation to fail
  • Remove: Additional columns will be removed
  • Move to end: Additional columns will be moved to the end of the table
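As a rough sketch of how the two structure options above interact, the following Python function reorders a table to follow the reference columns, then handles missing and additional columns. It assumes exact name matching and represents a table as a dict that maps column names to lists of values, with `None` standing in for a missing value. The function name, the option strings, and the conflict messages are all hypothetical.

```python
def reconcile_structure(table, reference_columns,
                        on_missing="insert", on_additional="move_to_end"):
    """Return (result_table, conflicts) for a dict-of-lists table.

    on_missing:    "ignore" | "fail" | "insert"
    on_additional: "fail" | "remove" | "move_to_end"
    """
    n_rows = len(next(iter(table.values()), []))
    result, conflicts = {}, []

    # Reference columns come first, in reference order.
    for name in reference_columns:
        if name in table:
            result[name] = table[name]
        elif on_missing == "insert":
            result[name] = [None] * n_rows  # filled with missing values
        elif on_missing == "fail":
            conflicts.append(f"missing column '{name}'")
        # "ignore": do nothing

    # Columns that are not part of the reference.
    for name in table:
        if name in reference_columns:
            continue
        if on_additional == "move_to_end":
            result[name] = table[name]
        elif on_additional == "fail":
            conflicts.append(f"additional column '{name}'")
        # "remove": the column is simply not copied

    return result, conflicts
```

For example, validating a table with columns `b`, `x`, `a` against the reference columns `a`, `b`, `c` with the defaults yields the column order `a`, `b`, `c`, `x`, where `c` contains only missing values.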
If data type does not match
Ensures that the data type of each column matches the data type of the corresponding reference column.
  • Ignore: Ignores data type mismatches and does nothing
  • Fail validation: Fails the validation if reference data type is not a super type of the data type to validate
  • Try to convert: Attempts conversion and fails the validation if not possible
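The 'Try to convert' behavior can be pictured as an all-or-nothing conversion of a column: either every value converts to the reference type, or the validation reports a conflict. The sketch below uses plain Python types as a stand-in; the actual conversion rules between KNIME data types are not described here, so the function `convert_column` and its return shape are assumptions made for illustration.

```python
def convert_column(values, target_type):
    """Try to convert every value; return (converted_values, error).

    Exactly one element of the returned pair is None. Missing values
    (None) are passed through unchanged, which is an assumption.
    """
    converted = []
    for value in values:
        if value is None:
            converted.append(None)
            continue
        try:
            converted.append(target_type(value))
        except (TypeError, ValueError):
            # One failing value makes the whole conversion fail.
            return None, f"cannot convert {value!r} to {target_type.__name__}"
    return converted, None
```

Calling `convert_column(["1", "2", None], int)` succeeds, while `convert_column(["1", "x"], int)` reports a conflict for the value `'x'`.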
If there are missing values
Controls whether missing values in any of the columns cause the validation to fail.
  • Ignore: Missing values in columns are ignored
  • Fail validation: Fails the validation if a column contains missing values
If categoric value is not in the domain
Allows one to optionally validate categoric values in columns against a set of allowed values. This option is only enabled if the reference column defines possible values.
  • Ignore: Categoric values are not validated
  • Fail validation: Fails validation if values are not in the domain of the reference column
  • Replace with missing values: Replaces out-of-domain values with missing values
If numeric value is outside the domain
Checks whether each value lies between the minimum and maximum defined by the domain of the reference column. This option is only enabled if the reference column defines a numeric domain (min/max).
  • Ignore: Numeric values are not validated
  • Fail validation: Fails validation if numeric values are outside domain (min/max)
  • Replace with missing values: Replaces out-of-domain values with missing values
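The categoric and numeric domain checks share the same three outcomes (ignore, fail, replace), so one small Python sketch can cover both. The domain is passed in as a predicate; `None` stands for a missing value. The function name, option strings, and message format are hypothetical, and two details are assumptions rather than documented behavior: missing values are skipped by the check, and the numeric bounds in the usage example are treated as inclusive.

```python
def apply_domain(values, in_domain, on_violation="fail"):
    """Return (checked_values, conflicts).

    on_violation: "ignore" | "fail" | "replace"
    in_domain:    callable returning True for values inside the domain
    """
    checked, conflicts = [], []
    for row, value in enumerate(values):
        if value is None or on_violation == "ignore" or in_domain(value):
            checked.append(value)
        elif on_violation == "replace":
            checked.append(None)  # out-of-domain value becomes missing
        else:
            checked.append(value)
            conflicts.append(f"row {row}: {value!r} is outside the domain")
    return checked, conflicts


# Categoric domain: a set of possible values.
allowed = {"red", "green"}
replaced, _ = apply_domain(["red", "blue"], lambda v: v in allowed, "replace")

# Numeric domain: min/max bounds (inclusive bounds are an assumption).
_, numeric_conflicts = apply_domain([5, 12, None], lambda v: 0 <= v <= 10)
```

In the first call the out-of-domain value `"blue"` is replaced by a missing value; in the second call the value `12` produces one conflict and the missing value is left alone.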
If validation fails
Controls the effect of a failed validation.
  • Fail node: Forces the node to fail if the validation fails, with detailed validation fault descriptions. The data validation is skipped if the structure validation has already failed.
  • Deactivate first output port: Never fails, but deactivates the first output port if the validation fails and outputs the results at the second port. It always performs both structure and data validation.
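The difference between the two failure modes, including the point that 'Fail node' skips data validation once structure validation has failed, can be sketched as follows. This is a simplified model, not the node's code: `ValidationFailed`, `finish_validation`, and the mode strings are hypothetical names, and an inactive port is modeled as `None`. Treating the second port as inactive on success is an assumption based on the output port descriptions.

```python
class ValidationFailed(Exception):
    """Raised in 'fail_node' mode; carries the list of conflicts."""


def finish_validation(table, structure_conflicts, check_data, mode="fail_node"):
    """Return (first_port, second_port); None models an inactive port.

    check_data: callable taking the table and returning data conflicts.
    mode:       "fail_node" | "deactivate_first_port"
    """
    conflicts = list(structure_conflicts)

    if mode == "fail_node":
        # Data validation only runs if the structure is already valid.
        if not conflicts:
            conflicts = check_data(table)
        if conflicts:
            raise ValidationFailed(conflicts)
        return table, None

    # "deactivate_first_port": always run both validation steps.
    conflicts.extend(check_data(table))
    if conflicts:
        return None, conflicts
    return table, None
```

In 'fail_node' mode a caller would see an exception that lists the conflicts; in the other mode the caller inspects which of the two returned values is `None` to learn whether validation succeeded.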

Input Ports

Table to be validated.
Table that is used as the reference.

Output Ports

Table with corrected and validated structure. Depending on the validation result and the 'If validation fails' setting, this port may be inactive.
Table that lists all validation conflicts. Depending on the validation result and the 'If validation fails' setting, this port may be inactive.

Views

This node has no views
