Data Generator

Creates random data containing some clusters for Parallel Universes. The data contains a certain fraction of noise patterns and data that is generated to clusters (all clusters have the same size). The data is normalized in [0, 1].

Options

Cluster Count
The number of clusters for each universe (comma separated).
Universe Sizes
The number of attributes for each universe (comma separated).
Pattern Count
Number of patterns (overall).
Standard Deviation
Std. dev. for each cluster.
Noise Fraction
Fraction of the overall data to be noise, i.e. randomly drawn data. This value must be in [0, 1].
Random Seed
Seed for "deterministic" data generation.

Input Ports

This node has no input ports

Output Ports

Icon
Contains the data with the cluster id as last column
Icon
Contains the cluster centers. The attributes in the universes where the cluster is not located, are filled with missing values.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.