Icon

modelling_​ollist1

Create Time-Based Features

This step loads the raw data, converts a date/time field from text into a real timestamp, and then extracts useful parts such as time components into new columns. The goal is to turn a plain date string into information the workflow can analyze more easily.

Add Frequency Information to Each Record

This step counts how often each category value appears in the data, then joins that count back to every original row. In other words, each record is enriched with a new group size/frequency feature, which can help later analysis or modeling capture how common or rare a value is.

Split Training Data and Add Aggregated Features

The data is first split into a training subset. From that subset, the workflow creates two summary features: one that captures how often a key value appears, and another that summarizes activity across a time-related grouping. These summaries are then joined back to each training row, so each record keeps its original details while gaining extra context about how common its group is and the broader pattern it belongs to.

Clean and Prepare Features for Modeling

This step fills in missing values so the training data is complete, then keeps only the columns needed for the model. Finally, it converts selected numeric fields into text/categorical values, so they will be treated as categories rather than continuous numbers in later modeling.

CSV Reader
String to Date&Time
String to Number
Date&Time Part Extractor
Column Renamer
GroupBy
Joiner
Column Renamer
Column Renamer
Table Partitioner
Column Renamer
Random Forest Learner
GroupBy
Joiner
Joiner
Column Filter
GroupBy
RProp MLP Learner
Number to String
SMOTE
Missing Value
Column Filter

Nodes

Extensions

Links