Icon

workflow-crime-ex4

Data Transformation-Study Crime Frequency by Time of Day

Data Transformation-Study Relationship Between Crime Types and Area

Visualization

Data Exploration

Data Cleaning

Load the 2024 crime dataset
CSV Reader
remove column which is 'optional'
Column Filter
Keep only the Date and Primary Type columns.
Column Filter
Select disorder crimes
Row Filter
Select serious crimes
Row Filter
exclude records with age == 0AND Crm cd missing
Row Filter
Merge disorder andserious counts by Area
Joiner
Remove column with>50% missing val
Missing Value Column Filter
Column Renamer
Remove rows with outliers
Numeric Outliers
Count serious crimes per area.
GroupBy
Remove rows with missing value
Missing Value
Count disorder crimes per community area
GroupBy
Denormalizer
Test if column contains constant val
Constant Value Column Filter
Leave only Date in DATE OCC and put it in a new column named OCCDate
String Manipulation
Leave only Date in Date Rptd and put it in a new column named ReportDate
String Manipulation
Column Renamer
Sort the format for column TIME OCC to new column named OCCTime
String Manipulation
Covert ReportDate and OCCDate columns to Date format
String to Date&Time
Rank the crimes
Sorter
Add the TimeofDay column (Morning, Afternoon...) according to time
Rule Engine
We can see that top 10 crimes already take over 50% of all crimes
Pie Chart
Box Plot
So we pick top 10 crimes only
Row Filter
Expression
Sorter
Statistics View
Table Manipulator
Cluster areas into three profiles
k-Means (deprecated)
Recheck if there are any other missing vales and fix them
Missing Value
Normalizer
Statistics
Table View
Count each crime in each time-of-day category
GroupBy
Keep only Crime Type and Area.
Column Filter
Table View
Table View
Assign color to clusters
Color Manager
Count each crime
GroupBy
Sort categories based on the numeric rank.
Sorter
Bar Chart

Nodes

Extensions

Links