Icon

Knime_​No_​Code_​project2

Data Visualisation

Data Transformation

Data Transformation With data manipulation

Data Cleaning

Data Dictionary and
Number of Rows & columns

Exploratory Data Analysis

Converting Revenue/Revenues_adj and Budget/Budget_adj < 1000 to null value and after Null value to median for the analysis

Data transformation with no data manipulation

Data Loading

I have decided to keep the homepage column for now, even though it’s full of missing values. Usually, we just delete a column with that many missing values, but since the dataset isn't huge, it’s not hurting anything to keep it. I used the Missing Value node to swap the blanks for 'N/A' so the data stays clean. The reason is that if we ever do an AI project later, we could actually use those links for web scraping to grab more info. If it turns out we don't need it for the AI down the road, we can just drop it then.

For the String values, I converted the missing data into 'N/A' and 'Unknown'

NoData Manipulation

TMDB Movies Data Analysis

GroupBy
GroupBy
cost of production
Line Plot
Joiner
Box Office Growth
Line Plot
GroupBy
Joiner
Row Filter
Sorter
Sorter
Budget & Revenue Table
Column Filter
Date&Time Part Extractor
Tope ten director
Bar Chart
GroupBy
Converting into list
Cell Splitter
Tope ten director
Bar Chart
Ungroup
Row Filter
GroupBy
cost of production
Line Plot
GroupBy
Joiner
if budget_adj<1000 it will convert it into null
Math Formula
Row Filter
Sorter
Joiner
Mean budget VS Budget_adj
Line Plot
Ungroup
Box Office Growth
Line Plot
Converting into list
Cell Splitter
Capitalise tagline
String Manipulation
Keywords Table
Column Filter
ROI
Math Formula
Capitalise overview
String Manipulation
Excel Writer
GroupBy
Cell Splitter
Profit
Math Formula
Genres Table
Column Filter
Mean Profit VS Genres
Bar Chart
Cell Splitter
GroupBy
Ungroup
keyword and sum of profit
GroupBy
Row Filter
Sorter
Top Ten Loss making Key words
Table View
Top 10 Profit making Key words
Table View
Top k Row Filter
Budget_adj Vs runtime
Scatter Plot
Length of title VS Revenue_adj
Scatter Plot
Joiner
ROI VS Genres
Box Plot
GroupBy
adding a new column for checking the length of title
String Manipulation
Excel Writer
Excel Writer
On which day the max movies are released
Heatmap
Excel Writer
Top 10 Production Company
Bar Chart
Joiner
Top k Row Filter
Length of title VS Revenue_adj
Scatter Plot
On which day the max movies are released
Heatmap
Top k Row Filter
GroupBy
Joiner
Production_companies Table
Column Filter
Top 10 Production Company
Bar Chart
Joiner
Joiner
Column Filter
Joiner
Popularity, Runtime, vote & Release date Table
Column Filter
Genres Table
Column Filter
adding a new column for checking the length of title
String Manipulation
Top 10 Profit making Key words
Table View
Ungroup
Excel Writer
Cell Splitter
Budget_adj Vs runtime
Scatter Plot
Production_companies Table
Column Filter
Excel Writer
Cell Splitter
Excel Writer
Popularity, Runtime, vote & Release date Table
Column Filter
Joiner
Excel Writer
Extracting data type
Extract Table Spec
Data Dictionary
CSV Writer
Joiner
keyword and sum of profit
GroupBy
Sorter
Histogram
Top k Row Filter
Release year
Box Plot
Row Filter
Statistics View for the the basic information
Statistics View
Converting into list
Cell Splitter
Top Ten Loss making Key words
Table View
Statistics
Budget & Revenue Table
Column Filter
Checking if movie is in Profit or Loss
Rule Engine
Budget_adj Vs revenue_adj
Scatter Plot
Converting into list
Cell Splitter
Date&Time Part Extractor
Main table
Column Filter
Release_year vs runtime
Heatmap
Joiner
Image View
Ungroup
Column Filter
Removing duplicate
Duplicate Row Filter
Cast Table
Column Filter
Converting String to date and time
String to Date&Time
Column Filter
Counting the "|"
String Manipulation
Director Table
Column Filter
Excel Writer
Counting the "|"
String Manipulation
Ungroup
B/w Budget and budget adj
Box Plot
Main table
Column Filter
Ungroup
Cell Splitter
Cell Splitter
Replacing the missing value
Missing Value
Cast Table
Column Filter
Keywords Table
Column Filter
Strip at "|" at keyword
String Manipulation
Joiner
Ungroup
Ungroup
Removed Diacritics from original title
String Manipulation
Genres Counter
Math Formula
Ungroup
Sorter
Column Renamer
if revenue_adj<1000 it will convert it into null
Math Formula
Checking if movie is in Profit or Loss
Rule Engine
Director Table
Column Filter
GroupBy
Replacing missing value with median
Missing Value
if revenue<1000 it will convert it into null
Math Formula
Row Filter
Joiner
Mean budget VS Budget_adj
Line Plot
Replacing missing value with median
Missing Value
Genres Counter
Math Formula
Excel Writer
if budget<1000 it will convert it into null
Math Formula
Excel Writer
Excel Writer
Excel Writer
Excel Writer
Excel Writer
Excel Writer
Profit
Math Formula
GroupBy
ROI
Math Formula
Extracting No. of Rows & column
Extract Table Dimension
ROI VS Genres
Box Plot
No. of Rows & column
CSV Writer
Mean Profit VS Genres
Bar Chart
Loading the CSV file
CSV Reader

Nodes

Extensions

Links