Time Series Analysis
Summary:
In this exercise we will load the data file for cleaning, filtering, aggregating, and visualizations with different time granularity.
Instructions:
1) Execute the CSV Reader node to load in the Energy Usage Data
2) Use a String to Date&Time node to convert the Row ID column to the correct format. The digits in the string pattern are converted correctly, if you write "yyyy-MM-dd_HH" in the date format field, or press the "Guess data type and format button".
3) Use a Column Filter node to remove all columns except the Row ID and Cluster 26, this is what we will analyze
4) Use the Time Stamp Alignment component to check for missing time stamps in the data
5) Connect a Missing Value node next to replace the missing values discovered in the previous step. Try the linear interpolation setting.
6) Use separate Aggregation Granularity components to aggregate the Time series into Hourly, Daily, and Monthly series
7) Use Line Plot nodes to visualize the outputs. Do you see any patterns?