Icon

Airline Workflow

<p><strong>Data Quality Check</strong></p><ul><li><p>Imported airline stock data (AAL, DAL, LUV, UAL), jet fuel prices, and NYSE ARCA Index</p></li><li><p>Conducted data quality check using&nbsp;<strong>Statistics node</strong></p></li><li><p>Focused on identifying&nbsp;<strong>missing values and unique values</strong></p></li><li><p>Evaluated data completeness and consistency before&nbsp;<strong>preprocessing</strong></p></li></ul><p></p><p><strong>Data Preprocessing Overview</strong></p><ul><li><p><em>Stock and ARCA Prices</em></p></li></ul><ol><li><p>Cleaned dataset by removing duplicates and converting variables into numeric format</p></li><li><p>Standardized and cleaned datasets across&nbsp;<strong>consistent weekly base intervals</strong></p></li><li><p>Transformed price data into<strong>&nbsp;return-based</strong>&nbsp;metrics</p></li><li><p>Constructed<strong>&nbsp;volume-weighted average&nbsp;</strong>price and return relative to the market benchmark</p></li><li><p>Generated&nbsp;<strong>lagged variables&nbsp;</strong>to capture delayed market effects</p></li></ol><ul><li><p><em>Fuel Price</em></p></li></ul><ol><li><p>Cleaned dataset by removing duplicates and converting variables into numeric format</p></li><li><p>Transforming raw fuel price data into structured, lagged return-based features for time-series regression.</p></li><li><p>Then, do the -1,-2,-3,-4 lag accordingly ⟶&nbsp;<strong>capture delayed market reactions</strong></p><p></p></li></ol><p><strong>Regression Overview</strong></p><ul><li><p><em>Steps</em></p></li></ul><ol><li><p>Separated models into&nbsp;<strong>Linear</strong>&nbsp;and&nbsp;<strong>Exponential regression frameworks</strong></p></li><li><p>Linear model captures&nbsp;<strong>direct (additive) relationships</strong></p></li><li><p>Exponential model captures&nbsp;<strong>non-linear (multiplicative) effects</strong></p></li><li><p>Applied&nbsp;<strong>multiple lag structures (Lag 1–4)&nbsp;</strong>to test delayed impact</p></li><li><p>Compared model performance to identify the best predictive specification</p></li></ol><ul><li><p><em>Components</em></p></li></ul><ol><li><p>Each regression model is applied across three time periods</p></li><li><p>Full sample: 2019–2025</p></li><li><p>Sub-period&nbsp;<strong>Chow test analysis</strong>: Pre-war and Post-war (2022-8 break)</p></li><li><p>Allows comparison of<strong>&nbsp;model behavior</strong>&nbsp;under different market conditions</p><p></p></li></ol><p></p><p></p>

Data Quality Check

  • Imported airline stock data (AAL, DAL, LUV, UAL), jet fuel prices, and NYSE ARCA Index

  • Conducted data quality check using Statistics node

  • Focused on identifying missing values and unique values

  • Evaluated data completeness and consistency before preprocessing

Data Preprocessing Overview

  • Stock and ARCA Prices

  1. Cleaned dataset by removing duplicates and converting variables into numeric format

  2. Standardized and cleaned datasets across consistent weekly base intervals

  3. Transformed price data into return-based metrics

  4. Constructed volume-weighted average price and return relative to the market benchmark

  5. Generated lagged variables to capture delayed market effects

  • Fuel Price

  1. Cleaned dataset by removing duplicates and converting variables into numeric format

  2. Transforming raw fuel price data into structured, lagged return-based features for time-series regression.

  3. Then, do the -1,-2,-3,-4 lag accordingly ⟶ capture delayed market reactions

Regression Overview

  • Steps

  1. Separated models into Linear and Exponential regression frameworks

  2. Linear model captures direct (additive) relationships

  3. Exponential model captures non-linear (multiplicative) effects

  4. Applied multiple lag structures (Lag 1–4) to test delayed impact

  5. Compared model performance to identify the best predictive specification

  • Components

  1. Each regression model is applied across three time periods

  2. Full sample: 2019–2025

  3. Sub-period Chow test analysis: Pre-war and Post-war (2022-8 break)

  4. Allows comparison of model behavior under different market conditions

Regression Models

Liner Regression

Data Preprocessing

Exponential Regression

Visualization Component

  • Integrated TSA Checkpoint data and GDP data as additional control variables after outlier detection

  • Used visualization to gain initial insights into data patterns

  • Identified trends and potential anomalies

  • Supported behavior understanding before conducting time-series analysis

Outlier Detection Component

  • Outlier detection using the Hampel filter to preserving underlying time-series structure.

  • Preventing the extreme spike that can distort regression results.

Rule Engine
Data Preprocessing for 4 stocks
ln_ret_(^2)
Fuel Price
Excel Reader
Outlier Detection
Data Preprocessing (Oil Price Format)
Pata Preprocessing
Component
AAL
CSV Reader
Column Filter
Linear Lag(-2)
Statistics View
Column Renamer
Counter Generation
Linear Lag(-1)
Joiner
String Manipulation
ARCANYSE BENCH
CSV Reader
Column Filter
Linear Lag(-3)
Statistics View
Column Renamer
String to Date&Time
Visualzation
Component
Intermediate
CSV Writer
Joiner
GDP
CSV Reader
LUV
CSV Reader
Joiner
Data Preprocessing
Expo Lag(-2)
Expo Lag(-1)
Expo Lag(-4)
Expo Lag(-3)
UAL
CSV Reader
Data Before outlier detection
Duplicate Row Filter
Lag Column
Linear Lag(-4)
DAL
CSV Reader
Column Renamer
Deta for Regression
Column Filter
Rule Engine
Row Filter
Transportation Security Administration Checkpoint
Excel Reader

Nodes

Extensions

Links