Icon

Final Project

For my final project, I chose to use the "Electric & Alternative Fuel Vehicles US [2022]" dataset foundon Kaggle's database, located at https://www.kaggle.com/datasets/saketpradhan/alternative-fuel-vehicles-in-the-us. Downloading the dataset provides several .csv files, however I chose to use the "Alternative FuelVehicles US.csv" database and resave the file as an .xlsx file for a more applicable filetype, except for theData Manipulation examples, where multiple tables will be combined. For this project, I am proposing to display several different methods of viewing, cleansing,manipulation, displaying, and visualizing that were discussed throughout the semester, with each methodbeing labeled as such. Data Cleansing Techniques Data Manipulation Techniques Data Vasualization Techniques Data Science Model Techniques I had several issues with the data science models with this dataset. I understandthe logic behind how each represented works, but getting the first model tofunction properly was extremely difficult Filtered to 2021 ModelsCars with an enginecylinder count between6 and 8Alternative Fuel VehiclesAverage fuel economyof each manufacturerusing conventional fuelPie chart of each manufacturerand the number of modelsthey makeCombines databasesbased on the Manufacturer columnLight Duty VehiclesBoth databasesmerged into oneCreates ConfusionMatrix due tomissing valuesReplaces each missingvalue in the dataset with a 0Excel Reader Column Filter Row Filter Excel Reader Excel Reader Excel Reader Bar Chart Pie/Donut Chart Joiner Excel Reader Concatenate DecisionTree Learner Decision TreePredictor Excel Reader Scorer (JavaScript) Missing Value Missing Value Linear RegressionLearner RegressionPredictor Numeric Scorer Missing Value For my final project, I chose to use the "Electric & Alternative Fuel Vehicles US [2022]" dataset foundon Kaggle's database, located at https://www.kaggle.com/datasets/saketpradhan/alternative-fuel-vehicles-in-the-us. Downloading the dataset provides several .csv files, however I chose to use the "Alternative FuelVehicles US.csv" database and resave the file as an .xlsx file for a more applicable filetype, except for theData Manipulation examples, where multiple tables will be combined. For this project, I am proposing to display several different methods of viewing, cleansing,manipulation, displaying, and visualizing that were discussed throughout the semester, with each methodbeing labeled as such. Data Cleansing Techniques Data Manipulation Techniques Data Vasualization Techniques Data Science Model Techniques I had several issues with the data science models with this dataset. I understandthe logic behind how each represented works, but getting the first model tofunction properly was extremely difficult Filtered to 2021 ModelsCars with an enginecylinder count between6 and 8Alternative Fuel VehiclesAverage fuel economyof each manufacturerusing conventional fuelPie chart of each manufacturerand the number of modelsthey makeCombines databasesbased on the Manufacturer columnLight Duty VehiclesBoth databasesmerged into oneCreates ConfusionMatrix due tomissing valuesReplaces each missingvalue in the dataset with a 0Excel Reader Column Filter Row Filter Excel Reader Excel Reader Excel Reader Bar Chart Pie/Donut Chart Joiner Excel Reader Concatenate DecisionTree Learner Decision TreePredictor Excel Reader Scorer (JavaScript) Missing Value Missing Value Linear RegressionLearner RegressionPredictor Numeric Scorer Missing Value

Nodes

Extensions

Links