Join and Clean Spreadsheets
This workflow demonstrates how to join two spreadsheets from two separate Excel files and how to clean the merged data table. The cleaned dataset is eventually exported to an external file.
In this example, we access two datasets:
The athlete event results of the 1896-2020 Summer Olympic Games
The bio information of the athletes who participated in any of the Summer or Winter Olympics.
The aim is to keep only the information of those athletes in the bio information sheet who have participated in at least one edition of the Summer Olympic Games. We start by joining the two data tables and then continue only with the right unmatched rows, i.e., the athletes that are not contained in the athlete event results of the Summer Olympics. We filter the bio information sheet accordingly and lastly write the cleaned data table to a new Excel file.
For a detailed overview of each node in this workflow, refer to the workflow description in the Info panel.
💡 To view each node's configuration, select the node and see the configuration pane on the right side of the workflow editor.