Icon

02 Data Cleaning

Task 1: Filter rows1. Read the data by executing the provided reader nodes2. Filter the airport data to airports in China3. Filter the airport data to latitude where the overall is at least 10 or more4. Filter the routes to only have a 3U. Task 2: Filter columns1. Read the life expectancy data from the SQLite database by executing the providedworkflow2. Exclude the Rank column- Manually- By including only string and double type columns Task 3: Manipulate strings1. Read the travel advisories data by executing the provided CSV Reader node2. Replace the colons in the Level column by hyphen like this: "Level 1- Exercise normalprecautions"3. Extract the risk levels as numbers into a separate column. You can use, for example, thesubstr() function.4. Complete the tasks above with the Column Expressions node Task 4: Evaluate mathematical expressions and apply rules1. Read the population data by executing the provided Excel Reader node2. Create a new column "Change" by subtracting the population values in 2018 from thepopulation values in 20193. Convert the absolute population change values into percentages: Divide them by the populationvalues in 2018. Multiply the result by 100.4. Create a categorical column with two values "increasing"/"decreasing" based on whether thechange is positive or negative AirportsOnly ChinaLatitude is more then 10AirportsOnly use 3URoutesAirportsRemove Latitude and Longitude ManuallyType Selection String and NumberAirportsreplaced / in TZ database timezone with -Used substr to change OurAirport to AirportApplied both substr and replace functionsLatitude - LongitudeAirportsShow percentage changeShow if there is a increase or decrease CSV Reader Row Filter Row Filter CSV Reader Row Filter CSV Reader CSV Reader Column Filter Column Filter CSV Reader String Manipulation String Manipulation Column Expressions Math Formula CSV Reader Math Formula Rule Engine Task 1: Filter rows1. Read the data by executing the provided reader nodes2. Filter the airport data to airports in China3. Filter the airport data to latitude where the overall is at least 10 or more4. Filter the routes to only have a 3U. Task 2: Filter columns1. Read the life expectancy data from the SQLite database by executing the providedworkflow2. Exclude the Rank column- Manually- By including only string and double type columns Task 3: Manipulate strings1. Read the travel advisories data by executing the provided CSV Reader node2. Replace the colons in the Level column by hyphen like this: "Level 1- Exercise normalprecautions"3. Extract the risk levels as numbers into a separate column. You can use, for example, thesubstr() function.4. Complete the tasks above with the Column Expressions node Task 4: Evaluate mathematical expressions and apply rules1. Read the population data by executing the provided Excel Reader node2. Create a new column "Change" by subtracting the population values in 2018 from thepopulation values in 20193. Convert the absolute population change values into percentages: Divide them by the populationvalues in 2018. Multiply the result by 100.4. Create a categorical column with two values "increasing"/"decreasing" based on whether thechange is positive or negative AirportsOnly ChinaLatitude is more then 10AirportsOnly use 3URoutesAirportsRemove Latitude and Longitude ManuallyType Selection String and NumberAirportsreplaced / in TZ database timezone with -Used substr to change OurAirport to AirportApplied both substr and replace functionsLatitude - LongitudeAirportsShow percentage changeShow if there is a increase or decrease CSV Reader Row Filter Row Filter CSV Reader Row Filter CSV Reader CSV Reader Column Filter Column Filter CSV Reader String Manipulation String Manipulation Column Expressions Math Formula CSV Reader Math Formula Rule Engine

Nodes

Extensions

Links