Icon

06 Aggregations

Exercise for data aggregation.

Calculate summary statistics for subgroups of data with the GroupBy and Pivoting nodes








Exercise: GroupBy1) Read the adult.csv file by executing the File Reader node2) Calculate the total number of rows and average age by gender3) Calculate the modes of all string columns separately for each native country4) Calculate - the number of missing values in the occupation column- the number of non-missing rows in the occupation column- the number of rows in the occupation column- the number of rows in the marital-status column Notice that the last two aggregations should provide the same numbers! Exercise: Pivoting1) Read the adult_binned.csv file by executing the File Reader node2) Calculate the number of people in groups according to their work class and age bin- What is the most common combination of age bin and work class?age 34 or less and Private- How many people belong to this group?109363) Calculate the mode of education level in groups according to their work class and age bin- What is the most widespread education level in the private workclass independently of theage bin?9th Grade number 2number 4sum of people in groupsRead data adult.csvmodesRead data adult_binned.csveducation mode GroupBy GroupBy Pivoting File Reader GroupBy File Reader Pivoting Exercise: GroupBy1) Read the adult.csv file by executing the File Reader node2) Calculate the total number of rows and average age by gender3) Calculate the modes of all string columns separately for each native country4) Calculate - the number of missing values in the occupation column- the number of non-missing rows in the occupation column- the number of rows in the occupation column- the number of rows in the marital-status column Notice that the last two aggregations should provide the same numbers! Exercise: Pivoting1) Read the adult_binned.csv file by executing the File Reader node2) Calculate the number of people in groups according to their work class and age bin- What is the most common combination of age bin and work class?age 34 or less and Private- How many people belong to this group?109363) Calculate the mode of education level in groups according to their work class and age bin- What is the most widespread education level in the private workclass independently of theage bin?9th Grade number 2number 4sum of people in groupsRead data adult.csvmodesRead data adult_binned.csveducation mode GroupBy GroupBy Pivoting File Reader GroupBy File Reader Pivoting

Nodes

Extensions

Links