extra 06 Aggregations

06 Aggregations
Exercise: GroupBy
1) Read the adult.csv file by executing the CSV Reader node
2) Calculate the total number of rows and average age by gender
3) Calculate the modes of all string columns separately for each native country
4) Calculate
- the number of missing values in the occupation column
- the number of non-missing rows in the occupation column
- the number of rows in the occupation column
- the number of rows in the marital-status column
Notice that the last two aggregations should provide the same numbers!

Exercise: Pivoting
1) Read the adult_binned.csv file by executing the CSV Reader node
2) Calculate the number of people in groups according to their work class and age bin
- What is the most common combination of age bin and work class?
- How many people belong to this group?
3) Calculate the mode of education level in groups according to their work class and age bin
- What is the most widespread education level in the private workclass independently of the age bin?

Extra tasks on Wa Fn UseCTelco Customerchurn.csv:
- total rows and average monthly bill by gender
- number of missing and non-missing values in seniorcitizens, and total rows in seniorcitizens/tenure
- table:
* seniorcitizen as a group
* tenure as a pivot
* calculate the number of people in groups
- modes of string columns for each Internet service
- table:
* gender as a group
* senior citizen as a pivot
* find the most widespread level of internet service

Workflow nodes: CSV Reader, GroupBy, Pivoting
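
For readers who want to check their KNIME results outside the workflow, the GroupBy and Pivoting exercises above can be approximated in pandas. This is only a rough sketch, not the workflow's own implementation: the rows are toy data invented for illustration, the column names (`sex`, `age_bin`, and so on) are assumptions loosely based on the adult dataset, and `groupby`/`crosstab` stand in for the GroupBy and Pivoting nodes.

```python
import pandas as pd

# Toy rows invented for illustration; column names are assumed, not read from adult.csv.
df = pd.DataFrame({
    "sex": ["Male", "Female", "Female", "Male", "Female"],
    "age": [39, 28, 45, 52, 31],
    "occupation": ["Sales", None, "Tech-support", "Sales", None],
    "marital-status": ["Married", "Single", "Married", "Divorced", "Single"],
    "workclass": ["Private", "Private", "State-gov", "Private", "Private"],
    "age_bin": ["30-40", "20-30", "40-50", "50-60", "30-40"],
})

# GroupBy analog: total number of rows and average age per gender.
by_gender = df.groupby("sex").agg(rows=("age", "size"), mean_age=("age", "mean"))

# Missing, non-missing and total row counts. The total row count does not
# depend on the column, so the last two values must agree.
occupation_counts = {
    "missing": int(df["occupation"].isna().sum()),
    "non_missing": int(df["occupation"].count()),
    "rows_occupation": len(df["occupation"]),
    "rows_marital_status": len(df["marital-status"]),
}

# Mode of a string column per group (the first mode is taken if there is a tie).
mode_by_workclass = df.groupby("workclass")["marital-status"].agg(
    lambda s: s.mode().iat[0]
)

# Pivoting analog: work class as the group column, age bin as the pivot column,
# and the number of people in each cell.
counts = pd.crosstab(df["workclass"], df["age_bin"])

# Most common (work class, age bin) combination and its size.
combo_sizes = df.groupby(["workclass", "age_bin"]).size()
top_combo = combo_sizes.idxmax()
top_count = int(combo_sizes.max())

print(by_gender)
print(occupation_counts)
print(mode_by_workclass)
print(counts)
print(top_combo, top_count)
```

To run the same steps on real data, the toy frame could be replaced with `pd.read_csv("adult.csv")`, adjusting the column names to whatever the file actually contains.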
