main-wf

The main workflow for the #66daysofdata challenge on Twitter.

This workflow will be continuously updated as I work through the #66daysofdata challenge.

My progress: https://twitter.com/humanoid_ivan

Workflow sections:
  Read the datasets
  Data transformation and validation
  Visualisations

Step annotations:
  artist-uris.csv (artist IDs with names)
  artist.csv (only "artists" and "popularity")
  tracks.csv
  look at the statistical measures of tracks.csv
  fix partial dates; convert to Date&Time
  append a new column: collection of strings (split id_artists)
  create a separate row for each artist ID in the collection
  convert "popularity" from double to int
  remove "spotify:artist:" from artist ID strings
  join artist ID with popularity
  join artists with tracks
  clean id column
  remove redundant columns
  aggregate track count by artist
  top 20 artists by number of tracks
  add gradient to signify track count
  sort by track count
  prettify mean popularity

Nodes and components on the canvas:
  CSV Reader
  CSV Reader
  File Reader (Complex Format)
  Data Explorer
  Histograms
  Transform release-date
  Cell Splitter
  Ungroup
  Double To Int
  String Manipulation
  Joiner
  Joiner
  String Manipulation
  Column Filter
  GroupBy
  Top k Selector
  Column Rename
  Color Manager
  View top artists
  Sorter
  Round Double
  Sampling
  Scatter Plots
  Sunchart + Heatmap
  Box Plots
  Max years of activity
  Line Plots & Stacked Area
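For readers who do not have KNIME at hand, the transformation steps annotated in the workflow (fix partial dates, split id_artists, one row per artist ID, strip the "spotify:artist:" prefix, join, count tracks per artist, keep the top 20) can be roughly approximated in pandas. This is only a sketch, not the workflow's actual implementation. Everything except the column name id_artists and the "spotify:artist:" prefix is an assumption: the tiny inline tables, the column names release_date, artist_uri and artist_name, the bracketed-string format of id_artists, and the choice to pad partial dates with January and the first day of the month.

```python
import pandas as pd

# Hypothetical miniature stand-ins for tracks.csv and artist-uris.csv.
# Only "id_artists" is a column name taken from the workflow annotations.
tracks = pd.DataFrame({
    "release_date": ["1922", "1985-06", "2001-03-15"],
    "id_artists": ["['a1', 'a2']", "['a1']", "['a3']"],
})
artist_uris = pd.DataFrame({
    "artist_uri": ["spotify:artist:a1", "spotify:artist:a2", "spotify:artist:a3"],
    "artist_name": ["Alpha", "Beta", "Gamma"],
})


def pad_partial_date(value: str) -> str:
    """Pad 'YYYY' or 'YYYY-MM' to a full date (assumed padding rule)."""
    if len(value) == 4:
        return value + "-01-01"
    if len(value) == 7:
        return value + "-01"
    return value


# "fix partial dates; convert to Date&Time"
tracks["release_date"] = pd.to_datetime(
    tracks["release_date"].map(pad_partial_date), format="%Y-%m-%d"
)

# "split id_artists": turn the bracketed string into a list of IDs
# (assumes IDs are quoted and separated by ", ").
tracks["artist_id"] = (
    tracks["id_artists"]
    .str.strip("[]")
    .str.replace("'", "", regex=False)
    .str.split(", ")
)

# "create a separate row for each artist ID in the collection"
per_artist = tracks.explode("artist_id")

# 'remove "spotify:artist:" from artist ID strings'
artist_uris["artist_id"] = artist_uris["artist_uri"].str.replace(
    "spotify:artist:", "", regex=False
)

# "join artists with tracks"
joined = per_artist.merge(
    artist_uris[["artist_id", "artist_name"]], on="artist_id", how="inner"
)

# "aggregate track count by artist", then "top 20 artists by number of tracks"
track_counts = joined.groupby("artist_name").size().reset_index(name="track_count")
top_artists = track_counts.nlargest(20, "track_count")

print(top_artists)
```

With the real CSV files, the two inline tables would be replaced by pd.read_csv calls, and the column names would need to be adjusted to match the actual headers.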
