Icon

Challenge

Import DatasetsThese two nodes basically allow us to import ourchosen datasets; the top one contains data aboutgoogle play store applications (ratings, number ofreviews, free/not free, etc.), while the bottom onecontains a variable number of review for some of theapplications contained in the first dataset.If you're having issue opening the csv files:Drag and drop googleplaystore.csvSelect "Support short data rosw" (because of Nanvalues)Go to advanced setting and Change Limit of rows to"11,000"Drag and drop users_reviews, go to advanced settingsand change the Limit of rows to "65,000"You can find these dataset wether in the folders joinedto the workflow or on kaggle :https://www.kaggle.com/lava18/google-play-store-apps?select=googleplaystore.csv Exploratory and Explanatory AnalysisIn this part we will investigate some relationships of interest between the non-textual data appearing in ourdataset. The most interesting one concerns Rating scores and number of Installs. Read string document from googleplaystore_user_reviews.csvRead string document fromgoogleplaystoreFiltering outsome noisy dataPlease openColor by sentimentlabelAccuracyPlease openPlease openPlease openPlease openPlease open CSV Reader CSV Reader Column Filter To Documents Document Cleanup Mining Sentiment TAGCLOUD Variableoperationalization Color Manager Scorer Extra datacleanup/processing Price and Rating Rating andsentiment Scatter Plots Price and Sentiment Rating and Installs Import DatasetsThese two nodes basically allow us to import ourchosen datasets; the top one contains data aboutgoogle play store applications (ratings, number ofreviews, free/not free, etc.), while the bottom onecontains a variable number of review for some of theapplications contained in the first dataset.If you're having issue opening the csv files:Drag and drop googleplaystore.csvSelect "Support short data rosw" (because of Nanvalues)Go to advanced setting and Change Limit of rows to"11,000"Drag and drop users_reviews, go to advanced settingsand change the Limit of rows to "65,000"You can find these dataset wether in the folders joinedto the workflow or on kaggle :https://www.kaggle.com/lava18/google-play-store-apps?select=googleplaystore.csv Exploratory and Explanatory AnalysisIn this part we will investigate some relationships of interest between the non-textual data appearing in ourdataset. The most interesting one concerns Rating scores and number of Installs. Read string document from googleplaystore_user_reviews.csvRead string document fromgoogleplaystoreFiltering outsome noisy dataPlease openColor by sentimentlabelAccuracyPlease openPlease openPlease openPlease openPlease open CSV Reader CSV Reader Column Filter To Documents Document Cleanup Mining Sentiment TAGCLOUD Variableoperationalization Color Manager Scorer Extra datacleanup/processing Price and Rating Rating andsentiment Scatter Plots Price and Sentiment Rating and Installs

Nodes

Extensions

Links