Icon

BasketballCleaningHubversion 1.4-1

Recalculate "minutes" and "seconds" columnsinto a single column, counted in seconds.Changes the match up column to clarify if it isa home or away game. Remove redundant/uselesscolumns, reorder them moreintuitively. Sort for only rows that havesuccessful or unsuccessful shots,without unknowns. Sort for only rows that have unknown shots. OneCSV is only shot ID and column for eval, one isthe full data for the unknowns. Read in the initial data.csv Afterthought: This convertseverything that's not a number to anumber. Another node can beattached to convert back. Data cleaning: visualize what we have (statistics), delete rows with missingvalues, delete rows with outliers of the features selected, normalize data ofthe features selected, visualize a matriz with linear correlation, partitioningdata for training and testing 80-20 Random forest model: number to string to be able to take the shot_made_flagfeature as target, apply the random forest learner and predictor to the features(numerical only) and score the results. Magic happens.A bunch of models. Some require doubles, so I recast some variables that way. We should consider dropping some columns basedupon the correlation matrix above. Basic exploratory Analysis of data:classes of interest look fairlybalanced Node 1Node 2Node 3Node 4Node 5Node 6Node 7Node 8Node 9Node 10Node 11Node 12Node 13Node 14Node 15Node 16Node 17Node 18Node 19Node 20Node 21Node 23Node 24Node 25Node 26Node 27Node 28Node 29Node 30Node 31Node 32Node 33Node 34Node 35Node 36Node 37Node 41Node 42Node 43Node 47Node 48Node 49Node 52 CSV Reader Math Formula Cell SplitterBy Position Rule Engine Column Filter Column Resorter Row Filter Row Filter CSV Writer CSV Writer Column Filter CSV Writer Category To Number Statistics Missing Value Numeric Outliers Normalizer Partitioning Linear Correlation Random ForestLearner Random ForestPredictor Number To String Scorer RProp MLP Learner MultiLayerPerceptronPredictor Scorer Column Rename Partitioning Denormalizer Normalizer PNN Learner (DDA) PNN Predictor Scorer Gradient BoostedTrees Learner Gradient BoostedTrees Predictor Scorer Fuzzy Rule Learner Fuzzy RulePredictor Scorer SVM Learner SVM Predictor Scorer InteractiveHistogram (local) Recalculate "minutes" and "seconds" columnsinto a single column, counted in seconds.Changes the match up column to clarify if it isa home or away game. Remove redundant/uselesscolumns, reorder them moreintuitively. Sort for only rows that havesuccessful or unsuccessful shots,without unknowns. Sort for only rows that have unknown shots. OneCSV is only shot ID and column for eval, one isthe full data for the unknowns. Read in the initial data.csv Afterthought: This convertseverything that's not a number to anumber. Another node can beattached to convert back. Data cleaning: visualize what we have (statistics), delete rows with missingvalues, delete rows with outliers of the features selected, normalize data ofthe features selected, visualize a matriz with linear correlation, partitioningdata for training and testing 80-20 Random forest model: number to string to be able to take the shot_made_flagfeature as target, apply the random forest learner and predictor to the features(numerical only) and score the results. Magic happens.A bunch of models. Some require doubles, so I recast some variables that way. We should consider dropping some columns basedupon the correlation matrix above. Basic exploratory Analysis of data:classes of interest look fairlybalanced Node 1Node 2Node 3Node 4Node 5Node 6Node 7Node 8Node 9Node 10Node 11Node 12Node 13Node 14Node 15Node 16Node 17Node 18Node 19Node 20Node 21Node 23Node 24Node 25Node 26Node 27Node 28Node 29Node 30Node 31Node 32Node 33Node 34Node 35Node 36Node 37Node 41Node 42Node 43Node 47Node 48Node 49Node 52CSV Reader Math Formula Cell SplitterBy Position Rule Engine Column Filter Column Resorter Row Filter Row Filter CSV Writer CSV Writer Column Filter CSV Writer Category To Number Statistics Missing Value Numeric Outliers Normalizer Partitioning Linear Correlation Random ForestLearner Random ForestPredictor Number To String Scorer RProp MLP Learner MultiLayerPerceptronPredictor Scorer Column Rename Partitioning Denormalizer Normalizer PNN Learner (DDA) PNN Predictor Scorer Gradient BoostedTrees Learner Gradient BoostedTrees Predictor Scorer Fuzzy Rule Learner Fuzzy RulePredictor Scorer SVM Learner SVM Predictor Scorer InteractiveHistogram (local)

Nodes

Extensions

Links