KNIME_textanalysis_group_project_PA_final

Workflow annotations

Data Import and Cleaning
Note: at the beginning of each true text there was the name of the city and of the publication that issued the story. We removed the start of each true text so that the models could not fit perfectly on those words.

Tag Cloud Representation

One-hot Encoding

Word2Vec Embeddings
CBOW Algorithm
Skip-Gram Algorithm

Note: due to errors in punctuation erasure, manual filtering of faulty word columns was needed.

Labeled nodes: Import Fake Dataset, Import True Dataset, Using Word Vector Apply, training on full training set.

Node types used: CSV Reader, Value Counter, Constant Value Column, Concatenate, Data Cleaning, String Manipulation, Document Data Extractor, Filtered Bag of Words, Document Vector, Low Variance Filter, Tag Clouds, Row Sampling, Partitioning, Column Filter, Column Appender, Word2Vec Learner, Word Embedding, Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, H2O Gradient Boosting.
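
The note on removing the start of each true text can be illustrated outside KNIME with a minimal Python sketch. It assumes, purely for illustration, that the leading part has the shape "CITY (Outlet) - story"; that shape, the 120-character window, and the name `strip_dateline` are hypothetical and are not taken from the workflow, which performs its cleaning with KNIME nodes.

```python
import re

# Assumed prefix shape: "<CITY> (<Outlet>) - <story>".
# The pattern and the 120-character window are illustrative assumptions.
DATELINE = re.compile(r"^.{0,120}?\)\s*-\s+")


def strip_dateline(text: str) -> str:
    """Drop a leading 'CITY (Outlet) - ' prefix if one is present."""
    return DATELINE.sub("", text, count=1)


samples = [
    "WASHINGTON (Reuters) - The Senate voted on Tuesday.",
    "No dateline here, so the text is left unchanged.",
]
cleaned = [strip_dateline(s) for s in samples]
```

Texts without a matching prefix pass through unchanged, so the same function can be applied to both the true and the fake dataset before they are concatenated.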
