02_Document Vector Creation
Document Vector Creation

Here we transform the collection of documents into numerical vectors. The dataset used in this example is the KNIME Forum Dataset. After the pre-processing phase, the relative term frequency is computed for each term inside the Transformation component. The input data set is partitioned into a training set and a test set. The term frequencies from the training set are used to build a vector representation of the distinct terms identified by the BoW with a Document Vector node. The same Document Vector transformation is then applied to the documents in the test set.

Workflow annotations:
Document Vector Creator / Applier
Transform collection of documents to vector space
POS tagging, Number filter, Punctuation Erasure, Stop Word Filter, Case Converter, Snowball Stemmer, Tag Filter
apply DocumentVector transformation

Node and component labels, as they appear in the workflow:
Reading Data, Enrichment, Pre-processing, Transformation, Partitioning, Transformation, Document Vector, DocumentVector Applier
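The create-then-apply pattern described above can be illustrated outside KNIME with a minimal plain-Python sketch. This is not the implementation of the Document Vector or Document Vector Applier nodes; the function names and the toy token lists are hypothetical, and the documents are assumed to be already pre-processed into lists of terms. The sketch only shows the idea: the vector columns are fixed by the distinct terms of the training set, each cell holds a relative term frequency, and the same columns are reused for the test set.

```python
from collections import Counter


def relative_term_frequencies(tokens):
    """Map each term to its count divided by the number of tokens in the document."""
    counts = Counter(tokens)
    total = len(tokens)
    if total == 0:
        return {}
    return {term: n / total for term, n in counts.items()}


def create_vocabulary(train_docs):
    """Collect the distinct terms of the training documents (hypothetical helper)."""
    return sorted({term for doc in train_docs for term in doc})


def apply_vocabulary(docs, vocabulary):
    """Build one vector per document, with one column per vocabulary term."""
    vectors = []
    for doc in docs:
        tf = relative_term_frequencies(doc)
        # Terms missing from the document get 0.0; terms outside the
        # vocabulary (unseen during training) are dropped.
        vectors.append([tf.get(term, 0.0) for term in vocabulary])
    return vectors


# Toy data standing in for pre-processed forum posts (an assumption).
train_docs = [["node", "error", "node"], ["workflow", "error"]]
test_docs = [["node", "python", "workflow", "workflow"]]

vocabulary = create_vocabulary(train_docs)               # fitted on training data only
train_vectors = apply_vocabulary(train_docs, vocabulary)
test_vectors = apply_vocabulary(test_docs, vocabulary)   # same columns reused

print(vocabulary)
print(test_vectors)
```

In this sketch the term "python" occurs only in the test document, so it has no column in the resulting vector. Fitting the columns on the training set alone keeps the training and test vectors aligned and avoids leaking test-set terms into the representation.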
