Icon

justKnimeit-15

justKnimeit-15
Challenge 15: Extracting a Table from a PDFGiven a text-based PDF document with a table, can you partially extract the table into a KNIME data table for further analysis? Forthis challenge we will extract the table from this PDF document and attempt to partially reconstruct it within KNIME. Thecorresponding KNIME table should contain the following columns: Day, Max, Min, Norm, Depart, Heat, and Cool.Note 1: Your final output should be a table, not a single row with all the relevant data. Note 2: The Tika Parser node is better suited for this task than the PDF Parser node. We completed this task withoutcomponents, regular expressions, or code-snippet nodes. In fact, our solution has a total of 10 nodes, but labeling the columnsrequired a bit of manual effort. Read PDFContentExtract TableAdd Row CountMultiplyContentRemoveUnwantedClear HeadersAdd HeadersPick Related RowCreate TableStructureRemove MissingRows Tika Parser Column Expressions Column Expressions One Row to Many Column Filter Transpose RowID Transpose Column Expressions Cell Splitter Missing Value Challenge 15: Extracting a Table from a PDFGiven a text-based PDF document with a table, can you partially extract the table into a KNIME data table for further analysis? Forthis challenge we will extract the table from this PDF document and attempt to partially reconstruct it within KNIME. Thecorresponding KNIME table should contain the following columns: Day, Max, Min, Norm, Depart, Heat, and Cool.Note 1: Your final output should be a table, not a single row with all the relevant data. Note 2: The Tika Parser node is better suited for this task than the PDF Parser node. We completed this task withoutcomponents, regular expressions, or code-snippet nodes. In fact, our solution has a total of 10 nodes, but labeling the columnsrequired a bit of manual effort. Read PDFContentExtract TableAdd Row CountMultiplyContentRemoveUnwantedClear HeadersAdd HeadersPick Related RowCreate TableStructureRemove MissingRows Tika Parser Column Expressions Column Expressions One Row to Many Column Filter Transpose RowID Transpose Column Expressions Cell Splitter Missing Value

Nodes

Extensions

Links