Icon

justknimeit-15

justknimeit-15
Just KNIME It!Challenge 15: Extracting a Table from a PDFLevel: HardDescription: Given a text-based PDF document with a table, can you partially extract the table into a KNIME data table for further analysis? For this challenge wewill extract the table from this PDF document and attempt to partially reconstruct it within KNIME. The corresponding KNIME table should contain the followingcolumns: Day, Max, Min, Norm, Depart, Heat, and Cool. Note 1: Your final output should be a table, not a single row with all the relevant data. Note 2: The TikaParser node is better suited for this task than the PDF Parser node. We completed this task without components, regular expressions, or code-snippet nodes. Infact, our solution has a total of 10 nodes, but labeling the columns required a bit of manual effort. importExtraction of table data partandConvert two consecutive spaces into one space transposeSplit by line breakSplit by blankChange ROW ID to be column title transposetransposeFilter only the columns neededString to number Tika Parser String Manipulation Transpose Cell Splitter Cell Splitter RowID Transpose Transpose Column Filter String To Number Just KNIME It!Challenge 15: Extracting a Table from a PDFLevel: HardDescription: Given a text-based PDF document with a table, can you partially extract the table into a KNIME data table for further analysis? For this challenge wewill extract the table from this PDF document and attempt to partially reconstruct it within KNIME. The corresponding KNIME table should contain the followingcolumns: Day, Max, Min, Norm, Depart, Heat, and Cool. Note 1: Your final output should be a table, not a single row with all the relevant data. Note 2: The TikaParser node is better suited for this task than the PDF Parser node. We completed this task without components, regular expressions, or code-snippet nodes. Infact, our solution has a total of 10 nodes, but labeling the columns required a bit of manual effort. importExtraction of table data partandConvert two consecutive spaces into one space transposeSplit by line breakSplit by blankChange ROW ID to be column title transposetransposeFilter only the columns neededString to number Tika Parser String Manipulation Transpose Cell Splitter Cell Splitter RowID Transpose Transpose Column Filter String To Number

Nodes

Extensions

Links