Reads an mbox file and creates a table with a row for each single message within the file.
The parser is based on Apache James Mime4J. Technical specification of the mbox format can be found in: RFC4155: The application/mbox Media Type and mbox manpage.
The result is provided as binary objects (BLOB), which can be further processed by the “mbox Message Extractor” and “mbox Header Extractor” nodes.
^From \\S+@\\S.*\\d{4}$
is more strict and will only match “From” lines which contain an
@ character. The second option ^From \\S+.*\\d{4}$
will also match “From” lines without @ – use this e.g. if
you need to parse Thunderbird mbox content.You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension mbox Nodes from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!