Load text-based files

Loads text-based files into a new table column from a column of URLs or full filepaths. Each file is added in its entirety to a single multi-line String cell in a new column added to the output table. The column can then be re-typed (e.g. to mol, mol2, PDB etc.) as required.

File encoding is attempted as follows when the ' Guess ' option is selected:

  1. Firstly, the URL connection to the supplied file is inspected for encoding information.
  2. If none is available, then the first 4 bytes of the file are inspected for any BOM present, and if so, the following are recognised: UTF-8, UTF-16 (Big- and Little Endian), UTF-32 (Big- and Little Endian)
  3. Finally, the default (UTF-8) is assumed. As UTF-8 is not required to provide a BOM, this is a reasonable guess in most cases.
A console INFO entry is added for each file format detected, and a WARN entry added when the default is used because none could be detected.

This node was developed by Vernalis (Cambridge, UK) . For feedback and more information, please contact knime@vernalis.com

Options

Select filepath column
Select the column containing the paths or URLs to the files
Remove input column
If checked, the input column containing the paths is removed
Select file encoding
Select the required file encoding method. The default 'Guess' option performs as described above. Choosing other options may result in nonsense being returned
Txt Column name
Enter a name for the column containing the loaded files

Input Ports

Icon
Input table containing the filepath or URL column

Output Ports

Icon
Output table with the loaded files added

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.