Decompress Files

This node unpacks and decompresses files from an archive file. The paths to the extracted files are provided in the output table using a path column.

Supported archive formats are:

  • .zip
  • .jar
  • .tar
  • .tar.gz
  • .tar.bz2
  • .cpio
  • .ar
  • .gz / .gzip

Note: This node cannot decompress KNIME artifacts such as workflows. In order to decompress workflows please use a combination of Workflow Reader and Workflow Writer instead.

This node can access a variety of different file systems. More information about file handling in KNIME can be found in the official File Handling Guide.

Options

File
Enter a path to an archive file to be decompressed. The required syntax of a path depends on the chosen file system, such as "C:\path\to\file" (Local File System on Windows) or "/path/to/file" (Local File System on Linux/MacOS and Mountpoint). For file systems connected via input port, the node description of the respective connector node describes the required path format. You can also choose a previously selected file from the drop-down list, or select a file from the "Browse..." dialog. Note that browsing is disabled in some cases.
Folder
Enter a valid directory path where the files should be extracted to. The required syntax of a path depends on the chosen file system, such as "C:\path\to\folder" (Local File System on Windows) or "/path/to/folder" (Local File System on Linux/MacOS and Mountpoint). For file systems connected via input port, the node description of the respective connector node describes the required path format. You can also choose a previously selected folder from the drop-down list, or select a location from the "Browse..." dialog.
Create missing folders
If enabled, missing folders in the specified path will be created automatically.
If exists
Specify the behavior of the node in case the output file already exists.
  • Overwrite: Will replace any existing file.
  • Ignore: Will ignore if a file already exists and continues the copying process.
  • Fail: Will issue an error during the node's execution (to prevent unintentional overwrite).
Guess file name encoding from archive file extension
If selected, tries to guess the encoding for the names of the files to decompress based on file extension of the archive file. For example, it will use UTF-8 for .zip files, CP437 for .arj, and US-ASCII for .cpio. Unfortunately, the encoding cannot always be correctly guessed. If you notice that the names of decompressed files contain '?' characters, then the encoding was wrongly guessed. In this case, uncheck the box and pick the correct encoding.
File encoding
Defines the character set which is used to read files that contain characters in a different encoding. You can choose from a list of character encodings (UTF-8, UTF-16, etc.), or specify any other encoding supported by your Java Virtual Machine (VM). The default value uses the default encoding of the Java VM, which may depend on the locale or the Java property "file.encoding".
  • OS default: Uses the default decoding set by the operating system.
  • ISO-8859-1: ISO Latin Alphabet No. 1, a.k.a. ISO-LATIN-1.
  • US-ASCII: Seven-bit ASCII, also referred to as US-ASCII.
  • UTF-8: Eight-bit UCS Transformation Format.
  • UTF-16: Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark in the file.
  • UTF-16BE: Sixteen-bit UCS Transformation Format, big-endian byte order.
  • UTF-16LE: Sixteen-bit UCS Transformation Format, little-endian byte order.
  • Other: Enter a valid charset name supported by the Java Virtual Machine.
Custom encoding
A custom character set used to read a CSV file.

Input Ports

Icon
The file system connection.
Icon
The file system connection.

Output Ports

Icon
Table containing the list of files and folders that have been extracted.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.