0 ×

Star

IBIS Helmholtz-Node extension for KNIME Workbench version 1.8.1.201707071203 by IBIS KNIME Team

STAR, a "splice aware" RNA-seq aligner, aligns RNA-seq reads to a reference genome using uncompressed suffix arrays. Details are described in the publication: STAR: ultrafast universal RNA-seq aligner
Dobin et al, Bioinformatics 2012; doi: 10.1093/bioinformatics/bts635

Options

STAR Options

runMode
alignReads or generateGenome:

alignReads: Aligns reads to a reference genome.
generateGenome: Generates a genome index using one or more fastA files.
Use 2pass mapping?
For the most sensitive novel junction discovery, it is recommended to run STAR in the 2-pass mode. It does not increase the number of detected novel junctions, but allows to detect more spliced reads mapping to novel junctions.
Number of threads
Number of threads to run STAR genomeGenerate or alignReads.
Optional parameters
Parameters can be set like you would normally use them on the command line for this binary.
Path to genome indexes
Path to the genome directory where genome indeces were generated using the STAR generateGenome mode. This option is only needed when applying the alignReads mode.
Path to GTF file
Path to the file with annotated transcripts in the standard GTF format. STAR will extract splice junctions from this file and use them to greatly improve accuracy of the mapping. While this is optional, and STAR can be run without annotations, using annotations is highly recommended whenever they are available.
Overhang
The overhang specifies the length of the genomic sequence around the annotated junction to be used in constructing the splice junctions database. Ideally, this length should be equal to the ReadLength-1, where ReadLength is the length of the reads. For instance, for Illumina 2x100b paired-end reads, the ideal value is 100-1=99. In case of reads of varying length, the ideal value is max(ReadLength)-1. In most cases, the default value of 100 will work as well as the ideal value.
Path to output folder
Path where all the output data is stored. The folder of the input files is used if no path is specified or the given path is invalid.

Input Ports

Icon
Input depends on the runMode:
alignReads: FastQ file(s) loaded with FileLoader (single- and paired-reads are supported).
generateGenome: FastA file(s) loaded with FileLoader.

Output Ports

Icon
Cell 0: In case of alignReads path to SAM file, otherwise path to the folder containing the generated genome.

Views

STDOUT / STDERR
The node offers a direct view of its standard out and the standard error of the tool.

Best Friends (Incoming)

Best Friends (Outgoing)

Workflows

Installation

To use this node in KNIME, install KNIME4NGS from the following update site:

KNIME 4.3

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.