Bio Sequence Reader

Reads a sequence file into two columns: seq_name and sequence

This implementation uses the Biojava (VERSION 1.7.1) implementation for reading the data

It creates a sequence object that can be accessed by special nodes

The following file formats can be read in Fasta Genbank uniprot embl INSDseq

you also have to specify if the sequences are RNA, DNA, protein sequences

Annotations are stored with the sequence objects

Sample code to access annotation

Sample code to access sequence information

Large sequence files with lots of annotations might not fit into memory

Options

File name
Filename of sequence file
Type
Sequence type (RNA, DNA, Protein)
Format
File format (FASTA, Genbank, Uniprot, embl, INSDseq)
Limit
Limit the number of entries to read in. (0=all)

Input Ports

This node has no input ports

Output Ports

Icon
Sequence object representation of the input file

Views

This node has no views

Workflows

  • No workflows found

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.