User guide¶
SortMeRNA is a local sequence alignment tool for filtering, mapping and OTU clustering. The core algorithm is based on approximate seeds and allows for fast and sensitive analyses of NGS reads. The main application of SortMeRNA is filtering rRNA from metatranscriptomic data. Additional applications include OTU-picking and taxonomy assignation available through QIIME v1.9+. SortMeRNA takes as input one or two (paired) reads file(s) (fasta/fasta.gz/fastq/fastq.gz), and one or multiple rRNA database file(s), and sorts apart aligned and rejected reads into two files. SortMeRNA works with Illumina, 454, Ion Torrent and PacBio data, and can produce SAM and BLAST-like alignments.
Note
This project is under active development.
Basic usage¶
The only required options are --ref
and --reads
. Options (any) can be specified usig a single dash e.g. -ref
and -reads
. Both plain fasta/fastq
and archived fasta.gz/fastq.gz
files are accepted. File extensions .fastq
, .fastq.gz
, .fq
, .fq.gz
, .fasta
, … are optional. The format and compression are automatically recognized. Relative paths are accepted.
Example 1¶
Single reference and single reads file:
sortmerna --ref REF_PATH --reads READS_PATH
Example 2¶
For multiple references use multiple --ref
:
sortmerna --ref REF_PATH_1 --ref REF_PATH_2 --ref REF_PATH_3 --reads READS_PATH
Example 3¶
For Paired reads use --reads
twice:
sortmerna --ref REF_PATH_1 --ref REF_PATH_2 --ref REF_PATH_3 --reads READS_PATH_1 --reads READS_PATH_2