Minnesota Supercomputing Institute
With regards to the safety measures put in place by the university to mitigate the risks of the COVID-19 virus, at this time all MSI systems will remain operational and can be accessed remotely as usual. The only planned outages concern our in-person Helpdesk and tutorials. More information, as well as alternative remote support options, can be found at MSI COVID-19 Continuity Plan
Wednesday, December 7, 2016
MOSAIK is a reference-guided assembler comprising of two main modular programs:
MosaikBuild converts various sequence formats into Mosaik’s native read format. MosaikAligner pairwise aligns each read to a specified series of reference sequences and produces BAMs as outputs. At this time, the workflow consists of supplying sequences in FASTA, FASTQ, Illumina Bustard & Gerald, or SRF file formats and producing results in the BAM format.
Unlike many current read aligners, MOSAIK produces gapped alignments using the Smith-Waterman algorithm. MOSAIK is written in highly portable C++ and currently targetted for the following platforms: Microsoft Windows, Apple Mac OS X, FreeBSD, and Linux operating systems. Other platforms can easily be supported upon request.
MOSAIK is multithreaded. If you have a machine with 8 processors, you can use all 8 processors to align reads faster while using the same memory footprint as when using one processor.
MOSAIK supports multiple sequencing technologies. In addition to legacy technologies such as Sanger capillary sequencing, our program supports next generation technologies such as Roche 454, Illumina, AB SOLiD, and experimental support for the Helicos Heliscope.
To run this software interactively in a Linux environment run the commands:
module load mosaik
Note that when using MosaikAlign the neural network options should be specified as:
-annpe $MOSAIK_DATA/pe.ann -annse $MOSAIK_DATA/se.ann