fastx-toolkit

Genetics

Software Description

The FASTX-Toolkit is a collection of command line tools for preprocessing short nucleotide reads in FASTA and FASTQ formats, usually produced by Next-Generation sequencing machines. The main processing of such FASTA/FASTQ files is mapping (aligning) the sequences to reference genomes or other databases using specialized programs like BWA, Bowtie and many others. However, it is sometimes more productive to preprocess the FASTA/FASTQ files before mapping the sequences to the genome---manipulating the sequences to produce better mapping results. The FASTX-Toolkit tools perform some of these preprocessing tasks.


Info

Module Name

fastx_toolkit

Last Updated On

08/29/2023

Support Level

Secondary Support

Software Access Level

Open Access

Home Page

http://hannonlab.cshl.edu/fastx_toolkit/

Documentation

Software Description

The FASTX-Toolkit is a collection of command line tools for preprocessing short nucleotide reads in FASTA and FASTQ formats, usually produced by Next-Generation sequencing machines. The main processing of such FASTA/FASTQ files is mapping (aligning) the sequences to reference genomes or other databases using specialized programs like BWA, Bowtie and many others. However, it is sometimes more productive to preprocess the FASTA/FASTQ files before mapping the sequences to the genome---manipulating the sequences to produce better mapping results. The FASTX-Toolkit tools perform some of these preprocessing tasks.

General Linux

To run this software interactively in a Linux environment run the commands:

module load fastx_toolkit
fastq_to_fasta [options] -i INFILE -o OUTFILE

To display usage instructions for each tool, you can use the -h argument, e.g.,

fastq_to_fasta -h

All of the FASTX_Toolkit utilities may be found in the directory

/soft/fastx_toolkit/VER/bin

where VER is the module version you are using. Please note, if you are working on fastq files that use the standard Sanger encoding for phred base calling quality scores, you need to add -Q33 in the command. Fastx_toolkit was developed long time ago, and assume the Illumina (1.7-) format as the default encoding. Additional Information

Command Line Usage

Agate Modules

Default

0.0.14

Other Modules

0.0.13, 0.0.14

Mangi Modules

Default

0.0.14

Other Modules

0.0.13, 0.0.14

Mesabi Modules

Default

0.0.14

Other Modules

0.0.13, 0.0.14