adam

Biostatistics
Genetics

Software Description

ADAM is a library and command line tool that enables the use of Apache Spark to parallelize genomic data analysis across cluster/cloud computing environments. ADAM uses a set of schemas to describe genomic sequences, reads, variants/genotypes, and features, and can be used with data in legacy genomic file formats such as SAM/BAM/CRAM, BED/GFF3/GTF, and VCF, as well as data stored in the columnar Apache Parquet format.


Info

Module Name

adam

Last Updated On

08/29/2023

Support Level

Secondary Support

Software Access Level

Open Access

Home Page

http://adam.readthedocs.io/en/latest/

Documentation

General Linux

To load this module for use in a Linux environment, you can run the command:

module load adam

Depending on which system you are working on, more than one version of adam may be available. To see which versions can be loaded, run:

module avail adam
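
When more than one version is installed, a specific one can usually be requested by name. The sketch below assumes the common name/version convention used by Environment Modules and Lmod (for example adam/06042018, the version listed on this page). The function adam_module_name is a hypothetical helper introduced only for illustration, and the guard around `module` is there so the snippet does nothing harmful on a machine without the module system.

```shell
# Hypothetical helper (not part of adam or the module system):
# builds "adam/<version>" when a version is given, plain "adam" otherwise.
adam_module_name() {
    version="$1"
    if [ -n "$version" ]; then
        printf 'adam/%s\n' "$version"
    else
        printf 'adam\n'
    fi
}

# Load the pinned version only if the `module` command exists in this shell.
# Assumption: the site accepts the name/version form shown by `module avail`.
if command -v module >/dev/null 2>&1; then
    module load "$(adam_module_name 06042018)"
    module list    # confirm which modules are now loaded
else
    echo "module command not found; run this on a cluster node"
fi
```

Pinning the version in job scripts keeps results reproducible if the default module changes later.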

Agate Modules

Default

06042018

Other Modules

06042018

Mangi Modules

Default

06042018

Other Modules

06042018

Mesabi Modules

Default

06042018

Other Modules

06042018