Skip to content

MetaPhlAn

Description

MetaPhlAn is a computational tool for profiling the composition of microbial communities from metagenomic sequencing data. MetaPhlAn relies on unique clade-specific marker genes identified from ~17,000 reference genomes.

License

Free to use and open source under MIT License.

Availability

  • Puhti: 4.0.2

Usage

To activate MetaPhlAn Puhti, run command:

module load metaphlan
metaphlan --help
MetaPhlAn can automatically retrieve the MetaPhlAn database and create the Bowtie2 indexes it needs on-the-fly when it the command is executed. By default MetaPhlAn saves these index files to the MetaPhlAn installation directory, but in Puhti, this is not possible. Because of that, the users should use option --bowtie2db to define a directory that will be used to store the database and index files.

For example in the case of project_2001234 the user could first create a directory for the databases:

cd /scratch/project_2001234
mkdir metaphlan_databases
A test input dataset for testing MataPhlAn can be downloaded from the metaphlan gothub site:
wget https://github.com/biobakery/biobakery/raw/master/demos/biobakery_demos/data/metaphlan3/input/SRS014476-Supragingival_plaque.fasta.gz
In the MetaPhlAn command --bowtie2db is used to define the database directory. In this example the job is executed as an interactive batch job.

sinteractive -m 4G -c 4
module load metaphlan
metaphlan --bowtie2db metaphlan_databases  SRS014476-Supragingival_plaque.fasta.gz --input_type fasta > SRS014476-Supragingival_plaque_profile.txt

More information