Mothur
Mothur is a bioinformatics toolkit for the needs of the microbial ecology related data analysis.
License
Free to use and open source under GNU GPLv3.
Available
- Puhti: 1.39.5, 1.44.0, 1.48.0
- Chipster graphical user interface
Usage
To initialize the default version of Mothur on Puhti, use:
To see all the available versions:
To load a specific version:
To run Mothur in interactive mode, use sinteractive.
If your analyses take a long time, or you wish to use multiple cores, you should run Mothur as a batch job.
Start by collecting your Mothur commands into a command file to use Mothur in batch mode.
Once you have a working Mothur command file, you can launch Mothur jobs that take several days for completion, if needed.
Below is a sample Mothur batch job file. In this example, we assume that the Mothur commands are in the file my_mothur_task.txt
.
#!/bin/bash
#SBATCH --account=project_1234567
#SBATCH --job-name=mothur
#SBATCH --output=output_%j.txt
#SBATCH --error=errors_%j.txt
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=1
#SBATCH --mem=32G
#SBATCH --time=48:00:00
#SBATCH --partition=small
module load mothur
mothur my_mothur_task.txt
If you want to use multiple cores, adjust parameter --cpus_per_task
. You must also adjust the processors
parameter for each command in the Mothur command file accordingly. Note that only some Mothur commands can use multiple cores.
Mothur jobs need to run inside a single node, so the maximum number of cores you can use on Puhti is 40. You should check the scalability before submitting large jobs. Many Mothur tasks won't scale well beyond a few cores. Using too many cores may even make your job run slower.
The batch job script described above (in this case named as mothur_batch_job.sh
) can be submitted to the batch job system
with the command:
See the Puhti user guide for more information about running batch jobs.