STAR
Description
STAR (Spliced Transcripts Alignment to a Reference) is a fast NGS read aligner for RNA-seq data.
License
Free to use and open source under GNU GPLv3.
Available
Version on CSC's Servers
Puhti: 2.7.10a
Usage
The STAR commands listed abowe are activated by loading biokit module.
module load biokit
Before you can run the actual alignment job, you must index your fasta formatted reference genome. In Puhti the working copies of reference genome indexes, as well as any large files, should be stored to the scatch directory ($SCRATCH). In this example we store the indexes to the directory $SCRATCH/star-genomes.
First the reference genome index directory is generated with command:
mkdir $SCRATCH/star-genome
STAR --runMode genomeGenerate --genomeDir $SCRATCH/star-genome --genomeFastaFiles /path/to/genome/genome.fasta --runThreadN 2
mkdir $SCRATCH/starjob1
cd $SCRATCH/starjob1
STAR --genomeDir $SCRATCH/star-genomes --readFilesIn my_reads.fastq
/appl/soft/bio/star/STAR-2.7.2a/source/parametersDefault
#!/bin/bash -l
#SBATCH --job-name=STAR
#SBATCH --output=STAR.stdout
#SBATCH --error=STAR.stderr
#SBATCH --partition=small
#SBATCH --ntasks=1
#SBATCH --nodes=1
#SBATCH --cpus-per-task=6
#SBATCH --account=project_1234567
#SBATCH --mem=24000
# calculate indexes. You don't need to recalculte the indexes if they already exist.
mkdir $SCRATCH/star-genome
STAR --runMode genomeGenerate --genomeDir $SCRATSCH/star-genome --genomeFastaFiles /path/to/genome/genome.fasta --runThreadN $SLURM_CPUS_PER_TASK
# Run the mapping task
STAR --genomeDir $SCRATCH/star-genome --readFilesIn my-reads.fastq --runThreadN $SLURM_CPUS_PER_TASK
The batch job script is launced with command sbatch. For example:
sbatch starjob1.sh