Getting started with Puhti
This is a quick start guide for Puhti users. It is assumed that you have previously used CSC cluster resources like Taito/Sisu; if not, you can start by looking at the overview of CSC supercomputers.
Go to MyCSC to apply for access to Puhti, or to view your projects and their project numbers if you already have access. You can also use the `csc-projects` command.
Connecting to Puhti
Connect using a normal SSH client:

```bash
ssh yourcscusername@puhti.csc.fi
```

where `yourcscusername` is the username you get from CSC.
There is also a web interface available at www.puhti.csc.fi, where you can log in with your CSC username. In this interface you can manage files, launch interactive applications, and list jobs, quotas and project statuses. You can also use it for graphical applications.
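If you connect frequently, you can add a host entry to your SSH client configuration so that a short alias is enough. A minimal sketch (the `puhti` alias is just an example; replace the username with your own):

```
# Example entry in ~/.ssh/config; "puhti" is a placeholder alias
Host puhti
    HostName puhti.csc.fi
    User yourcscusername
```

After this, `ssh puhti` suffices.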
Module system
CSC uses the Lmod module system.
Modules are set up in a hierarchical fashion, meaning you need to load a compiler before MPI and other libraries appear.
See more information about modules.
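As a sketch of how the hierarchy behaves in practice (module names here are illustrative; check `module avail` for what actually exists on the system):

```bash
module avail          # list modules visible right now
module load gcc       # load a compiler first ...
module avail          # ... now MPI and library modules built for it appear
module load openmpi   # load an MPI implementation for this compiler
module list           # show what is currently loaded
module spider fftw    # search the whole hierarchy for a module
```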
Compilers
The system comes with two compiler families installed: the Intel oneAPI and GCC compilers. Of the Intel compilers, `intel-oneapi-compilers-classic/2021.6.0` and `intel-oneapi-compilers/2022.1.0` are installed, and for GCC, versions 11.3, 9.4 and 8.5 are available.
See more information about compilers.
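For example, compiling a simple C program with each family might look like the sketch below. The exact module version strings are assumptions based on the versions listed above; confirm them with `module avail`:

```bash
# GCC (version string assumed; check `module avail gcc`)
module load gcc/11.3.0
gcc -O2 -o hello hello.c

# Intel classic compilers (version from the list above)
module load intel-oneapi-compilers-classic/2021.6.0
icc -O2 -o hello hello.c
```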
High performance libraries
Puhti has several high performance libraries installed, see more information about libraries.
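As an illustrative sketch, you would typically locate a library in the module hierarchy and load its prerequisites first. FFTW is used here only as an example name, and the assumption is that the module sets the compiler search paths:

```bash
module spider fftw             # find which compiler/MPI combinations provide it
module load gcc openmpi fftw   # load prerequisites first, then the library
gcc -O2 fft_demo.c -lfftw3 -o fft_demo   # link against the library
```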
MPI
Currently the system has two MPI implementations installed:
- `openmpi`
- `intel-oneapi-mpi`
We recommend testing with `openmpi` first.
Note
You need to have the MPI module loaded when submitting your jobs.
See more information about building and running MPI applications.
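As a minimal sketch, building and test-running an MPI hello-world could look like this (`<project>` and the partition are placeholders; module names as above):

```bash
module load gcc openmpi    # keep the MPI module loaded when submitting, too
mpicc -O2 -o hello_mpi hello_mpi.c
# quick interactive test run with four tasks
srun --account=<project> --partition=test --ntasks=4 --time=00:05:00 ./hello_mpi
```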
Applications
More information about specific applications can be found here.
Python
System Python is available by default on both Puhti and Mahti without loading any module. Python 3 (3.6.8) is available as `python3`. The default system Python does not include any optional Python packages. Several other Python installations are available via the module system; `python-data` is a good first choice, as it includes many regularly used packages.
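For example (the assumption here is that `python-data` provides NumPy and pandas; see the module's documentation for the actual package list):

```bash
module load python-data
python3 -c "import numpy, pandas; print(numpy.__version__)"
```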
Running jobs
Puhti uses the Slurm batch job system.
A description of the different Slurm partitions can be found here. Note that the GPU partitions are accessible from the normal login nodes.
Instructions on how to submit jobs can be found here, and example batch job scripts are found here.
Very important change!!
You have to specify your billing project in your batch script with the `--account=<project>` flag. Failing to do so will cause your job to be held with the reason `AssocMaxJobsLimit`. Running `srun` directly also requires the flag.
More information about billing can be found here, and common queuing system error messages are listed in the FAQ.
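A minimal batch script sketch illustrating the `--account` flag; the partition and resource values are placeholders to adapt:

```bash
#!/bin/bash
#SBATCH --account=<project>   # billing project; mandatory
#SBATCH --partition=test      # choose a partition from the descriptions above
#SBATCH --time=00:10:00       # time limit (hh:mm:ss)
#SBATCH --ntasks=1            # number of tasks
#SBATCH --mem=1G              # memory reservation

module load gcc openmpi      # keep the MPI module loaded for MPI jobs
srun ./my_program
```

Submit with `sbatch job.sh` and check its status with `squeue -u $USER`.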
Network
- Login nodes can access the Internet
- Compute nodes can access the Internet
Storage
The project-based shared storage can be found under `/scratch/<project>`. Note that this folder is shared by all users in a project. It is not meant for long-term data storage, and files that have not been used for 180 days will be automatically removed. The default quota for this folder is 1 TB. There is also persistent project-based storage with a default quota of 50 GB, located under `/projappl/<project>`. Each user can store up to 10 GB of data in their home directory (`$HOME`).
You can check your current disk usage with `csc-workspaces`. More detailed information about storage can be found here, and a guideline on managing data on the `/scratch` disk here. See also the LUE tool for efficiently querying how much data and how many files you have in a directory.
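As a quick command-line sketch (`<project>` is a placeholder; `du` is shown here as a generic alternative, not the LUE tool itself):

```bash
csc-workspaces          # summary of quotas and usage for your projects
cd /scratch/<project>   # replace <project> with your project name
du -sh ./*              # rough per-directory usage; can be slow on large trees
```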
Linux basics tutorial for CSC
If you are new to Linux command line or using supercomputers, please consult this tutorial section! See also the materials of the CSC Computing Environment self-learning course.