Datasets by discipline
CSC hosts or provides access to several datasets on different platforms.
Biosciences
- ChEMBL: database of bioactive molecules.
- Chipster_genomes: tool for downloading to Puhti the aligner indexes used by the Chipster software.
Chemistry
- CSD (Cambridge Structural Database): organic and metallo-organic crystal structures and tools.
- Molport: database of 6M molecules, preprocessed for fast GPU screening with Schrödinger Shape.
Geosciences
Language research and other digital humanities and social sciences
- The latest versions of CLARIN PUB- or ACA-licensed corpora are available unpacked in /appl/data/kielipankki/
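
To check which corpora are present, the directory above can be listed from a script. The following is a minimal sketch, assuming it runs on a CSC system where /appl/data/kielipankki/ is mounted; the helper name `list_corpora` is hypothetical and not part of any CSC tool, and the layout below the top-level directory is not assumed.

```python
from pathlib import Path

# Path taken from the text above; it only exists on systems where CSC mounts it.
KIELIPANKKI_DIR = Path("/appl/data/kielipankki/")


def list_corpora(base=KIELIPANKKI_DIR):
    """Return the sorted names of the entries directly under `base`.

    Hypothetical helper for illustration. Returns an empty list if the
    directory does not exist (for example, when run outside CSC systems).
    """
    base = Path(base)
    if not base.is_dir():
        return []
    return sorted(entry.name for entry in base.iterdir())


if __name__ == "__main__":
    # Print one entry name per line.
    for name in list_corpora():
        print(name)
```

Returning an empty list for a missing directory keeps the script usable on machines where the data is not available.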