IT Center Help

Apptainer

Information

Apptainer (formerly Singularity) is a container virtualization software specifically designed for HPC environments. You can imagine containers as lightweight virtual operating systems with preinstalled and preconfigured software that can be run just like any other program on the cluster. You could, for example, run software in an Ubuntu environment within our Rocky compute cluster. This helps overcome portability issues with software that has very specific dependencies, was not built to be run under RHEL-based distributions or needs to be ported with the exact same configuration that was used on another system.

Please Note:

If you want to run software via Apptainer, please read the Best Practices first! They contain important information for beginners and experienced users alike.

Instructions

Container Environment

Containers allow software developers and users to package software and its dependencies in a virtual environment that can easily be ported to completely different systems. This not only eliminates the need for complex software installation but also makes results obtained with the containerized software reproducible, because computations always run on exactly the same software stack.

Conventional containerization software isolates applications from the host system. This poses problems in the context of HPC because users often run jobs across multiple nodes using special interconnects that need software support. Apptainer, however, supports containerized multi-node MPI jobs and can leverage the Intel OmniPath fabric. While you do not have access to the host operating system, you may still do the following within the container:

Access all your personal directories ($HOME, $WORK, $HPCWORK). Within most containers you may access these via the aforementioned variables as usual. You may thus comfortably share data between containerized and native applications.
Access other nodes through the network and run multi-node jobs.
Access GPUs.

In the following paragraphs, container image refers to the files that are used to run a container whereas container refers to the running instance of such an image.

Run a container

There are three standard ways of running an Apptainer container: the shell, exec and run subcommands.

The shell subcommand allows you to start an interactive shell within the container and is intended for use on frontends or within interactive batch jobs.
The exec subcommand allows users to run custom shell commands in a containerized shell. It is useful within batch scripts and can be coupled with a separate exec script if a complex execution flow is required.
The run subcommand triggers the execution of a predefined runscript within the container. Container creators may choose to provide a default workflow which can be accessed this way. Often, it allows running the image like a regular program.

Example of starting a container:

# Start a shell inside the container
apptainer shell $HOME/my_container
# Run the container and bind an additional directory into the container
apptainer run --bind=$HOME/my-input:/opt/my-input $HOME/my_container
# Run the container directly
$HOME/my_container
# Execute a single command in the container
apptainer exec $HOME/my_container cat /etc/os-release
# Execute a shellscript inside the container
apptainer exec $HOME/my_container $HOME/my_exec_script.sh

Apptainer Autocompletion

Apptainer can generate a file to add command autocompletion to your shell. Autocompletion can both make typing long commands easier and give you more information on the available subcommands and arguments. To add autocompletion to your shell, run the following commands:


apptainer completion zsh > $HOME/.apptainer.autocomp.sh 
# Make sure the file was created successfully and not filled with an error message 
cat $HOME/.apptainer.autocomp.sh 
echo "source $HOME/.apptainer.autocomp.sh" >> $HOME/.zshrc 
. $HOME/.zshrc # or restart your session

After that, you can use autocompletion by entering "apptainer " and then pressing TAB. Repeatedly pressing TAB cycles through the subcommands. Argument autocompletion is supported after double dashes, e.g., apptainer shell --<TAB>

To remove autocompletion, delete the created autocompletion file and remove the source line from $HOME/.zshrc
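
Running the echo line above more than once appends duplicate source lines to $HOME/.zshrc. The following is a minimal sketch of an idempotent variant. The helper name append_line_once is hypothetical, and the demonstration writes to a temporary file rather than to your real .zshrc.

```shell
# Hypothetical helper: append a line to a file only if it is not already there.
append_line_once() {
    file="$1"
    line="$2"
    # -q: quiet, -x: match whole lines only, -F: treat the pattern as a fixed string
    if ! grep -qxF -- "$line" "$file" 2>/dev/null; then
        printf '%s\n' "$line" >> "$file"
    fi
}

# Demonstration on a temporary file instead of the real $HOME/.zshrc
demo_rc="$(mktemp)"
append_line_once "$demo_rc" "source $HOME/.apptainer.autocomp.sh"
append_line_once "$demo_rc" "source $HOME/.apptainer.autocomp.sh"

# Count how often the source line ended up in the file
grep -c 'apptainer.autocomp' "$demo_rc"
```

To apply it for real, you could call append_line_once "$HOME/.zshrc" "source $HOME/.apptainer.autocomp.sh".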

Apptainer and MPI

Apptainer supports different models of MPI execution, each suited to different situations. The two most relevant are commonly called the hybrid MPI model and the containerized MPI model.

The hybrid model uses the MPI runtime on the host, usually supplied via one of our MPI modules, to handle the communication between the different processes. The runtime spawns the individual ranks with one container instance per rank, and the applications within the containers connect to the host runtime. This model works seamlessly with Slurm and can utilize all the performance optimizations we include for our regular MPI jobs. The major drawback is that the container must use an MPI version that is (ABI-)compatible with one of the runtimes supported on the host, which can severely limit portability depending on how widespread the MPI runtime is. For Claix this means that the container needs to use a recent version of either Open MPI or Intel MPI (unlikely). If this requirement is met, the hybrid model is the fastest way to get your job running with the desired performance. For an example batch script, see below.

The containerized MPI model bundles an MPI runtime with the application and is independent of the host system. This makes it more flexible when deploying containers on a variety of systems and is generally the best setup for portable containers, as the container can be run virtually anywhere in single-node scenarios. For proper performance and multi-node support, the containerized MPI needs to be built with support for one of the PMI versions used by the host Slurm system. For Claix, you will want to make sure that the containerized MPI is built with PMIx v3.0+ support. Unlike the hybrid model, this allows users to use any MPI runtime in the container as long as it supports the required PMI version. If the runtime is also available on the host, these images can also be used with the hybrid model.

The third option is the so-called bind model which requires manually binding the host MPI implementation into the container. We do not recommend using this model because it defies the idea of container portability and is difficult to handle. It may be used in corner cases where the containerized application has very specific requirements.

GPU Usage in Containers

Apptainer supports GPU access from within containers via a simple command line argument. To use NVIDIA GPUs, add the --nv option after your desired subcommand:

apptainer run --nv $HOME/my_container

Naturally the --nv flag will only work correctly on systems that actually have a GPU installed. If run on a non-GPU host, Apptainer will issue a warning but still execute the container.

Apptainer will use the host's CUDA installation where possible. This works well for a lot of applications that support a recent CUDA version.
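
A script that may run on both GPU and non-GPU nodes could add --nv only where it makes sense and thereby avoid the warning. The sketch below rests on an assumption: it treats a working nvidia-smi command as the indicator for an NVIDIA GPU, which is a heuristic and not an Apptainer feature. The helper name gpu_flag is hypothetical and the image path is a placeholder.

```shell
# Heuristic (assumption): a working nvidia-smi means this node has an NVIDIA GPU.
gpu_flag() {
    if command -v nvidia-smi >/dev/null 2>&1 && nvidia-smi -L >/dev/null 2>&1; then
        printf '%s' '--nv'
    fi
}

image="$HOME/my_container"   # placeholder path, as in the example above
flag="$(gpu_flag)"

if command -v apptainer >/dev/null 2>&1 && [ -e "$image" ]; then
    # $flag is intentionally unquoted so that an empty value adds no argument
    apptainer exec $flag "$image" cat /etc/os-release
else
    # Dry run on systems without Apptainer or without the placeholder image
    echo "would run: apptainer exec $flag $image cat /etc/os-release"
fi
```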

Example Batch Scripts

Serial Example

#!/usr/bin/zsh

### Job name
#SBATCH --job-name=APPTAINER_EXAMPLE

### File / path where STDOUT will be written, %J is the job id
#SBATCH --output=apptainer-job-out.%J

### Request the time you need for execution. The full format is D-HH:MM:SS
### You must at least specify minutes or days and hours and may add or
### leave out any other parameters
#SBATCH --time=30

### Request memory you need for your job in MB
#SBATCH --mem-per-cpu=2000

### Request number of hosts
#SBATCH --nodes=1

### Request number of CPUs
#SBATCH --cpus-per-task=4

### Change to the work directory
cd $HOME/jobdirectory

### Execute the container
### myexecscript.sh contains all the commands that should be run inside the container
apptainer exec /path/to/my/container $HOME/myexecscript.sh

MPI Example (Hybrid Model)

#!/usr/bin/zsh

### Job name
#SBATCH --job-name=APPTAINER_MPI_EXAMPLE

### File / path where STDOUT will be written, %J is the job id
#SBATCH --output=apptainer-job-out.%J

### Request the time you need for execution. The full format is D-HH:MM:SS
### You must at least specify minutes or days and hours and may add or
### leave out any other parameters
#SBATCH --time=30

### Request memory you need for your job in MB
#SBATCH --mem-per-cpu=3800

### Request number of tasks/MPI ranks
#SBATCH --ntasks=4

### Change to the work directory
cd $HOME/jobdirectory

### Run the container
$MPIEXEC apptainer exec /path/to/my/container $HOME/myexecscript.sh

Example for Exec Script

The exec subcommand can execute arbitrary commands. It is most useful in combination with a separate script that is run inside the container.

#!/bin/bash

# The shell in the shebang line above has to exist in the container.
# /bin/bash is a sane default, use /bin/dash (Ubuntu) or /bin/sh if bash is not available
# If you want to use the same shell you use on our native systems, make sure zsh is installed and use /bin/zsh instead.

# Place your application calls here
python ./script.py arg1 arg2

Special problems with shared $HOME directories

You not only have access to your home directory within the container; by default it also serves as the home directory of every container that you execute. This means that configuration files stored within your home directory, such as application config files (for example those of zsh), will be used within the container as well. This can be both an advantage and a disadvantage: a shared configuration may make working within the container more comfortable, but it may also introduce settings that are incompatible with the containerized environment.

Shell-based compatibility issues are mitigated by Apptainer's default behavior of invoking containers with /bin/sh. You may invoke another shell by specifying its path via the --shell argument. The shell needs to exist within the container image, which is usually the case for bash but not for zsh.

Python software within containers should make use of virtual environments or package managers like conda to avoid hard-to-trace side effects.

If you wish to use an empty home directory within a container instead, please add the --no-home flag to your container invocation. This requires you to start the container from a path that is not within your home directory. You can also use a different directory as your temporary home directory via --home /path/on/host.
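
As a rough illustration of the --home variant, the following sketch creates an empty scratch directory and uses it as the container's home directory. The image path is hypothetical. If Apptainer or the image is not available, the script only prints the command it would run.

```shell
image="${WORK:-/tmp}/my_container.sif"   # hypothetical image location
scratch_home="$(mktemp -d)"              # empty directory to act as the home directory

if command -v apptainer >/dev/null 2>&1 && [ -f "$image" ]; then
    # List the (empty) scratch directory from inside the container
    apptainer exec --home "$scratch_home" "$image" ls -A "$scratch_home"
else
    # Dry run on systems without Apptainer or without the hypothetical image
    echo "would run: apptainer exec --home $scratch_home $image ls -A $scratch_home"
fi
```

Configuration files from your real home directory are then not picked up, which helps to tell apart problems caused by the image from problems caused by your personal settings.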

Converting Docker Images for Apptainer

Pull Docker Image From External Resource

The pull command allows retrieving arbitrary Docker images and converting them to Apptainer images in a single step. Container registries or software documentation will often explain how to retrieve an image using docker pull. The following command can be used instead:

apptainer pull ubuntu-22.04.sif docker://ubuntu:22.04

The prefix docker:// tells Apptainer that the following URI points to a Docker image and should be treated as such.
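
Translating a documented docker pull reference into the matching apptainer pull call is mechanical. The sketch below shows one way to do it. The helper name docker_ref_to_pull_cmd is hypothetical, and the output file name follows the name_tag.sif pattern used in the NVCR example on this page, which is a naming choice and not a requirement.

```shell
# Hypothetical helper: print the apptainer pull command for a Docker reference.
docker_ref_to_pull_cmd() {
    ref="$1"
    name="${ref##*/}"                                # drop registry and namespace
    file="$(printf '%s' "$name" | tr ':' '_').sif"   # tag separator ':' becomes '_'
    printf 'apptainer pull %s docker://%s\n' "$file" "$ref"
}

# Only prints the commands; nothing is downloaded
docker_ref_to_pull_cmd ubuntu:22.04
docker_ref_to_pull_cmd nvcr.io/nvidia/tensorflow:23.08-tf2-py3
```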

Pull Image From Nvidia Container Registry

This snippet shows the full process from pulling to executing an image from the NVCR.

# Pull TensorFlow 23.08
apptainer pull tensorflow_23.08-tf2-py3.sif docker://nvcr.io/nvidia/tensorflow:23.08-tf2-py3

# Inspect a container with an interactive shell and GPU support
apptainer shell --nv tensorflow_23.08-tf2-py3.sif

# Execute a predefined script in a container with GPU support, e.g. within Slurm
apptainer exec --nv tensorflow_23.08-tf2-py3.sif ./myexecscript.sh

Build Apptainer Images

Please refer to the official user guide for information on how to build Apptainer images yourself. The section on definition files is very thorough and contains multiple examples, including one for building Apptainer images based on Docker images.
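
To give a first impression of what a definition file looks like, the following sketch writes a minimal one that starts from a Docker base image and installs Python. The base image, the installed package and the output name my_python.sif are illustrative assumptions. The script only prints the build command, since building may require privileges that are not available everywhere.

```shell
def_file="$(mktemp)"

# Minimal definition file: start from a Docker base image and add Python.
cat > "$def_file" <<'EOF'
Bootstrap: docker
From: ubuntu:22.04

%post
    apt-get update
    apt-get install -y --no-install-recommends python3

%runscript
    exec python3 "$@"
EOF

# Show the command that would build an image from the definition file
echo "apptainer build my_python.sif $def_file"
```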

Best Practices for Apptainer Usage

Working with containers can be daunting at first. To get you started, we have compiled this list of best practices to follow when using Apptainer on our system. Not all of these may apply to your use case but we still recommend that you skim over them.

Do not save container images on $HOME

Container images are easy to regenerate and usually quite large. All files stored under $HOME are backed up regularly, so non-crucial data should not be stored there. We therefore strongly recommend storing container images on $WORK or $HPCWORK. For the same reason, downloaded image blobs are cached on $WORK rather than $HOME.
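
A small check like the following could be placed in front of a container call to catch images that accidentally ended up in the home directory. It is only a sketch: the helper name is_under_home and the example paths are hypothetical.

```shell
# Hypothetical helper: succeed if the given path lies inside $HOME.
is_under_home() {
    case "$1" in
        "$HOME"/*) return 0 ;;
        *)         return 1 ;;
    esac
}

for image in "$HOME/my_container.sif" "${WORK:-/work/example}/my_container.sif"; do
    if is_under_home "$image"; then
        echo "warning: $image is stored under \$HOME; consider \$WORK or \$HPCWORK"
    else
        echo "ok: $image"
    fi
done
```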

Stay with SIF containers

SIF files are the default container format for Apptainer and store the container image in a single file. Containers can also be stored in a directory-based format (sandbox directories). Sandbox directories tend to perform a lot worse on $HOME, $WORK and $HPCWORK and should be avoided on these filesystems.

Unload all unnecessary modules

Most modules are not supported inside containers; MPI modules are a notable exception. Loading modules changes a shell's environment, and these changes are carried over to a container invoked within this shell. This does not necessarily break things within the container as long as the environment variables changed by the module are not used by the containerized software. In some instances, such as compiler modules, these changes may cause software to break, e.g. when the C compiler variable CC is set to "icc" (the Intel compiler binary). To avoid such issues we recommend unloading all modules that are not needed for the container before starting Apptainer. If your program does not rely on MPI, you may use ml purge. If you want to go even further, you can use the --contain-env argument to eliminate all environment variables.
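
The inheritance mechanism described above can be observed without any container. In the sketch below a plain sh child process stands in for the container, and CC=icc simulates what a compiler module might set. The env -u call removes the variable for a single invocation only.

```shell
# Simulate what a compiler module might do to the shell environment.
CC=icc
export CC

# A child process (a plain sh, standing in for the container) inherits CC ...
inherited="$(sh -c 'echo "${CC:-unset}"')"

# ... unless the variable is removed for that one invocation with env -u.
cleaned="$(env -u CC sh -c 'echo "${CC:-unset}"')"

echo "inherited: $inherited"
echo "cleaned:   $cleaned"
```

The same pattern could be applied to a container call, for example env -u CC apptainer exec /path/to/my/container $HOME/myexecscript.sh, if unloading the module itself is not an option.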

Use compatible MPI versions

Containers run via MPI need to be provided with a compatible MPI implementation. This can usually be achieved by choosing a compatible version from our module tree, loading the module and starting Apptainer via the proper MPI wrapper (see above for an MPI batch example). If the container has been provided by a third party, it should contain information on the MPI version against which the program was linked. It should be noted that for Open MPI versions below 3.0 compatibility is only guaranteed for versions matching exactly.
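
To compare the versions on both sides, the output of mpirun --version on the host and inside the container can be put next to each other. The following sketch assumes Open MPI on both sides and that mpirun is on the PATH inside the image. The helper name ompi_version is hypothetical and the image path is a placeholder.

```shell
# Hypothetical helper: extract the version number from `mpirun --version`
# output as printed by Open MPI, e.g. "mpirun (Open MPI) 4.1.4".
ompi_version() {
    sed -n 's/^.*(Open MPI) \([0-9][0-9.]*\).*$/\1/p' | head -n 1
}

image="/path/to/my/container"   # placeholder, as in the batch examples

if command -v mpirun >/dev/null 2>&1; then
    echo "host:      $(mpirun --version 2>/dev/null | ompi_version)"
fi
if command -v apptainer >/dev/null 2>&1 && [ -e "$image" ]; then
    echo "container: $(apptainer exec "$image" mpirun --version 2>/dev/null | ompi_version)"
fi
```

An empty version on either side suggests that a different MPI implementation is in use, in which case the version has to be determined by other means.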

Common Use Cases

I want to run a container without $WORK or $HPCWORK

By default both $WORK and $HPCWORK are mapped into each container run on the cluster, giving you access to all personal directories you would have access to in a native environment. When filesystems are temporarily not available or you want to selectively restrict access to your personal directories, you may want to prevent binding them into the container. This can be achieved like so:

apptainer shell --no-mount $WORK,$HPCWORK my_image

The --no-mount flag disables bind mounts for the passed paths. In this case we have excluded both $WORK and $HPCWORK. You can adjust the list of directories according to your needs.

I want to run Python software in a container

In general, container images will provide all necessary Python modules in a default system path which normally takes precedence over any locally installed modules. In this case potentially conflicting module installations within your $HOME directory will not cause any problems. However, some images, mainly those distributed for Docker, might store modules in a custom location that needs to be added to the PYTHONPATH environment variable in order to use the image as intended. You may find further information on this topic in our Python documentation. A properly configured Apptainer image will take care of such issues by setting the PYTHONPATH accordingly. If this is not the case, you should make sure to start the container with an empty PYTHONPATH variable, e.g. by executing unset PYTHONPATH.

You should abstain from any software installations to default locations while inside a container, as this can easily break existing or upcoming software installations. The pip tool should only be used if you are fully aware of the consequences; if you see the need to use it, you are probably doing something wrong. Likewise, sharing venv or conda environments between containers and the host system is almost guaranteed to lead to problems. If your containerized software behaves oddly, you should test dropping your home directory with --no-home or switching it to a test directory with --home /path/to/dir.
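
A cautious way to prepare the shell before starting a Python container might look like the sketch below. Unsetting PYTHONPATH follows the advice above. Setting PYTHONNOUSERSITE is an additional assumption on our side: it is a standard Python environment variable that makes the interpreter ignore the per-user site-packages directory, which usually lives under $HOME/.local. The container call is only printed, and its paths are placeholders.

```shell
# Start from a clean Python search path: drop PYTHONPATH and ask Python to
# ignore the per-user site-packages directory (usually under $HOME/.local).
unset PYTHONPATH
PYTHONNOUSERSITE=1
export PYTHONNOUSERSITE

# Quick check with a host Python, if one is available
if command -v python3 >/dev/null 2>&1; then
    python3 -c 'import site; print("user site enabled:", site.ENABLE_USER_SITE)'
fi

# The container inherits these settings from the invoking shell
echo "apptainer exec /path/to/my/container python3 ./script.py"
```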

I want to pull an image from a different container registry such as GitHub or GitLab but require login credentials

Registries that require authentication cannot be used without a valid endpoint configuration. Luckily, this is supported via a special set of commands. Please follow the instructions in the official user guide.

Common Errors

exec: ...: a shared library is likely missing in the image

This error can have several causes:

You are trying to execute a script that uses an invalid shebang (any scripting language). Please make sure that the path in your shebang, e.g. #!/bin/bash, is indeed available in your container.
You are trying to execute a Python script that relies on modules which have not been installed in your container. In this case please see "I want to run Python software in a container" above.
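
The first cause can be checked quickly by reading the interpreter path from the script's shebang line and testing whether that path exists in the image. The sketch below does this for a throwaway demo script. The helper name shebang_interpreter is hypothetical and the image path is a placeholder, so the container check is skipped if Apptainer or the image is not available.

```shell
# Hypothetical helper: print the interpreter path from a script's shebang line.
shebang_interpreter() {
    sed -n '1s/^#![[:space:]]*\([^[:space:]]*\).*$/\1/p' "$1"
}

# Demonstration with a throwaway script
demo_script="$(mktemp)"
printf '#!/bin/bash\necho hello\n' > "$demo_script"
interp="$(shebang_interpreter "$demo_script")"
echo "interpreter: $interp"

image="/path/to/my/container"   # placeholder
if command -v apptainer >/dev/null 2>&1 && [ -e "$image" ]; then
    if apptainer exec "$image" test -x "$interp"; then
        echo "$interp exists in the image"
    else
        echo "$interp is missing in the image"
    fi
fi
```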

Further Questions

If you have any questions that were not (fully) answered above or have any suggestions for improvements, please contact us via servicedesk@itc.rwth-aachen.de. If your questions concern Apptainer itself, you may find the official User Guide helpful. Please be aware that while we offer support for Apptainer problems that occur on our system (problems while building or running images etc.), we cannot offer support for the software included in images that were not provided by us. Please contact the image creators or software developers where possible.

Last modified on 04.07.2025

This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Germany License