Integration Authors: Jarl Sondre Sæther (CERN), Henry Mutegeki (CERN), Iacopo Ferrario (EURAC), Matteo Bunino (CERN), Linus Eickhoff (CERN)
To install this package, use the following command:

    pip install -e .

If you are on JSC, you need to run a SLURM script to properly install Horovod and DeepSpeed:

    sbatch installation-scripts/horovod-deepspeed-JSC.slurm

The script installs Horovod and DeepSpeed with the correct installation flags. It usually takes around 20 minutes to complete.
You can launch the training using itwinai's exec-pipeline command as follows:

    itwinai exec-pipeline --config-path configuration_files --config-name <config-name>

We currently offer premade configs for juwels, vega, and lumi. You can find them in the configuration_files directory.
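
To see which values can be passed to --config-name, the small helper below prints one launch command per config file it finds. It is only a sketch: the function name list_launch_commands is hypothetical, and it assumes the configs are stored as <config-name>.yaml files directly inside the directory, which this README does not state.

```shell
#!/usr/bin/env bash
# Sketch: print an exec-pipeline launch command for every config in a directory.
# Assumption (not confirmed by this README): configs are YAML files named
# <config-name>.yaml and the config name is the file name without extension.
list_launch_commands() {
    local config_dir="${1:-configuration_files}"
    local cfg name
    for cfg in "$config_dir"/*.yaml; do
        # Skip the unexpanded pattern when the directory has no .yaml files.
        [ -e "$cfg" ] || continue
        name="$(basename "$cfg" .yaml)"
        echo "itwinai exec-pipeline --config-path $config_dir --config-name $name"
    done
}

# Default to the directory used in this README when no argument is given.
list_launch_commands "${1:-configuration_files}"
```

Run it from the directory that contains configuration_files, then copy the printed line for the system you are on.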