The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multi-node collective communication primitives that are performance-optimized for NVIDIA GPUs. NCCL provides routines such as all-gather, all-reduce, broadcast, reduce, and reduce-scatter, which are optimized to achieve high bandwidth and low latency over PCIe and NVLink high-speed interconnects.
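
To make the meaning of the collectives named above concrete, the following is a minimal pure-Python sketch of their data movement. It does not call NCCL and says nothing about how NCCL implements them: each rank is modelled as a plain list, the reduction is assumed to be a sum, and all function names are hypothetical, chosen only for illustration.

```python
# Pure-Python illustration of collective semantics; this does NOT use NCCL.
# Assumptions: each rank is one list of numbers, the reduction is a sum,
# and every function name below is hypothetical.

def _elementwise_sum(buffers):
    """Sum the buffers of all ranks position by position."""
    return [sum(values) for values in zip(*buffers)]

def all_reduce(buffers):
    """Every rank ends up with the element-wise sum over all ranks."""
    total = _elementwise_sum(buffers)
    return [list(total) for _ in buffers]

def broadcast(buffers, root=0):
    """Every rank ends up with a copy of the root rank's buffer."""
    return [list(buffers[root]) for _ in buffers]

def reduce(buffers, root=0):
    """Only the root receives the sum; the other ranks keep their own data."""
    total = _elementwise_sum(buffers)
    return [total if r == root else list(buf) for r, buf in enumerate(buffers)]

def all_gather(buffers):
    """Every rank ends up with all buffers concatenated in rank order."""
    gathered = [x for buf in buffers for x in buf]
    return [list(gathered) for _ in buffers]

def reduce_scatter(buffers):
    """The sum is split into equal chunks and rank r keeps chunk r."""
    n = len(buffers)
    total = _elementwise_sum(buffers)
    chunk = len(total) // n  # assumes the length is divisible by the rank count
    return [total[r * chunk:(r + 1) * chunk] for r in range(n)]

ranks = [[1, 2], [10, 20]]    # two simulated ranks, two elements each
print(all_reduce(ranks))      # [[11, 22], [11, 22]]
print(broadcast(ranks))       # [[1, 2], [1, 2]]
print(reduce(ranks))          # [[11, 22], [10, 20]]
print(all_gather(ranks))      # [[1, 2, 10, 20], [1, 2, 10, 20]]
print(reduce_scatter(ranks))  # [[11], [22]]
```

In the real library these operations run on GPU buffers across devices and nodes; the sketch only shows which data each rank holds afterwards.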

Version: 2.14.3

Availability: LEONARDO

Target: all

Official web site:

Related Commands:

Set up the environment with the commands:

module load spack
module load autoload nccl