github openucx/ucx v1.4.0-rc1
v1.4.0 RC1

latest releases: v1.17.0-rc1, v1.16.0, v1.16.0-rc5...
pre-release5 years ago

Features:

  • Improved support for installation with latest ROCm
  • Improved support for latest rdma-core
  • Adding support for CUDA IPC for intra-node GPU
  • Added support for CUDA memory allocation cache for mem-type detection
  • Added support for latest Mellanox devices
  • Added support for Nvidia GPU managed memory
  • Added support for multiple connections between the same pair of workers
  • Added support large worker address for client/server connection establishment
    and INADDR_ANY
  • Added support for bitwise atomics operations

Bugfixes:

  • Performance fixes for rendezvous protocol
  • Memory hook fixes
  • Clang support fixes
  • Self tl multi-rail fix
  • Thread safety fixes in IB/RDMA transport
  • Compilation fixes with upstream rdma-core
  • Multiple minor bugfixes (full list on github)

Tested configurations:

  • InfiniBand: MLNX_OFED 4.2-4.4, distribution inbox drivers, rdma-core
  • CUDA: gdrcopy 1.2, cuda 9.1.85
  • XPMEM: 2.6.2
  • KNEM: 1.1.2
  • Multiple bugfixes (full list on github)

Known issues:

  • #2919 - Segfault in CUDA support when KNEM not present and CMA is active intra-node RMA transpor. As a workaround user can disable CMA support at compile time: --disable-cma. Alternatively user can remove CMA from UCX_TLS list, for example: UCX_TLS=mm,rc,cuda_copy,cuda_ipc,gdr_copy.

Don't miss a new ucx release

NewReleases is sending notifications on new releases.