github openucx/ucx v1.4.0-rc2
v1.4.0 RC2

latest releases: v1.17.0, v1.17.0-rc3, v1.17.0-rc2...
pre-release5 years ago

Features:

  • Improved support for installation with latest ROCm
  • Improved support for latest rdma-core
  • Adding support for CUDA IPC for intra-node GPU
  • Added support for CUDA memory allocation cache for mem-type detection
  • Added support for latest Mellanox devices
  • Added support for Nvidia GPU managed memory
  • Added support for multiple connections between the same pair of workers
  • Added support large worker address for client/server connection establishment
    and INADDR_ANY
  • Added support for bitwise atomics operations

Bugfixes:

  • Performance fixes for rendezvous protocol
  • Memory hook fixes
  • Clang support fixes
  • Self tl multi-rail fix
  • Thread safety fixes in IB/RDMA transport
  • Compilation fixes with upstream rdma-core
  • Multiple minor bugfixes (full list on github)
  • Segfault fix for a code generated by armclang compiler
  • UCP memory-domain index fix for zero-copy active messages

Tested configurations:

  • InfiniBand: MLNX_OFED 4.2-4.4, distribution inbox drivers, rdma-core
  • CUDA: gdrcopy 1.2, cuda 9.1.85
  • XPMEM: 2.6.2
  • KNEM: 1.1.2
  • Multiple bugfixes (full list on github)

Known issues:

  • #2919 - Segfault in CUDA support when KNEM not present and CMA is active
    intra-node RMA transpor. As a workaround user can disable CMA support at
    compile time: --disable-cma. Alternatively user can remove CMA from UCX_TLS
    list, for example: UCX_TLS=mm,rc,cuda_copy,cuda_ipc,gdr_copy.

Don't miss a new ucx release

NewReleases is sending notifications on new releases.