github ROCm/rocBLAS rocm-4.3.0
rocBLAS 2.39.0 for ROCm 4.3.0

latest releases: rocm-6.1.0, rocm-6.0.2, rocm-6.0.0...
2 years ago

Optimizations

  • Improved performance of non-batched and batched rocblas_Xgemv for gfx908 when m <= 15000 and n <= 15000
  • Improved performance of non-batched and batched rocblas_sgemv and rocblas_dgemv for gfx906 when m <= 6000 and n <= 6000
  • Improved the overall performance of non-batched and batched rocblas_cgemv for gfx906

Changed

  • Internal use only APIs prefixed with rocblas_internal_ and deprecated to discourage use

Don't miss a new rocBLAS release

NewReleases is sending notifications on new releases.