Added
- Sample for clients using OpenMP threads calling rocBLAS functions.
- gfx1103, gfx1150, and gfx1151 enabled.
Changed
- By default, the Tensile build is no longer based on tensile_tag.txt but uses the same commit from shared/tensile in the rocm-libraries repository. The rmake or install -t option can build from another local path with a different commit.
Optimized
- Improved the performance of Level 2 gemv transposed (TransA != N) for problem sizes where m is small and n is large on gfx90a and gfx942.