This is a patch release containing the following changes to v2.1.2:

- Updated xbyak_aarch64 to support Apple silicon (dd1a02a, 913010b, 2d155dd)
- Fixed segfault in fp32 depthwise convolution with padded memory (2d8283f)
- Fixed potential issues in BRGEMM-based convolution implementation (b183dff, d2b1653)
- Fixed memory leak on NVIDIA GPUs (06803f2)