This is a patch release for 0.11.0 that contains the following changes:
- Fixed an assertion error that would occur when using
deepsparse.benchmark
on AMD machines with the argument-pin none
.
Known Issues:
- When running quantized BERT with a sequence length not divisible by 4, the DeepSparse Engine will disable optimizations and see very poor performance.