New Features:
- New operator support implemented for Expand.
- Slice operator support for positive step sizes. Only slice operations that operate on a single axis are supported. Previously, slice was only supported for constant tensors and step size equal to one.
Changes:
- Memory usage of compiled models reduced.
- Memory layout for matrix multiplications in Transformers optimized.
- Precision for swish and sigmoid operations improved.
- Runtime performance improved for some networks whose outputs are immediately preceded by transpose operators.
- Runtime performance of softmax operations improved.
- Readme redesigned for better clarity on the repository's purpose.
Resolved Issues:
- Using the multi-stream scheduler, when more threads were selected than the number of cores on the system, it no longer causes a performance hit.
- Neural Magic dependencies upgrade to intended bug versions instead of minor versions.
Known Issues:
- None