github neuralmagic/deepsparse v0.4.0
DeepSparse v0.4.0

latest releases: v1.7.1, v1.7.0, v1.6.1...
2 years ago

New Features:

  • New operator support implemented for Expand.
  • Slice operator support for positive step sizes. Only slice operations that operate on a single axis are supported. Previously, slice was only supported for constant tensors and step size equal to one.

Changes:

  • Memory usage of compiled models reduced.
  • Memory layout for matrix multiplications in Transformers optimized.
  • Precision for swish and sigmoid operations improved.
  • Runtime performance improved for some networks whose outputs are immediately preceded by transpose operators.
  • Runtime performance of softmax operations improved.
  • Readme redesigned for better clarity on the repository's purpose.

Resolved Issues:

  • Using the multi-stream scheduler, when more threads were selected than the number of cores on the system, it no longer causes a performance hit.
  • Neural Magic dependencies upgrade to intended bug versions instead of minor versions.

Known Issues:

  • None

Don't miss a new deepsparse release

NewReleases is sending notifications on new releases.