github michaelfeil/infinity 0.0.3

latest releases: 0.0.46, 0.0.45, 0.0.44...
7 months ago

What's Changed

  • add Flash-Attention+ optimum-BetterTransformers by @michaelfeil in #20
  • Improve real-time / sleep strategy, async await for queues and result futures - reducing latency a bit by @michaelfeil in #12
  • add better FIFO queueing strategy - your requests now have a upper bound how long they queue by @michaelfeil in #19

Docs:

Full Changelog: 0.0.2rc0...0.0.3

Don't miss a new infinity release

NewReleases is sending notifications on new releases.