github EricLBuehler/mistral.rs v0.3.4

19 hours ago

New features

  • Qwen2-VL support
  • Idefics 3/SmolVLM support
  • ️‍🔥 6x prompt performance boost (all benchmarks faster than or comparable to MLX, llama.cpp)!
  • 🗂️ More efficient non-PagedAttention KV cache implementation!
  • Public tokenization API

Python wheels

The wheels now include support for Windows, Linux, and Mac with x84_64 and aarch64.

MSRV

1.79.0

What's Changed

New Contributors

Full Changelog: v0.3.2...v0.3.4

Don't miss a new mistral.rs release

NewReleases is sending notifications on new releases.