github huggingface/optimum-intel v1.21.0
v1.21.0: SD3, Flux, MiniCPM, NanoLlava, VLM Quantization, XPU, PagedAttention

16 days ago

What's Changed

OpenVINO

Diffusers

VLMs Modeling

NNCF

IPEX

  • Unified XPU/CPU modeling with custom PagedAttention cache for LLMs by @sywangyi in #1009

INC

New Contributors

Full Changelog: v1.20.0...v1.21.0
