OpenVINO and NNCF
- Ensure compatibility for OpenVINO `v2023.0` by @jiwaszki in #265
- Add Stable Diffusion quantization example by @AlexKoff88 in #294 #304 #326
- Enable decoder quantized models export to leverage cache by @echarlaix in #303
- Set height and width during inference for static Stable Diffusion models by @echarlaix in #308 (usage sketch below)
- Set batch size to 1 by default for Wav2Vec2 for NNCF `v2.5.0` compatibility by @ljaljushkin in #312
- Ensure compatibility for NNCF `v2.5` by @ljaljushkin in #314
- Fix `OVModel` for the BLOOM architecture by @echarlaix in #340
- Add height and width attributes to the Stable Diffusion OpenVINO model and fix export for `torch>=v2.0.0` by @eaidova in #342
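
To illustrate the static-shape workflow touched by #308 and #342, here is a minimal sketch of reshaping an OpenVINO Stable Diffusion pipeline and generating an image. The model id, the 512x512 resolution, and the prompt are arbitrary examples, and the exact argument names may differ slightly across optimum-intel versions.

```python
from optimum.intel import OVStableDiffusionPipeline

# Export the PyTorch model to OpenVINO on the fly (model id is an example)
model_id = "runwayml/stable-diffusion-v1-5"
pipeline = OVStableDiffusionPipeline.from_pretrained(model_id, export=True)

# Fix the input shapes so the compiled model is fully static
batch_size, num_images, height, width = 1, 1, 512, 512
pipeline.reshape(
    batch_size=batch_size,
    height=height,
    width=width,
    num_images_per_prompt=num_images,
)
pipeline.compile()

# The height and width passed at inference must match the static shapes above
image = pipeline(
    "sailing ship in storm by Rembrandt",
    height=height,
    width=width,
    num_images_per_prompt=num_images,
).images[0]
image.save("result.png")
```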
Intel Neural Compressor
- Add `TSModelForCausalLM` to enable TorchScript export, loading and inference for causal language models by @echarlaix in #283
- Remove INC deprecated classes by @echarlaix in #293
- Enable IPEX model inference for the text generation task by @jiqing-feng in #227 #300
- Add `INCStableDiffusionPipeline` to enable loading of INC quantized Stable Diffusion models by @echarlaix in #305 (loading sketch below)
- Enable providing a quantization function instead of a calibration dataset for INC static post-training quantization by @PenghuiCheng in #309
- Fix the `INCSeq2SeqTrainer` evaluation step by @AbhishekSalian in #335
- Fix the `INCSeq2SeqTrainer` padding step by @echarlaix in #336
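
As a quick illustration of the new `INCStableDiffusionPipeline` added in #305, the sketch below loads a quantized Stable Diffusion model and runs inference. The model id is a placeholder, and the top-level `optimum.intel` import path is assumed.

```python
from optimum.intel import INCStableDiffusionPipeline

# Placeholder repository: substitute an actual INC-quantized Stable Diffusion checkpoint
model_id = "path/to/inc-quantized-stable-diffusion"
pipeline = INCStableDiffusionPipeline.from_pretrained(model_id)

# Run inference with the quantized pipeline
image = pipeline("a photo of an astronaut riding a horse on mars").images[0]
image.save("astronaut.png")
```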
Full Changelog: https://github.com/huggingface/optimum-intel/commits/v1.9.0