- Preview: NPU compiler integration with the NPU plugin enables ahead-of-time and on-device compilation without relying on OEM driver updates. This feature is enabled by default in this release package.
- Known Issues
- Component: optimum; ID: 179936
Description: phi-4-multimodal instruct model isn’t functional when converted using optimum-cli as channel-wise one (with –group-size -1) with OpenVINO 2026.0. It’s recommended to use for the conversion OV 2025.4/OV 2025.4.1 - Component: GenAI ; ID: 179754
Description: Text2VideoPipeline will crash with RuntimeError when calling generate(guidance_scale=1.0) after the model was reshaped or compiled with guidance_scale > 1.0; use guidance_scale >= 1.0001 as a workaround until the fix is available. - Component: Runtime; ID: 180693
Description: Qwen3-30B-A3B converted with newer transformers doesn’t work, recommend using transformers 4.55.4 for model conversion which was verified and worked. - Component: GenAI ; ID: 179973
Description: Qwen2-vl, Qwen-2.5VL, Qwen3-VL dense models may not work through GenAI API with GPU, due internal issue on model transformation level - Component: Runtime ; ID: 180696
Description: 2nd (and further) latency degradation for Qwen3-MOE family, including lack of ability to fit a model on iGPU, due high memory consumption and potential graph corruption. Problem affects only IRs generated with 2026.0, former IRs generated with 2025.4 will work properly. - Component: Runtime ; ID: 179009
Description: Memory leak for static builds with HybridCRT enabled; impacts Windows only
- Component: optimum; ID: 179936
You can find OpenVINO™ toolkit 2026.0.1 release here:
- Download archives* with OpenVINO™