invoke-ai/InvokeAI v5.9.0 on GitHub

This release adds FLUX Fill support in Workflows and Canvas, beta support for the LLaVA OneVision VLLM family of models, and a selection of minor fixes and enhancements.

FLUX Fill

FLUX Fill provides high-quality inpainting and outpainting, with better results on these tasks than the other FLUX models. It's a "main" model, like FLUX dev or schnell.

To use it, download it from Starter Models and then select it from the main model drop-down on Canvas. It's not compatible* with Text to Image or Image to Image - you'll get an error if you try to Invoke without an inpaint mask or some empty regions in your bbox.

*Technically, it can do Text to Image and Image to Image - but the quality is very poor. We've opted to disallow this on Canvas.
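The Canvas rule described above (FLUX Fill needs either an inpaint mask or some empty region in the bbox) can be sketched as a simple check. This is an illustrative sketch only, not Invoke's actual implementation: the function name `can_invoke_flux_fill` and the nested-list representation of the mask and alpha channel are assumptions made for the example.

```python
from typing import Sequence


def can_invoke_flux_fill(
    inpaint_mask: Sequence[Sequence[bool]],
    bbox_alpha: Sequence[Sequence[int]],
) -> bool:
    """Return True if there is something for FLUX Fill to fill inside the bbox.

    Hypothetical inputs, chosen for illustration:
      inpaint_mask: True where an inpaint mask has been painted.
      bbox_alpha:   per-pixel alpha of the bbox contents; 0 means empty.
    """
    # An inpaint mask exists if any pixel is marked.
    has_mask = any(any(row) for row in inpaint_mask)
    # An empty region exists if any pixel in the bbox is fully transparent.
    has_empty_region = any(alpha == 0 for row in bbox_alpha for alpha in row)
    return has_mask or has_empty_region


# A fully opaque bbox with no mask is plain Image to Image, so it is rejected.
print(can_invoke_flux_fill([[False, False]], [[255, 255]]))
```

Under these assumptions, a request with neither a mask nor empty pixels corresponds to the Text to Image or Image to Image case that Canvas disallows for this model.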

LLaVA OneVision VLLM

This multimodal model generates text from text, image and/or video* inputs. You can use it to generate prompts and describe images. It is available in Workflows via the LLaVA OneVision VLLM node.

The 0.5B variant of the model is available for download from Starter Models.

*Invoke does not support video inputs.

Other Changes

Support for custom string field drop-downs in Workflow Builder. Add a node's string field to the Builder and choose the dropdown component to see it in action.
The About modal now shows the app's runtime settings. It includes a list of explicitly set settings (i.e. the contents of invokeai.yaml), so you can see which runtime settings are app defaults and which are user-defined.
Improved UX for missing or unexpected fields in Workflows.
De-wonkified LoRA node names (they got wonkified in v5.8.0).
Better error messages when scanning models with picklescan.
Fixed issue where shift-clicking to draw on Canvas ignored Clip to Bbox setting.
Fixed issue with Image Viewer where the image could overflow the viewer.
Fixed overflow with looooong node titles.
Fixed a minor visual bug in string generator nodes.
Internal: First iteration of improved model probing API.
Internal: Improved testing system for model-related tests.
Internal: Port LLaVA OV models to use new API.
Internal: Cleaned up a lot of model-related code.
Internal: Support hot reload for custom nodes. Thanks @keturn!
Updated translations. Thanks @rikublock @Harvester62 @Linos1391!
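
The About modal entry above distinguishes explicitly set settings (the contents of invokeai.yaml) from app defaults. A minimal sketch of that distinction follows, assuming both sets of settings are available as plain dictionaries. The function name `label_settings` and the example keys and values are hypothetical and are not taken from Invoke's code.

```python
def label_settings(defaults: dict, explicit: dict) -> dict:
    """Label each runtime setting as user-defined or an app default.

    defaults: every setting with its built-in value (hypothetical input).
    explicit: only the settings the user wrote to their config file.
    """
    report = {}
    for key, default_value in defaults.items():
        if key in explicit:
            # The user's value overrides the built-in default.
            report[key] = ("user-defined", explicit[key])
        else:
            report[key] = ("app default", default_value)
    return report


# Illustrative values only.
report = label_settings(
    defaults={"host": "127.0.0.1", "port": 9090},
    explicit={"port": 8080},
)
for name, (origin, value) in report.items():
    print(f"{name}: {value} ({origin})")
```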

Installing and Updating

The new Invoke Launcher is the recommended way to install, update and run Invoke. It takes care of a lot of details for you - like installing the right version of Python - and runs Invoke as a desktop application.

Follow the Quick Start guide to get started with the launcher.

If you don't want to use the launcher, or need a headless install, you can follow the manual install guide.

What's Changed

Add LLaVA OneVision model support by @RyanJDick in #7693
Model classification api by @jazzhaiku in #7742
feat(app): better errors when scanning models with picklescan by @psychedelicious in #7803
Stripped models by @jazzhaiku in #7797
Use git-lfs for larger assets by @jazzhaiku in #7804
Add FLUX Fill support by @RyanJDick in #7780
Customized workflow browsing by @maryhipp in #7810
ui: workflows recommended followup by @psychedelicious in #7812
fix(ui): clip shift-draw strokes to bbox when clip to bbox enabled by @psychedelicious in #7813
feat(ui): support FLUX Fill on Canvas by @psychedelicious in #7811
feat(ui): better missing fields ux in workflows & builder by @psychedelicious in #7816
ui: translations update from weblate by @weblate in #7805
chore: prep for v5.9.0rc1 by @psychedelicious in #7815
fix(mm): flux model variant probing by @psychedelicious in #7825
feat(nodes): clean up lora node names by @psychedelicious in #7824
feat(ui): custom dropdowns for strings in builder by @psychedelicious in #7818
fix(ui): siglip model translation key by @psychedelicious in #7827
Port LLaVA to new API by @jazzhaiku in #7817
fix(ui): mask logic in graph builders by @psychedelicious in #7828
fix(ui): restore display: flex to image viewer and node editor by @psychedelicious in #7831
ui: translations update from weblate by @weblate in #7821
fix: make dev_reload work for files in nodes/ by @keturn in #7819
chore: prep for v5.9.0rc2 by @psychedelicious in #7829
Import Smoke Test by @jazzhaiku in #7835
fix: make source location discovery more robust by @keturn in #7837
fix(ui): long node titles overflow by @psychedelicious in #7838
feat: show runtime config in about modal by @psychedelicious in #7840
fix(ui): correctly render whitespace in strings in string generator previews by @psychedelicious in #7832
Taxonomy by @jazzhaiku in #7833
chore: prep for v5.9.0 by @psychedelicious in #7845
fix(mm): LLaVA OneVision model calculates its own size by @psychedelicious in #7844

Full Changelog: v5.8.1...v5.9.0