This release adds FLUX Fill support in Workflows and Canvas, beta support for the LLaVA OneVision VLLM family of models, and a selection of minor fixes and enhancements.
FLUX Fill
FLUX Fill provides high quality inpainting and outpainting, improving on these tasks over the other FLUX models. It's a "main" model, like FLUX dev or schnell.
To use it, download it from Starter Models and then select it from the main model drop-down on Canvas. It's not compatible* with Text to Image or Image to Image - you'll get an error if you try to Invoke without an inpaint mask or some empty regions in your bbox.
*Technically, it can do Text to Image and Image to Image - but the quality is very poor. We've opted to disallow this on Canvas.
LLaVA OneVision VLLM
This multimodal model generates text from text, image and/or video* inputs. You can use it to generate prompts and and describe images. You can use it in Workflows with the LLaVA OneVision VLLM
node.
The 0.5B variant of the model is available for download from Starter Models.
*Invoke does not support video inputs.
Other Changes
- Support for custom string field drop-downs in Workflow Builder. Add a node's string field to the Builder and choose the dropdown component to see it in action.
- The About modal now shows the app's runtime settings. It includes a list of explicitly-set settings (i.e. the contents of
invokeai.yaml
), so it is possible to see what runtime settings are app defaults and which are user-defined. - Improved UX for missing or unexpected fields in Workflows.
- De-wonkified LoRA node names (they got wonkified in v5.8.0).
- Better error messages when scanning models with
picklescan
. - Fixed issue where shift-clicking to draw on Canvas ignored
Clip to Bbox
setting. - Fixed issue with Image Viewer where the image could overflow the viewer.
- Fixed overflow with looooong node titles.
- Fixed a minor visual bug in string generator nodes.
- Internal: First iteration of improved model probing API.
- Internal: Improved testing system for model-related tests.
- Internal: Port LLaVA OV models to use new API.
- Internal: Cleaned up a lot of model-related code.
- Internal: Support hot reload for custom nodes. Thanks @keturn!
- Updated translations. Thanks @rikublock @Harvester62 @Linos1391!
Installing and Updating
The new Invoke Launcher is the recommended way to install, update and run Invoke. It takes care of a lot of details for you - like installing the right version of python - and runs Invoke as a desktop application.
Follow the Quick Start guide to get started with the launcher.
If you don't want to use the launcher, or need a headless install, you can follow the manual install guide.
What's Changed
- Add LLaVA OneVision model support by @RyanJDick in #7693
- Model classification api by @jazzhaiku in #7742
- feat(app): better errors when scanning models with picklescan by @psychedelicious in #7803
- Stripped models by @jazzhaiku in #7797
- Use git-lfs for larger assets by @jazzhaiku in #7804
- Add FLUX Fill support by @RyanJDick in #7780
- Customized workflow browsing by @maryhipp in #7810
- ui: workflows recommended followup by @psychedelicious in #7812
- fix(ui): clip shift-draw strokes to bbox when clip to bbox enabled by @psychedelicious in #7813
- feat(ui): support FLUX Fill on Canvas by @psychedelicious in #7811
- feat(ui): better missing fields ux in workflows & builder by @psychedelicious in #7816
- ui: translations update from weblate by @weblate in #7805
- chore: prep for v5.9.0rc1 by @psychedelicious in #7815
- fix(mm): flux model variant probing by @psychedelicious in #7825
- feat(nodes): clean up lora node names by @psychedelicious in #7824
- feat(ui): custom dropdowns for strings in builder by @psychedelicious in #7818
- fix(ui): siglip model translation key by @psychedelicious in #7827
- Port LLaVA to new API by @jazzhaiku in #7817
- fix(ui): mask logic in graph builders by @psychedelicious in #7828
- fix(ui): restore
display: flex
to image viewer and node editor by @psychedelicious in #7831 - ui: translations update from weblate by @weblate in #7821
- fix: make dev_reload work for files in nodes/ by @keturn in #7819
- chore: prep for v5.9.0rc2 by @psychedelicious in #7829
- Import Smoke Test by @jazzhaiku in #7835
- fix: make source location discovery more robust by @keturn in #7837
- fix(ui): long node titles overflow by @psychedelicious in #7838
- feat: show runtime config in about modal by @psychedelicious in #7840
- fix(ui): correctly render whitespace in strings in string generator previews by @psychedelicious in #7832
- Taxonomy by @jazzhaiku in #7833
- chore: prep for v5.9.0 by @psychedelicious in #7845
- fix(mm): LLaVA OneVision model calculates its own size by @psychedelicious in #7844
Full Changelog: v5.8.1...v5.9.0