v4.2.6 includes a handful of fixes and improvements, plus three major changes:
- Gallery updates
- Tiled upscaling via
MultiDiffusion
- Checkpoint models work without conversion to diffusers
Known Issues
Our last release, v4.2.5, was quickly pulled after a black image issue on MPS (macOS) was discovered. We also had reports of CUDA (Nvidia) GPUs getting unexpected OOM (Out of Memory) errors.
The MPS issue is resolved in this release, but we haven't been able to replicate unexpected OOMs on Linux or Windows. We did fix one issue that may have been a factor.
If you get OOMs on this alpha release with settings that worked fine on v4.2.4 - or have any other issues - please let us know via GH issues or discord.
Gallery Updates
We've made some changes to the gallery, adding features, improving the performance of the app and reducing memory usage. The changes also fix a number of bugs relating to stale data - for example, a board not updating as expected after moving an image to it.
Thanks to @chainchompa and @maryhipp for working on this major effort.
Pagination & Selection
Infinite scroll is dead, long live infinite scroll!
The gallery is now paginated. Selection logic has been updated to work with pagination. An indicator shows how many images are selected and allows you to clear the selection entirely. Arrow keys still navigate.
Gallery.Pagination.and.Selection.mov
The number of images per page is dynamically calculated as the panel is resized, ensuring the panel is always filled with images.
Boards UI Refresh
The bulky tiled boards grid has been replaced by a scrollable list. The boards list panel is now a resizable, collapsible panel.
Boards.List.and.Resizable.Panel.mov
Boards and Image Search
Search for boards by name and images by metadata. The search term is matched against the image's metadata as a string. We landed on full-text search as a flexible yet simple implementation after considering a few methods for search.
Boards.and.Images.Search.mov
Archived Boards
Archive a board to hide it from the main boards list. This is purely an organizational enhancement. You can still interact with archived boards as you would any other board.
Archived.Boards.mov
Image Sorting
You can now change the sort for images to show oldest first. A switch allows starred images to be placed in the list according to their age, instead of always showing them first.
Image.Sorting.mov
Tiled Upscaling via MultiDiffusion
MultiDiffusion
is a fairly straightforward technique for tiled denoising. The gist is similar to other tiled upscaling methods - split the input image up in to tiles, process each independently, and stitch them back together. The main innovation for MultiDiffusion
is to do this in latent space, blending the tensors together continually. This results in excellent consistency across the output image, with no seams.
This feature is exposed as a Tiled MultiDiffusion Denoise Latents
node, currently classified as a beta version. It works much the same as the OG Denoise Latents
node. Here's a workflow to get you started: sd15_multi_diffusion_esrgan_x2_upscale.json
We are still thinking about to expose this in the linear UI. Most likely, we expose this with very minimal settings. If you want to tweak it, use the workflow.
Thanks to @RyanJDick for designing and implementing MultiDiffusion
.
How to use it
This technique is fundamentally the same as normal img2img. Appropriate use of conditioning and control will greatly improve the output. The one hard requirement is to use the Tile ControlNet model.
Besides that, here are some tips from our initial testing:
- Use a detail-adding or style LoRAs.
- Use a base model best suited for the desired output style.
- Prompts make a difference.
- The initial upscaling method makes a difference.
- Scheduler makes a difference. Some produce softer outputs.
VRAM Usage
This technique can upscale images to very large sizes without substantially increasing VRAM usage beyond what you'd see for a "normal" sized generation. The VRAM bottlenecks then become the first VAE encode (Image to Latents
) and final VAE decode (Latents to Image
) steps.
You may run into OOM errors during these steps. The solution is to enable tiling using the toggle on the Image to Latents
and Latents to Image
nodes. This allows the VAE operations to be done piecewise, similar to the tiled denoising process, without using gobs of VRAM.
There's one caveat - VAE tiling often introduces inconsistency across tiles. Textures and colors may differ from tile to tile. This is a function of the diffusers handling of VAE tiling, not the tiled denoising process introduced in v4.2.5. We are investigating ways to improve this.
Takeaway: If your GPU can handle non-tiled VAE encode and decode for a given output size, use that for best results.
Checkpoint models work without conversion to diffusers
The required conversion of checkpoint format models to diffusers format has long been a pain point. Diffusers now supports loading single-file (checkpoint) models directly, and we have removed the mandatory checkpoint-to-diffusers conversion step.
The main user-facing change is that there is no longer a conversion cache directory!
Major thanks to @lstein for getting this working.
📈 Patch Nodes for v4.2.6
Enhancements
- When downloading image metadata, graphs or workflows, the JSON file includes the image name and type of data. Thanks @jstnlowe!
- Add
clear_queue_on_startup
config setting to clear problematic queues. This is useful for a rare edge case where your queue is full of items that somehow crash the app. Set this to true, and the queue will clear before it has time to attempt to execute the problematic item. Thanks @steffy-lo! - Performance and memory efficiency improvements for LoRA patching and model offloading.
- Addition of a simplified model installation methods to the Invocation API:
download_and_cache_model
,load_local_model
andload_remote_model
. These methods allow models to be used without needing them to be added to the model manager. For example, we are now using these methods to load ESRGAN models. - Support for probing and loading SDXL VAE checkpoint.
- Updated gallery UI.
- Checkpoint models work without conversion to diffusers.
- When using a VAE in tiled mode, you may now select the tile size.
Fixes
- Fix handling handling of 0-step denoising process.
- If a control image's processed version is missing when the app loads, it is now re-processed.
- Fixed an issue where a model's size could be misreported as 0, possibly causing memory issues.
- Fixed an issue where images - especially large images - may fail to delete.
Performance improvements
- Improved LoRA patching.
- Improved RAM <-> VRAM model transfer performance.
Internal changes
- The
DenoiseLatentsInvocation
has had its internal methods split up to support tiled upscaling viaMultiDiffusion
. This included some amount of file shuffling and renaming. Theinvokeai
package's exported classes should still be the same. Please let us know if this has broken an import for you. - Internal cleanup, intending to eliminate circular import issues. There's a lot left to do for this issue, but we are making progress.
💾 Installation and Updating
To install or update to v4.2.6rc1, download the installer and follow the installation instructions.
To update, select the same installation location. Your user data (images, models, etc) will be retained.
Missing models after updating from v3 to v4
See this FAQ.
Error during installation ModuleNotFoundError: No module named 'controlnet_aux'
See this FAQ
What's Changed
- Prefixed JSON filenames with the image UUID by @jstnlowe in #6486
- feat(ui): control layers internals cleanup by @psychedelicious in #6487
- LoRA patching optimization by @lstein in #6439
- fix(ui): re-process control image if processed image is missing on page load by @psychedelicious in #6494
- Split up latent.py (code reorganization, no functional changes) by @RyanJDick in #6491
- Add simplified model manager install API to InvocationContext by @lstein in #6132
- fix: Some imports from previous PR's by @blessedcoolant in #6501
- Improve RAM<->VRAM memory copy performance in LoRA patching and elsewhere by @lstein in #6490
- Fix
DEFAULT_PRECISION
handling by @RyanJDick in #6492 - added route to install huggingface models from model marketplace by @chainchompa in #6515
- Model hash validator by @brandonrising in #6520
- Tidy
SilenceWarnings
context manager by @RyanJDick in #6493 - [#6333] Add clear_queue_on_startup config to clear problematic queues by @steffy-lo in #6502
- [MM] Add support for probing and loading SDXL VAE checkpoint files by @lstein in #6524
- Add
TiledMultiDiffusionDenoiseLatents
invocation (for upscaling workflows) by @RyanJDick in #6522 - Update prevention exception message by @hipsterusername in #6543
- Fix handling handling of 0-step denoising process by @RyanJDick in #6544
- chore: bump version v4.2.5 by @psychedelicious in #6547
- feat(ui): gallery pagination by @maryhipp in #6527
- fix(backend): mps should not use
non_blocking
by @psychedelicious in #6549 - Load single-file checkpoints directly without conversion by @lstein in #6510
- feat(ui, api): archive boards by @maryhipp in #6546
- Update controlnet starters by @hipsterusername in #6537
- feat(ui, api): gallery sort by @maryhipp in #6551
- fix(app): fix SQL query w/ enum for python 3.11 by @psychedelicious in #6557
- feat(ui, api): gallery search by @maryhipp in #6561
- Update README.md by @hipsterusername in #6519
- fix(ui): don't mess up redux actions if 403 gets thrown by @maryhipp in #6567
- (ui) clarify auto-add logic by @maryhipp in #6566
- feat(ui): gallery ux by @psychedelicious in #6568
- Make the VAE tile size configurable for tiled VAE by @RyanJDick in #6555
- fix(ui): reset page when search term changes by @psychedelicious in #6569
- Fix access token lookup for model downloads by @lstein in #6570
- fix sort order by @maryhipp in #6573
- Update pnpm github action by @ebr in #6574
- [MM bugfix] Put model install errors on the event bus by @lstein in #6578
- Fix some models treated as having size 0 in the model cache by @RyanJDick in #6571
- Enforce absolute imports with ruff by @RyanJDick in #6576
- Delete unused duplicate libc_util.py file by @RyanJDick in #6579
- Update the PR template QA instructions by @RyanJDick in #6580
- Fix circular imports related to SCHEDULER_NAME_VALUES by @RyanJDick in #6582
- Fixed 404 error on latest release link (line 16): by @ddm21 in #6583
- Demote error log to warning for models treated as having size 0 by @RyanJDick in #6589
- Overhaul docker documentation and fix minor bugs by @ebr in #6540
- Boards UI update and add support for private boards by @chainchompa in #6588
- More Styling Updates for Boards and Gallery UI by @chainchompa in #6592
- chore: bump version to 4.2.6a1 by @psychedelicious in #6595
- ui: translations update from weblate by @weblate in #6498
- Do not crash if there are invalid model configs in the DB by @RyanJDick in #6593
- fix(ui): gallery followups by @psychedelicious in #6601
- fix(app): vae tile size field description by @psychedelicious in #6600
- ui: translations update from weblate by @weblate in #6604
- fix(api): delete image race condition by @psychedelicious in #6607
- chore: bump version to 4.2.6rc1 by @psychedelicious in #6608
New Contributors
Full Changelog: v4.2.4...v4.2.6rc1