π Summary
v8.4.79 is a performance-focused release led by a major upgrade to semantic segmentation exports π, plus several quality-of-life improvements in deployment, docs, and developer tooling.
π Key Changes
-
Big semantic segmentation export boost β‘
The standout update from PR #24598 by @onuralpszr makes ONNX and TFLite semantic segmentation exports much faster and lighter by baking the class selection step directly into the exported model.- Exported models now output compact class maps instead of large raw score tensors.
- Reported output size can drop from about 159 MB to 2 MB π
- Inference can be up to 10Γ faster for this use case.
-
Better semantic export compatibility π§
Runtime and postprocessing were updated so exported semantic models work correctly whether they return:- traditional floating-point logits, or
- the new precomputed integer class maps.
-
Smaller and simpler similarity search setup π
PR #24931 by @glenn-jocher removes heavier optional dependencies from thesolutionsextra:- FAISS was replaced with a NumPy-based search approach
- openai-clip is no longer bundled there as an install dependency
This makes installs leaner while keeping semantic image search working.
-
Tracking code cleanup with no output changes π οΈ
PR #24930 by @glenn-jocher simplifies tracker internals by removing duplicated Kalman update logic and reusing helper functions.- Important note: ByteTrack, BoT-SORT, and OC-SORT behavior stays effectively unchanged.
-
Clearer validation for RegionCounter π‘οΈ
PR #24890 adds a more understandable error when a region polygon has too few points, making debugging easier for users building region-based counting apps. -
Large documentation refresh across deployment and workflow guides π
Many merged PRs improved guides for:- AzureML, Modal, Triton, Raspberry Pi, DeepStream, DALI, and ROS
- preprocessing, evaluation, fine-tuning, deployment choices, monitoring, and troubleshooting
These changes donβt introduce major runtime features, but they make setup and usage more accurate and easier to follow.
π― Purpose & Impact
-
Faster semantic segmentation on edge and mobile π±
The main export change is especially valuable for ONNX and TFLite deployments, where output size and postprocessing overhead matter a lot. -
Lower memory and bandwidth costs πΎ
By returning compact class IDs instead of huge dense outputs, semantic models become much more practical for embedded devices, mobile apps, and production pipelines. -
Simpler deployment pipelines π
Baking argmax into the exported model means less custom postprocessing code outside the model, which reduces integration complexity and potential errors. -
Lighter installs for solution users πͺΆ
The dependency cleanup should make optional solution environments easier to install and maintain, especially in constrained or CI environments. -
Improved reliability and usability β
Better validation messages, cleaned-up internals, and more accurate docs help both new and advanced users work faster with fewer confusing issues.
Overall, this release is most important for anyone using semantic segmentation with ONNX or TFLiteβthose users should see the biggest real-world gains in speed, output size, and deployment simplicity π
What's Changed
- Fix model reference artifacts and restructure defining project goals guide by @raimbekovm in #24797
- Refactor Modal, AzureML, and Triton deployment guides by @raimbekovm in #24928
- Refactor computer vision project lifecycle guides by @raimbekovm in #24921
- Update Raspberry Pi, DeepStream, DALI, and ROS deployment guides by @raimbekovm in #24925
- Refactor inference and troubleshooting guides by @raimbekovm in #24923
- Restructure monitoring solutions guides and remove boilerplate sections by @raimbekovm in #24911
- Fix metrics API usage and warmup_epochs docs in model evaluation guide by @raimbekovm in #24904
- Correct normalization and augmentation docs in data preprocessing guide by @raimbekovm in #24902
- Complete Google-style docstrings for NumPy helper functions by @glenn-jocher in #24929
- Deduplicate tracker Kalman update and box helpers by @glenn-jocher in #24930
- Standardize Mermaid diagram colors across docs by @glenn-jocher in #24932
- Remove faiss-cpu, openai-clip, and setuptools pins from solutions extra by @glenn-jocher in #24931
- Add https://youtu.be/eZvZEzslTz8 to docs by @RizwanMunawar in #24938
- Add Codecov badges to CI docs by @glenn-jocher in #24941
- Clarify Platform dataset ingest API flow by @glenn-jocher in #24943
- Raise clear error for region counter regions with fewer than 3 points by @raimbekovm in #24890
- Optimize semantic class-map exports by @onuralpszr in #24598
Full Changelog: v8.4.78...v8.4.79