🌟 Summary
The release of 'v8.3.38' introduces significant enhancements, particularly emphasizing video interaction capabilities through the new SAM2VideoPredictor
class for object segmentation and tracking in videos. This update also includes general improvements and optimizations across various modules.
📊 Key Changes
- SAM2VideoPredictor: A new class aimed at enhancing video segmentation and object tracking, supporting advanced interactions such as prompts for segment modifications.
- Improved Video Segmentation: Features non-overlapping masks, better memory management, and support for interactive user prompts.
- Configuration Clean-Up: Removal of obsolete parameters such as
label_smoothing
. - Platform Compatibility: Extended detection for NVIDIA Jetson devices, accommodating more models.
- Documentation and Code Updates: Adjustments for improved clarity and accuracy in both code and documentation.
🎯 Purpose & Impact
- 📽️ Enhanced Video Interaction: The
SAM2VideoPredictor
allows users to fine-tune video processing outputs dynamically, making video segmentation more precise and interactive. - 🚀 Efficiency & Resource Management: Optimized memory use during video segmentation leads to faster inferencing and resource savings, beneficial for running on resource-constrained devices.
- 🛠️ Code Simplification: Removing unnecessary parameters like
label_smoothing
helps streamline configuration settings, reducing potential user confusion. - 📱 Broader Device Support: Updating device compatibility ensures the software is functional across a wider range of hardware, improving the user experience for those utilizing NVIDIA Jetson platforms.
- 📚 Improved User Documentation: Enhanced documentation aids both beginners and advanced users by making it easier to understand and implement model configurations and changes efficiently.
What's Changed
- Delete .github/workflows/codeql.yaml by @glenn-jocher in #17772
- Fix
RepC3
module for RT-DETR models by @Andrewymd in #17086 - Removes unused argument
label_smoothing
by @Burhan-Q in #16014 - Concat all segments by default for multi-part masks by @Y-T-G in #16826
- Improve
is_jetson
to support more Jetson devices by @lakshanthad in #17770 - Fix DLA export by @Laughing-q in #17765
- Fix CI.md CodeQL badges by @glenn-jocher in #17779
- Fix Prettier docs issues by @glenn-jocher in #17798
- Convert frames to RGB before SAHI inference by @Y-T-G in #17795
- Fix parking points annotator for macOS by @RizwanMunawar in #17797
- Fix
forward_fuse
comment by @arun477 in #17714 ultralytics 8.3.38
SAM 2 video inference by @Laughing-q in #14851
New Contributors
- @Andrewymd made their first contribution in #17086
- @arun477 made their first contribution in #17714
Full Changelog: v8.3.37...v8.3.38