TorchVision 0.26 is out! It is compatible with torch 2.11. It's a small release that comes with the following changes:
Breaking changes and deprecations
The video decoding and encoding utilities of TorchVision, which have been deprecate for a long time, are now removed. This includes torchvision.io.video.*, read_video, write_video, the VideoReader class, etc. Users are encouraged to switch to TorchCodec, which is faster and more stable.
The rare torchvision utilities that were still relying on video decoding (like the video datasets) have been transparently migrated to TorchCodec.
Note: the image decoders and encoders are staying in TorchVision.
Improvements
[ops] Speed up masks_to_boxes on CPU and GPU (#9358)
[ops] Improve runtime complexity of roi_align on MPS (#9100)
Various code quality improvements (#8760, #9364, #9317, #9359, #9334, #9286, #9327)
Various documentation improvements (#9339, #9374, #9323, #9324, #8879, #9350)
Bug Fixes
[transforms] Fix edge case conversion from CXCYWH to XYXY for integer bounding boxes in F.convert_bounding_box_format (#9322)
[transforms] Fix tv_tensors.wrap to preserve subclass types for BoundingBoxes and KeyPoints (#9332)
[transforms] Fix incorrect normalization axis in v2.ElasticTransform (#9300)
[ops] Fix masks_to_boxes for empty masks (#9357)
[io] Fix CPU jpeg and png decoder/encoder error-path leak on malformed inputs (#9434)
Contributors
🎉 We're grateful for our community, which helps us improve Torchvision by submitting issues and PRs, and providing feedback and suggestions. The following persons have contributed patches for this release:
Adam J. Stewart, Andrey Talman, Jaebeom, MPSFuzz , Murat Raimbekov, Nicolas Hug, ribbon-otter , Roy Hvaara, Salman Chishti, Scott Todd, Zhitao Yu