huggingface/pytorch-image-models v1.0.20 on GitHub

Sept 21, 2025

Remap DINOv3 ViT weight tags from lvd_1689m -> lvd1689m to match (same for sat_493m -> sat493m)
Release 1.0.20

DINOv3 (https://arxiv.org/abs/2508.10104) ConvNeXt and ViT models added. ConvNeXt models were mapped to existing timm model. ViT support done via the EVA base model w/ a new RotaryEmbeddingDinoV3 to match the DINOv3 specific RoPE impl
- HuggingFace Hub: https://huggingface.co/collections/timm/timm-dinov3-68cb08bb0bee365973d52a4d
MobileCLIP-2 (https://arxiv.org/abs/2508.20691) vision encoders. New MCI3/MCI4 FastViT variants added and weights mapped to existing FastViT and B, L/14 ViTs.
MetaCLIP-2 Worldwide (https://arxiv.org/abs/2507.22062) ViT encoder weights added.
SigLIP-2 (https://arxiv.org/abs/2502.14786) NaFlex ViT encoder weights added via timm NaFlexViT model.
Misc fixes and contributions

Pass init_values at hieradet_sam2 by @hassonofer in #2559
Add mobileclip2 encoder weights by @rwightman in #2560
Add support for Gemma 3n MobileNetV5 encoder weight loading by @rwightman in #2561
Fix #2562, add siglip2 naflex vit encoder weights by @rwightman in #2564
fix: create results_dir if missing before saving results by @zhima771 in #2576
feat(validate): add precision, recall, and F1 metrics by @ha405 in #2568
Allow user to ask for features other than image and label in ImageDataset by @grodino in #2571
Add MobileCLIP2 image encoders by @rwightman in #2578
Add DINOv3 support by @rwightman in #2579

Full Changelog: v1.0.19...v1.0.20