Sept 21, 2025
- Remap DINOv3 ViT weight tags from
lvd_1689m
->lvd1689m
to match (same forsat_493m
->sat493m
) - Release 1.0.20
Sept 17, 2025
- DINOv3 (https://arxiv.org/abs/2508.10104) ConvNeXt and ViT models added. ConvNeXt models were mapped to existing
timm
model. ViT support done via the EVA base model w/ a newRotaryEmbeddingDinoV3
to match the DINOv3 specific RoPE impl - MobileCLIP-2 (https://arxiv.org/abs/2508.20691) vision encoders. New MCI3/MCI4 FastViT variants added and weights mapped to existing FastViT and B, L/14 ViTs.
- MetaCLIP-2 Worldwide (https://arxiv.org/abs/2507.22062) ViT encoder weights added.
- SigLIP-2 (https://arxiv.org/abs/2502.14786) NaFlex ViT encoder weights added via timm NaFlexViT model.
- Misc fixes and contributions
What's Changed
- Pass init_values at hieradet_sam2 by @hassonofer in #2559
- Add mobileclip2 encoder weights by @rwightman in #2560
- Add support for Gemma 3n MobileNetV5 encoder weight loading by @rwightman in #2561
- Fix #2562, add siglip2 naflex vit encoder weights by @rwightman in #2564
- fix: create results_dir if missing before saving results by @zhima771 in #2576
- feat(validate): add precision, recall, and F1 metrics by @ha405 in #2568
- Allow user to ask for features other than image and label in ImageDataset by @grodino in #2571
- Add MobileCLIP2 image encoders by @rwightman in #2578
- Add DINOv3 support by @rwightman in #2579
New Contributors
- @hassonofer made their first contribution in #2559
- @zhima771 made their first contribution in #2576
- @ha405 made their first contribution in #2568
Full Changelog: v1.0.19...v1.0.20