- Fix attention mask for SDPA
- Improve performance by 20-30%, with larger gains when used with marker
- Add more model loader options
What's Changed
- Backport by @VikParuchuri in #449
- Foundation Model Performance Improvements by @tarun-menta in #451
- Dev by @VikParuchuri in #452
- Bump version by @VikParuchuri in #453
Full Changelog: v0.16.3...v0.16.4