Changes
- @mbarnson added basic MPS (Apple Silicon) support in #5
- @red40maxxer reduced memory usage in #15
- @Ooooze added IBM Granite MoE support in #14
- @kldzj added multi-GPU support in #17 and #32
- @ricyoung fixed an error when Hugging Face user profile fields are missing in #20
- @tymat added support for MXFP4 quantized models with Triton tensors in #28
- @spikymoth improved support for loading local datasets in #33
- @kldzj added support for models that require
trust_remote_codein #31 - @Vinayyyy7 added notebook (Colab/Kaggle) compatibility in #42
- @Vinayyyy7 fixed loading for certain models that default to the float32 dtype in #44
- @spikymoth improved refusal detection in #45
- @red40maxxer added a PR title lint to CI in #66
- @p-e-w added research features, support for stopping the optimization process early, and support for thinking models, and implemented an important padding fix suggested by @accemlcc
New Contributors
- @mbarnson made their first contribution in #5
- @red40maxxer made their first contribution in #15
- @Ooooze made their first contribution in #14
- @kldzj made their first contribution in #17
- @ricyoung made their first contribution in #20
- @tymat made their first contribution in #28
- @spikymoth made their first contribution in #33
- @Vinayyyy7 made their first contribution in #42
Full Changelog: v1.0.1...v1.1.0