GitHub p-e-w/heretic v1.1.0

one day ago

Changes

  • @mbarnson added basic MPS (Apple Silicon) support in #5
  • @red40maxxer reduced memory usage in #15
  • @Ooooze added IBM Granite MoE support in #14
  • @kldzj added multi-GPU support in #17 and #32
  • @ricyoung fixed an error when Hugging Face user profile fields are missing in #20
  • @tymat added support for MXFP4 quantized models with Triton tensors in #28
  • @spikymoth improved support for loading local datasets in #33
  • @kldzj added support for models that require trust_remote_code in #31
  • @Vinayyyy7 added notebook (Colab/Kaggle) compatibility in #42
  • @Vinayyyy7 fixed loading for certain models that default to the float32 dtype in #44
  • @spikymoth improved refusal detection in #45
  • @red40maxxer added a PR title lint to CI in #66
  • @p-e-w added research features, support for stopping the optimization process early, and support for thinking models, and implemented an important padding fix suggested by @accemlcc

Full Changelog: v1.0.1...v1.1.0
