github hiyouga/LLaMA-Factory v0.8.3
v0.8.3: Neat Packing, Split Evaluation

latest release: v0.9.0
3 months ago

New features

New models

  • Base models
    • InternLM2.5-7B 📄
    • Gemma2 (9B/27B) 📄
  • Instruct/Chat models
    • TeleChat-1B-Chat by @hzhaoy in #4651 📄🤖
    • InternLM2.5-7B-Chat 📄🤖
    • CodeGeeX4-9B-Chat 📄🤖
    • Gemma2-it (9B/27B) 📄🤖

Changes

  • Fix DPO cutoff len and deprecate reserved_label_len argument
  • Improve loss function for reward modeling

Bug fix

Don't miss a new LLaMA-Factory release

NewReleases is sending notifications on new releases.