github hiyouga/LLaMA-Factory v0.3.0
v0.3.0: Full-Parameter RLHF

13 months ago

New features

  • Support full-parameter RLHF training: reward modeling (RM) and PPO
  • Refactor llmtuner core in #1525 by @hiyouga
  • Better LLaMA Board: full-parameter RLHF and demo mode

New models

  • Base models
    • ChineseLLaMA-1.3B
    • LingoWhale-8B
  • Instruct/Chat models
    • ChineseAlpaca-1.3B
    • Zephyr-7B-Alpha/Beta

Bug fix
