hiyouga/LlamaFactory v0.3.0
v0.3.0: Full-Parameter RLHF

on GitHub

latest releases: v0.9.4, v0.9.3, v0.9.2...

2 years ago

New features

Support full-parameter RLHF training (RM & PPO)
Refactor llmtuner core in #1525 by @hiyouga
Better LLaMA Board: full-parameter RLHF and demo mode

New models

Base models
- ChineseLLaMA-1.3B
- LingoWhale-8B
Instruct/Chat models
- ChineseAlpaca-1.3B
- Zephyr-7B-Alpha/Beta

Bug fix

Fix bugs in partial-parameter (freeze) tuning
Fix #224 #336 #931 #936 #1011 #1489 #1494 #1507 #1514

Check out latest releases or
releases around hiyouga/LlamaFactory v0.3.0

Don't miss a new LlamaFactory release

NewReleases is sending notifications on new releases.

Get notifications