hiyouga/LLaMA-Factory v0.1.6
v0.1.6: DPO Training and Qwen-7B

on GitHub

latest releases: v0.9.1, v0.9.0, v0.8.3...

15 months ago

Adapt DPO training from the TRL library
Support fine-tuning the Qwen-7B, Qwen-7B-Chat, XVERSE-13B, and ChatGLM2-6B models
Implement the "safe" ChatML template for Qwen-7B-Chat
Better Web UI
Pretty readme by @codemayq #382
New features: #395 #451
Fix InternLM-7B inference #312
Fix bugs: #351 #354 #361 #376 #408 #417 #420 #423 #426

Don't miss a new LLaMA-Factory release

NewReleases is sending notifications on new releases.