github huggingface/trl v0.7.9
v0.7.9: Patch release for DPO & SFTTrainer

latest releases: v0.8.6, v0.8.5, v0.8.4...
10 months ago

v0.7.9: Patch release for DPO & SFTTrainer

This is a patch release that fixes critical issues with SFTTrainer & DPOTrainer, together with minor fixes for PPOTrainer and DataCollatorForCompletionOnlyLM

What's Changed

Full Changelog: v0.7.8...v0.7.9

Don't miss a new trl release

NewReleases is sending notifications on new releases.