github huggingface/trl v0.4.5

latest releases: v0.8.6, v0.8.5, v0.8.4...
15 months ago

Patch release 1 - SFTTrainer enhancements and fixes

This patch release adds multiple fixes for the SFTTrainer and enhancements. Another patch release is coming for fixing an issue with PPOTrainer and Google Colab combined with wandb logging

What's Changed

New Contributors

Full Changelog: v0.4.4...v0.4.5

Don't miss a new trl release

NewReleases is sending notifications on new releases.