github huggingface/trl v0.4.0

20 months ago

v0.4.0: peft integration

Apply RLHF and fine-tune your favorite large model on a consumer GPU using peft and trl! You can also easily share your trained RLHF adapters on the Hub with a few lines of code.

With this integration you can train gpt-neo-x (a 20B-parameter model, 40GB in bfloat16) on a 24GB consumer GPU!
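The memory figures above can be checked with a little arithmetic. The snippet below is only a rough sketch: it counts the model weights alone and ignores activations, adapter weights, and optimizer state. The `int8` row is an assumption about how a 40GB model might be brought under 24GB (for example by loading the frozen base model in 8-bit and training only small peft adapters); the release note itself does not spell out the mechanism. The helper name `weight_memory_gb` is hypothetical.

```python
# Back-of-the-envelope weight memory for a 20B-parameter model.
# Counts weights only; activations, adapters and optimizer state are excluded.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Return the memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


NUM_PARAMS = 20_000_000_000  # "20B" from the release note

for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_memory_gb(NUM_PARAMS, dtype):.0f} GB")
```

Under these assumptions, bfloat16 weights come to 40 GB, matching the figure quoted above, while 8-bit weights would come to 20 GB, leaving some headroom on a 24GB card.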

What's Changed

New Contributors

Full Changelog: v0.3.1...v0.4.0
