huggingface/trl v0.2.0 on GitHub

Highlights

General decoder model support in addition to GPT-2 in #53
Encoder-decoder model support (such as T5) in #93
New, shiny docs with the doc-builder in #59
push_to_hub with PPOTrainer in #68
Simple reference model creation with layer sharing in #61

What's Changed

Remove nbdev dependency by @younesbelkada in #52
Adds github actions and dummy test by @edbeeching in #55
Update README.md by @Keith-Hon in #51
Update README.md by @TristanThrush in #49
Adds Python highlighting to the code block by @JulesGM in #45
xxxForCausalLM support by @younesbelkada in #53
[VHead] Fix slow convergence issue by @younesbelkada in #60
add docbuilder skeleton by @lvwerra in #59
fix docs workflow by @lvwerra in #63
accelerate integration by @younesbelkada in #58
add create_reference_model by @lvwerra in #61
Improve Makefile and code quality by @lvwerra in #62
Relax requirements by @lvwerra in #66
modeling - change namings by @younesbelkada in #65
[PPOTrainer] make the reference model optional by @younesbelkada in #67
Improvements 1a by @edbeeching in #70
update GitHub actions to main by @lvwerra in #77
[core] refactor step method by @younesbelkada in #76
[PPOTrainer] Support generic optimizers by @younesbelkada in #78
Update sentiment_tuning.mdx by @eltociear in #69
Remove references to "listify_batch" by @xiaoyesoso in #81
Collater -> collator by @LysandreJik in #88
Model as kwarg in pipeline by @LysandreJik in #89
Small typo correction by @LysandreJik in #87
[API] Make dataset attribute optional by @younesbelkada in #85
[Doc] Improve docs by @younesbelkada in #91
[core] Push v_head when using AutoModelForCausalLMWithValueHead by @younesbelkada in #86
[core] remove wandb dependency by @younesbelkada in #92
add logo by @lvwerra in #95
Encoder-Decoder models support by @younesbelkada in #93
Fix docs hyperlinks by @lewtun in #98
[API] LR scheduler support by @younesbelkada in #96
Version should have dev0 unless it is a release version by @mishig25 in #99
[core] improve API by @younesbelkada in #97
Add push to Hub for PPOTrainer by @lewtun in #68
[core] Advise to use fbs=1 by @younesbelkada in #102
[Doc] New additions by @younesbelkada in #105
restructure examples by @lvwerra in #107
Fix nits & missing things by @younesbelkada in #108
Convert notebook 05 by @edbeeching in #80

New Contributors

@lvwerra made their first contribution in #2
@vblagoje made their first contribution in #16
@dependabot made their first contribution in #26
@younesbelkada made their first contribution in #52
@edbeeching made their first contribution in #55
@Keith-Hon made their first contribution in #51
@TristanThrush made their first contribution in #49
@JulesGM made their first contribution in #45
@eltociear made their first contribution in #69
@xiaoyesoso made their first contribution in #81
@LysandreJik made their first contribution in #88
@lewtun made their first contribution in #98
@mishig25 made their first contribution in #99

Full Changelog: https://github.com/lvwerra/trl/commits/v0.2.0