Highlights
- General decoder model support in addition to GPT-2 in #53
- Encoder-decoder model support (such as T5) in #93
- New, shiny docs with the
doc-builder
in #59 push_to_hub
with PPOTrainer in #68- Simple reference model creation with layer sharing in #61
What's Changed
- Remove
nbdev
dependency by @younesbelkada in #52 - Adds github actions and dummy test by @edbeeching in #55
- Update README.md by @Keith-Hon in #51
- Update README.md by @TristanThrush in #49
- Adds Python highlighting to the code block by @JulesGM in #45
xxxForCausalLM
support by @younesbelkada in #53- [
VHead
] Fix slow convergence issue by @younesbelkada in #60 - add docbuilder skeleton by @lvwerra in #59
- fix docs workflow by @lvwerra in #63
accelerate
integration by @younesbelkada in #58- add create_reference_model by @lvwerra in #61
- Improve Makefile and code quality by @lvwerra in #62
- Relax requirements by @lvwerra in #66
- modeling - change namings by @younesbelkada in #65
- [
PPOTrainer
] make the reference model optional by @younesbelkada in #67 - Improvements 1a by @edbeeching in #70
- update GitHub actions to
main
by @lvwerra in #77 - [core] refactor
step
method by @younesbelkada in #76 - [
PPOTrainer
] Support generic optimizers by @younesbelkada in #78 - Update sentiment_tuning.mdx by @eltociear in #69
- Remove references to "listify_batch" by @xiaoyesoso in #81
- Collater -> collator by @LysandreJik in #88
- Model as kwarg in pipeline by @LysandreJik in #89
- Small typo correction by @LysandreJik in #87
- [API] Make
dataset
attribute optional by @younesbelkada in #85 - [Doc] Improve docs by @younesbelkada in #91
- [core] Push
v_head
when usingAutoModelForCausalLMWithValueHead
by @younesbelkada in #86 - [core] remove
wandb
dependency by @younesbelkada in #92 - add logo by @lvwerra in #95
- Encoder-Decoder models support by @younesbelkada in #93
- Fix docs hyperlinks by @lewtun in #98
- [API] LR scheduler support by @younesbelkada in #96
- Version should have
dev0
unless it is a release version by @mishig25 in #99 - [core] improve API by @younesbelkada in #97
- Add push to Hub for PPOTrainer by @lewtun in #68
- [
core
] Advise to usefbs=1
by @younesbelkada in #102 - [Doc] New additions by @younesbelkada in #105
- restructure examples by @lvwerra in #107
- Fix nits & missing things by @younesbelkada in #108
- Convert notebook 05 by @edbeeching in #80
New Contributors
- @lvwerra made their first contribution in #2
- @vblagoje made their first contribution in #16
- @dependabot made their first contribution in #26
- @younesbelkada made their first contribution in #52
- @edbeeching made their first contribution in #55
- @Keith-Hon made their first contribution in #51
- @TristanThrush made their first contribution in #49
- @JulesGM made their first contribution in #45
- @eltociear made their first contribution in #69
- @xiaoyesoso made their first contribution in #81
- @LysandreJik made their first contribution in #88
- @lewtun made their first contribution in #98
- @mishig25 made their first contribution in #99
Full Changelog: https://github.com/lvwerra/trl/commits/v0.2.0