Core
- Improvements
- Python
- Java
- Allow users to set JVM options at actor creation time. #4970
- Internal
- Performance
Tune
- Add directional metrics for components. #4120, #4915
- Disallow setting `resources_per_trial` when it is already configured. #4880
- Make PBT Quantile fraction configurable. #4912
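
As an illustration of what a configurable quantile fraction controls in population-based training, here is a minimal sketch in plain Python. It does not call Ray and is not Tune's implementation: the function name `split_quantiles`, the score dictionary, and the cap at half the population are assumptions made for the example. The idea is that the worst-scoring fraction of trials becomes candidates to copy from the best-scoring fraction.

```python
import math


def split_quantiles(scores, quantile_fraction=0.25):
    """Return (bottom, top) trial names for a PBT-style exploit step.

    Hypothetical helper for illustration only. `scores` maps a trial
    name to its latest score, where higher is better.
    """
    if not 0 <= quantile_fraction <= 0.5:
        raise ValueError("quantile_fraction must be in [0, 0.5]")

    # Rank trials from worst to best by score.
    ranked = sorted(scores, key=scores.get)
    if len(ranked) <= 1:
        return [], []

    # Size of each quantile, capped so the two groups never overlap.
    n = min(math.ceil(len(ranked) * quantile_fraction), len(ranked) // 2)
    if n == 0:
        return [], []
    return ranked[:n], ranked[-n:]


scores = {"a": 0.1, "b": 0.5, "c": 0.9, "d": 0.3}
bottom, top = split_quantiles(scores, quantile_fraction=0.25)
print(bottom, top)
```

A larger fraction makes more trials eligible to be replaced at each perturbation step, while a fraction of 0 disables the exploit step in this sketch.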
RLlib
- Add QMIX mixer parameters to optimizer param list. #5014
- Allow Torch policies access to full action input dict in `extra_action_out_fn`. #4894
- Allow access to batches prior to postprocessing. #4871
- Throw error if `sample_async` is used with PyTorch for A3C. #5000
- Patterns & User Experience
- Documentation
Other Libraries
- Add support for distributed training with PyTorch. #4797, #4933
- Autoscaler will kill workers on exception. #4997
- Fix handling of non-integral timeout values in `signal.receive`. #5002
Thanks
We thank the following contributors for their amazing contributions: @jiangzihao2009, @raulchen, @ericl, @hershg, @kfstorm, @kiddyboots216, @jovany-wang, @pschafhalter, @richardliaw, @robertnishihara, @stephanie-wang, @simon-mo, @zhijunfu, @ls-daniel, @ajgokhale, @rueberger, @suquark, @guoyuhong, @pcmoritz, @hartikainen, @timonbimon, @TianhongDai