Core
- Add delete_creating_tasks option for internal.free(). #4588
Tune
- Add filter flag for Tune CLI. #4337
- Better handling of tune.function in global checkpoint. #4519
- Add compatibility to nevergrad 0.2.0+. #4529
- Add --columns flag for CLI. #4564
- Add checkpoint eraser. #4490
- Fix checkpointing for Gym types. #4619
RLlib
- Report sampler performance metrics. #4427
- Ensure stats are consistently reported across all algos. #4445
- Clean up TFPolicyGraph. #4478
- Make batch timeout for remote workers tunable. #4435
- Fix inconsistent weight assignment operations in DQNPolicyGraph. #4504
- Add support for LR schedule to DQN/APEX. #4473
- Add option for RNN state and value estimates to span episodes. #4429
- Create a combination of ExternalEnv and MultiAgentEnv, called ExternalMultiAgentEnv. #4200
- Support prev_state/prev_action in rollout and fix multiagent. #4565
- Support torch device and distributions. #4553
Java
- TestNG outputs more verbose error messages. #4507
- Implement GcsClient. #4601
- Avoid unnecessary memory copy and add a benchmark. #4611