Detailed Changes
Added
-
Added Databricks storage DBFSLocalStore and support for GPU-aware scheduling to horovod.spark Estimator. (#2234)
-
Added ElasticSampler and PyTorch Elastic ImageNet example. (#2297)
-
Added ability to dynamically start and stop timeline programmatically. (#2215)
-
Added support for Gloo on macOS. (#2254)
-
Exposed name argument to TensorFlow allreduce operation. (#2325)
-
Added option to strip outer name scope from Horovod ops in TensorFlow. (#2328)
Fixed
-
Fixed usage of VERBOSE=1 when setting custom MAKEFLAGS. (#2239)
-
Fixed bugs in Keras Elastic Callback classes. (#2289)
-
Fixed RelWithDebInfo build and made it the default with -O3 optimizations. (#2305)
-
Fixed usage of tf.cond in TensorFlow alltoall gradient. (#2327)
-
Fixed allreduce averaging for TF IndexedSlices in ROCm path. (#2279)
-
Included stdexcept to handle certain compilers / frameworks that don't include it already. (#2238)
-
Fixed Debug builds by setting compiler options based on CMake build type. (#2263)
-
Skipped launching zero-sized send/recvs for NCCLAlltoall. (#2273)
-
Fixed missing run in tf keras elastic mode. (#2272)
-
Fixed loss function in TensorFlow2 elastic synthetic benchmark. (#2265)
-
Fixed usage of HOROVOD_MIXED_INSTALL env var in alltoall tests. (#2266)
-
Removed keras requirement from Ray example. (#2262)