github kubeflow/mpi-operator v0.7.0

Changes since v0.6.0

  • Features:
    • Support custom cluster domain in MPI hostfile generation. (#704, #707, #738, @tenzen-y)
    • Enable Service publishNotReadyAddresses when runLauncherAsWorker is enabled, to improve DNS discovery for workers. (#703, @tenzen-y)
    • Expose job controller workqueue rate-limiting configuration via operator flags to improve scalability tuning. (#674, @rotemelad)
  • Bug fixes:
    • Fix crash in PodGroup when runLauncherAsWorker=true. (#669, @GonzaloSaez)
    • Fix missing ReplicaIndexLabel when runLauncherAsWorker=true so the launcher pod gets the expected pod index label (helps Kueue/TAS rank discovery). (#690, @GonzaloSaez)
  • Clean ups:
    • Upgrade Kubernetes dependencies to v1.34. (#742, @tenzen-y)
    • Fix kustomize v5 warnings in manifests. (#700, @vikas-saxena02)
    • Upgrade Debian version to trixie and upgrade the MPI versions as follows: (#685, @tenzen-y)
      • OpenMPI: v4.1.4 -> v5.0.7
      • MPICH: v3.4.1 -> v4.2.1
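
To illustrate the custom cluster domain feature listed above, here is a minimal sketch of how a hostfile entry might change when a cluster domain is appended. It is not the operator's code: the function name `hostfile_lines`, its parameters, and the assumed host naming pattern `<job>-worker-<i>.<job>.<namespace>.svc` with an OpenMPI-style `slots=` field are illustrative assumptions.

```python
def hostfile_lines(job_name, namespace, workers, slots, cluster_domain=""):
    """Build OpenMPI-style hostfile lines for worker pods (hypothetical helper).

    Assumed host naming: <job>-worker-<i>.<job>.<namespace>.svc[.<cluster_domain>]
    """
    # Append the cluster domain only when one is configured.
    suffix = f".{cluster_domain}" if cluster_domain else ""
    return [
        f"{job_name}-worker-{i}.{job_name}.{namespace}.svc{suffix} slots={slots}"
        for i in range(workers)
    ]


# Example: two workers in the "default" namespace with a custom cluster domain.
for line in hostfile_lines("pi", "default", workers=2, slots=1,
                           cluster_domain="cluster.local"):
    print(line)
```

With a domain set, each host becomes a fully qualified name, which can matter when the cluster's DNS search path does not resolve the shorter `.svc` form.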

Acknowledgments

Thank you to all the contributors (in no particular order): @rotemelad @mimowo @terrytangyuan @GonzaloSaez @vikas-saxena02 @tenzen-y
