Full Changelog: v0.12.2...v0.13.0
Changes
- Skip
NVIDIA DGX Display
devices when generating labels. - Fail on startup if no valid resources are detected
- Bump GFD subchart to version 0.7.0
Changes from v0.13.0-rc.3
- Use
nodeAffinity
instead ofnodeSelector
by default in daemonsets - Add
machine-file-path
option to GFD config flags - Mount
/sys
instead of/sys/class/dmi/id/product_name
in GPU Feature Discovery daemonset - Bump GFD subchard to version 0.7.0-rc.3
Changes from v0.13.0-rc.2
- Bump cuda base image to 11.8.0
- Use consistent indendation in YAML manifests
- Fix bug from v0.13.0-rc.1 when using mig-strategy="mixed"
- Add logged error message if setting up health checks fails
- Support MIG devices with 1g.10gb+me profile
- Distribute replicas evenly across GPUs during allocation
- Bump GFD subchart to version 0.7.0-rc.2
Changes from v0.13.0-rc.1
- Improve health checks to detect errors when waiting on device events
- Log ECC error events detected during health check
- Add the GIT sha to version information for the CLI and container images
- Use NVML interfaces from go-nvlib to query devices
- Refactor plugin creation from resources
- Add a CUDA-based resource manager that can be used to expose integrated devices on Tegra-based systems
- Bump GFD subchart to version 0.7.0-rc.1
Note:
The container image nvcr.io/nvidia/k8s-device-plugin:v0.13.0-ubi8
contains the following high-severity CVEs:
- CVE-2022-42898 - Vulnerability found in os package type (rpm) - krb5-libs