What's Changed
🚀 Features
- feat: update energy interface to expose a power method by @KaiyiLiu1234 in #2365
- feat(hwmon): implement device reader for architectures with hwmon sensors by @KaiyiLiu1234 in #2372
- feat(redfish): add graceful degradation when BMC is unavailable by @vprashar2929 in #2379
- feat(gpu): add GPU power monitoring foundation by @vimalk78 in #2378
- feat(gpu): add NVIDIA GPU power monitoring via NVML by @vimalk78 in #2380
- feat(gpu): add NVIDIA GPU power monitoring via NVML by @vimalk78 in #2384
- feat(gpu): add idle power detection and configurable override by @vimalk78 in #2388
- feat(k8s): replace API server watch with kubelet /pods polling by @vimalk78 in #2369
- feat(gpu): add container and pod GPU power metrics by @vimalk78 in #2404
- feat(hwmon): add current and voltage readings to hwmon power meter by @KaiyiLiu1234 in #2385
- feat(kvm): support kvm vm processes on proxmox by @brunnert in #2419
- feat(gpu): add GPU energy joules metrics at all levels by @vimalk78 in #2405
🐛 Bug Fixes
- fix(helm): use image.tag="" otherwise it will always use the latest tag by @yellowhat in #2389
- fix(device): ensure deterministic hwmon zone ordering by @vimalk78 in #2396
- fix(monitor): resolve race condition in collection goroutine lifecycle by @vimalk78 in #2394
📖 Documentation
- docs: add README for agents by @vprashar2929 in #2356
- docs: add CLAUDE.md symlink by @vprashar2929 in #2364
- docs: update metricsLevel config by @vprashar2929 in #2377
- docs(proposal): add EP-003 GPU Power Monitoring enhancement proposal by @vimalk78 in #2367
New Contributors
- @yellowhat made their first contribution in #2389
- @brunnert made their first contribution in #2419
- @cmcantalupo made their first contribution in #2416
Full Changelog: v0.11.3...v0.11.4