Features
- default to multizone karpenter deployment #6373 (Reed Schalo)
- Reduce hop token count (#6346) #6346 (Reed Schalo)
- Add kubelet configuration to EC2NodeClass V1 API (#6418) #6418 (Amanuel Engeda)
- Adding Conversion Webhooks for v1 EC2NodeClass (#6400) #6400 (Amanuel Engeda)
- implement v1 AMI selection (#6450) #6450 (Jason Deal)
- Drop Env Vars for v1 (#6576) #6576 (Reed Schalo)
- Change metrics port (#6578) #6578 (Reed Schalo)
- reimplement amiFamily (#6569) #6569 (Jason Deal)
- Add eks:eks-cluster-name tag and remove karpenter.sh/managed-by annotation (#6589) #6589 (Reed Schalo)
- Drop Support for Log Config, Add Log Paths (#6586) #6586 (Reed Schalo)
- convert unpinned Ubuntu EC2NodeClasses (#6699) #6699 (Jason Deal)
Bug Fixes
- Fix failure launching windows pod when instance type isn't in vpc resource controller config (#6415) #6415 (Jonathan Innis)
- add required startup taint to NodeClaim E2E test (#6486) #6486 (Reed Schalo)
- bottlerocket settings taint format (#6489) #6489 (Jason Deal)
- scheduling flake due to nodepool readiness (#6491) #6491 (Jason Deal)
- scheduling zone-id flake (#6493) #6493 (Jason Deal)
- ami family conversion (#6525) #6525 (Jason Deal)
- add terminationGracePeriod RBAC permissions (#6538) #6538 (Nick Tran)
- update to use karpv1 taint (#6558) #6558 (Reed Schalo)
- remove unused option (#6557) #6557 (Reed Schalo)
- Use ScheduleAnyway instead of DoNotSchedule (#6568) #6568 (Reed Schalo)
- put webhook disable option back (#6534) #6534 (Reed Schalo)
- bump go.mod (#6575) #6575 (Reed Schalo)
- return to DoNotSchedule (#6584) #6584 (Reed Schalo)
- Update AMIFamily for non-AL2023 AMIs (#6638) #6638 (Amanuel Engeda)
- Use NodeClaim annotations instead of NodeClass (#6654) #6654 (Amanuel Engeda)
- Make sure EbsOptimizedInfo exists when selecting instances by EBS maximum bandwidth (#6663) #6663 (Nicole Wren)
- allow v prefix for bottlerocket aliases (#6736) #6736 (Jason Deal)
Documentation
- explicitly call out CRD upgrade requirement (#6288) #6288 (Jason Deal)
- Adding Homa to the list of adopters (#6262) #6262 (afreyermuth98)
- drop eksctl note and bump min version (#6297) #6297 (Jason Deal)
- update website recommendations (#6317) #6317 (Nick Tran)
- adds more of ec2 spot heads up (#6219) #6219 (Justin Reasoner)
- update karpenter docs around soft affinity behavior (#6172) #6172 (Cameron McAvoy)
- fix KMS policy to wildcard Describe (#6349) #6349 (Jason Deal)
- add FAQ entry clarifying that startup taints are necessary to avoid excess node provisioning (#6267) #6267 (wmgroot)
- update stable release / rc wording (#6352) #6352 (Jason Deal)
- fix website for migrating from cas (#6385) #6385 (Jigisha Patil)
- Update disable disruption on nodepool to use budgets #6383 (jigisha620)
- add troubleshooting note on SGPP and max-pods (#6380) #6380 (Jason Deal)
- Scheduling concept doc GPU taint example value: true bool should be a string (#6405) #6405 (Sam Fulton)
- Drop old redirects (#6406) #6406 (Jonathan Innis)
- add disruption budgets by reason documentation (#6410) #6410 (Nick Tran)
- Update cloudformation reference with tls queue policy (#6416) #6416 (Jonathan Innis)
- change expiration to forceful (#6423) #6423 (Nick Tran)
- Fix
make docgen
for metrics generation (#6428) #6428 (Jonathan Innis) - Update docs to indicate requirements limit on NodePool is 100 (#6441) #6441 (Jigisha Patil)
- add karpenter startup taint custom userdata requirement (#6388) #6388 (Reed Schalo)
- Update preview for getting started guide for MNG and Fargate (#6523) #6523 (Amanuel Engeda)
- Adding X3M to the list of adopters (#6521) #6521 (Matias Reparaz)
- Updated examples to v1 (#6520) #6520 (Chris Negus)
- Update k8s version to be 1.30 instead of 1.3 on preview (#6556) #6556 (Amanuel Engeda)
- Update docs to correspond with the metric updates and bum karpenter version #6524 (jigisha620)
- Update counter metrics to have suffix _total #6577 (jigisha620)
- Adding contact to X3M adopters (#6588) #6588 (Matias Reparaz)
- Add missing sentence-ending periods (#6595) #6595 (Adi Harush)
- Add v1 RFC to the AWS CloudProvider repo (#6604) #6604 (Jonathan Innis)
- Update examples and docs to add security context (#6608) #6608 (Amanuel Engeda)
- Update docgen to latest (#6610) #6610 (Jonathan Innis)
- Update metrics to have stability assigned to them #6602 (jigisha620)
- Update supported description. (#6571) #6571 (Laitron)
- add terminationGracePeriod docs (#6640) #6640 (Nick Tran)
- Update comment associated with consolidateAfter #6656 (jigisha620)
- Update metrics description #6657 (jigisha620)
- Parametrize namespace on annotating karpenter-crd namespace (#6666) #6666 (Amanuel Engeda)
- Update cloudformation.yaml docs for v1 (#6674) #6674 (Chris Negus)
- Added v1.0.0 information to upgrade guide (#6560) #6560 (Chris Negus)
- add docs on consolidateAfter (#6678) #6678 (Nick Tran)
- Adding NodeClaims concept page to docs (#6470) #6470 (Chris Negus)
- update metrics port info (#6681) #6681 (Reed Schalo)
- run docgen (#6682) #6682 (Reed Schalo)
- Update v1beta1 to v1 examples (#6609) #6609 (Amanuel Engeda)
- Fix formatting in cloudformation docs (#6690) #6690 (Chris Negus)
- Update Add v1 Rollback (#6689) #6689 (Amanuel Engeda)
- NodeClass Ubuntu and Alias Updates (#6715) #6715 (Jason Deal)
- Update v1 migration guide to enable the conversion webhooks by helm controller (#6703) #6703 (Amanuel Engeda)
- Update URLs moved from v1beta1 to v1 (#6718) #6718 (Yonatan Goldschmidt)
- change GetParametersByPath back to GetParameter (#6726) #6726 (Nick Tran)
- Adding Karpenter version support note (#6724) #6724 (Amanuel Engeda)
- Update migration guide to filter NodeClaimNotFoundError #6734 (jigisha620)
- v1 NodeClass doc updates (#6720) #6720 (Jason Deal)
- Add wait for instance termination to changelog #6749 (jigisha620)
- run docgen and correct hyperlink (#6750) #6750 (Reed Schalo)
- update migration guide for variable release prefix (#6752) #6752 (Jason Deal)
- update note for amd (#6754) #6754 (Nick Tran)
Tests
- fix recent AMI test for EKS 1.30 (#6289) #6289 (Jason Deal)
- Fix scale test provisioning spike #6374 (jigisha620)
- Enable Windows IPAM at suite level (#6343) #6343 (Amanuel Engeda)
- fix nodeclaim test for custom labels in userData (#6492) #6492 (Jigisha Patil)
- Fix time out in scale tests (#6451) #6451 (Jigisha Patil)
- fix upgrade suite (#6505) #6505 (Reed Schalo)
- fix drift suite (#6506) #6506 (Reed Schalo)
- prevent resource leaks from go timeout (#6533) #6533 (Reed Schalo)
- fix failure in expiration tests (#6535) #6535 (Jigisha Patil)
- fix expiration test failure #6537 (jigisha620)
- Fix Drift Start-up Taint Test Flake (#6547) #6547 (Amanuel Engeda)
- Update Taint Existence check for Start-Up Drift (#6554) #6554 (Amanuel Engeda)
- Add zero CPU limits to expiration (#6555) #6555 (Amanuel Engeda)
- Validate that only the start-up is remaining before Drift check (#6559) #6559 (Amanuel Engeda)
- fix failure happening due to webhook being disabled #6585 (jigisha620)
- Add tests for termination grace period #6541 (jigisha620)
- fix scale testing flakes #6603 (jigisha620)
- add lastPodEventTime e2es (#6628) #6628 (Nick Tran)
- Ensure no 6+ minute delays for disrupted stateful workloads (#6484) #6484 (Drew Sirenko)
- deflake termination grace period tests (#6704) #6704 (Nick Tran)
- Add testing to verify consistent generation for a no change update (#6700) #6700 (Amanuel Engeda)
- Update scale test for expiration to not update expireAfter to prevent drift #6732 (jigisha620)
- use AL2 for nvidia E2E test (#6735) #6735 (Jason Deal)
- use AL2 for accelerator test (#6739) #6739 (Jason Deal)
Continuous Integration
- add workflow_dispatch trigger to image canary (#6263) #6263 (Jason Deal)
- unpin soak cluster (#6290) #6290 (Jason Deal)
- upgrade eksctl to v0.180.0 (#6313) #6313 (Jason Deal)
- Limit Sweeper Parallelism to avoid throttling (#6314) #6314 (Amanuel Engeda)
- Exclude soak testing clusters from deletion (#6334) #6334 (Amanuel Engeda)
- Add
git_ref
for versionCompatibility (#6344) #6344 (Amanuel Engeda) - move tests out from integration suite #6242 (jigisha620)
- Increase max retry for the sweeper (#6378) #6378 (Amanuel Engeda)
- don't exceed max ids to terminate instances (#6497) #6497 (Jason Deal)
- Update deployment to remove drift feature gate (#6561) #6561 (Amanuel Engeda)
- Don't
fail-fast
when running CI test job (#6634) #6634 (Jonathan Innis)
Chores
- drop implicit public IP association config (#6213) #6213 (Jason Deal)
- Release v0.37.0 (#6278) #6278 (github-actions[bot])
- Update data from AWS APIs (#6274) #6274 (github-actions[bot])
- deps: bump sigs.k8s.io/controller-runtime from 0.18.2 to 0.18.3 in the k8s-go-deps group (#6272) #6272 (dependabot[bot])
- deps: bump the go-deps group across 1 directory with 2 updates (#6291) #6291 (dependabot[bot])
- deps: bump docker/login-action from 3.1.0 to 3.2.0 in the actions-deps group (#6305) #6305 (dependabot[bot])
- deps: bump docker/login-action from 3.1.0 to 3.2.0 in /.github/actions/e2e/install-karpenter in the action-deps group (#6308) #6308 (dependabot[bot])
- deps: bump github.com/aws/aws-sdk-go from 1.53.12 to 1.53.14 in the go-deps group (#6307) #6307 (dependabot[bot])
- deps: bump docker/login-action from 3.1.0 to 3.2.0 in /.github/actions/e2e/run-tests-private-cluster in the action-deps group (#6309) #6309 (dependabot[bot])
- Update data from AWS APIs (#6312) #6312 (github-actions[bot])
- Paginate resources with aws-sdk-go-v2 pagination (#6268) (#6316) #6316 (Louis Petro)
- sync v1 staging branch with main (#6335) #6335 (Amanuel Engeda)
- Convert singleton reconcilers to operatorpkg (#6310) #6310 (Jonathan Innis)
- deps: bump sigs.k8s.io/controller-runtime from 0.18.3 to 0.18.4 in the k8s-go-deps group (#6339) #6339 (dependabot[bot])
- deps: bump github.com/aws/aws-sdk-go from 1.53.14 to 1.53.19 in the go-deps group (#6340) #6340 (dependabot[bot])
- deps-dev: bump braces from 3.0.2 to 3.0.3 in /website (#6347) #6347 (dependabot[bot])
- Remove
Get()
in offerings in favor ofCompatible()
(#6341) #6341 (Jonathan Innis) - Wait for instance termination before deleting nodeclaim #5979 (jigisha620)
- deps: bump actions/checkout from 4.1.6 to 4.1.7 in /.github/actions/e2e/slack/notify in the action-deps group (#6364) #6364 (dependabot[bot])
- deps: bump actions/checkout from 4.1.6 to 4.1.7 in /.github/actions/e2e/upgrade-crds in the action-deps group (#6365) #6365 (dependabot[bot])
- deps: bump actions/checkout from 4.1.6 to 4.1.7 in /.github/actions/e2e/cleanup in the action-deps group (#6366) #6366 (dependabot[bot])
- deps: bump actions/checkout from 4.1.6 to 4.1.7 in /.github/actions/e2e/install-karpenter in the action-deps group (#6367) #6367 (dependabot[bot])
- deps: bump actions/checkout from 4.1.6 to 4.1.7 in /.github/actions/e2e/install-prometheus in the action-deps group (#6368) #6368 (dependabot[bot])
- deps: bump actions/checkout from 4.1.6 to 4.1.7 in /.github/actions/e2e/setup-cluster in the action-deps group (#6371) #6371 (dependabot[bot])
- deps: bump github.com/aws/aws-sdk-go from 1.53.19 to 1.54.2 in the go-deps group (#6370) #6370 (dependabot[bot])
- deps: bump the k8s-go-deps group with 5 updates (#6369) #6369 (dependabot[bot])
- Remove clusters used for investigation (#6375) #6375 (Amanuel Engeda)
- Bump staging/v1 with main (#6381) #6381 (Amanuel Engeda)
- Copy the
v1
APIs forEC2NodeClass
(#6321) #6321 (Amanuel Engeda) - deps: bump k8s.io/klog/v2 from 2.130.0 to 2.130.1 in the k8s-go-deps group (#6396) #6396 (dependabot[bot])
- deps: bump github.com/aws/aws-sdk-go from 1.54.2 to 1.54.6 in the go-deps group (#6397) #6397 (dependabot[bot])
- deps: bump aws-actions/aws-codebuild-run-build from b31229dccde89c01530699df08ad07e8d7ac21a8 to f59b837dd074776bd06619e7e22fb62161eab324 in /.github/actions/e2e/run-tests-private-cluster in the action-deps group (#6398) #6398 (dependabot[bot])
- Update data from AWS APIs (#6372) #6372 (github-actions[bot])
- fix grammar in troubleshooting.md (#6389) #6389 (robinverduijn)
- Update data from AWS APIs (#6402) #6402 (github-actions[bot])
- Update storage version of EC2NodeClass to be v1beta1 APIs (#6417) #6417 (Amanuel Engeda)
- Simplify scheme to use clientgoscheme (#6422) #6422 (Ellis Tarn)
- Adding print columns for v1 EC2NodeClass (#6424) #6424 (Amanuel Engeda)
- Add prometheus metrics for AWS client calls (#6421) #6421 (Jonathan Innis)
- deps: bump the go-deps group with 2 updates (#6431) #6431 (dependabot[bot])
- Fill in the NodeClassReference for all NodePools and NodeClaims (#6427) #6427 (Jonathan Innis)
- forceful expiration updates (#6430) #6430 (Jason Deal)
- address GO-2024-2963 (#6448) #6448 (Reed Schalo)
- Add pod-pids-limit to bottlerocket settings #6442 (jigisha620)
- Add talks to README (#6463) #6463 (sullis)
- Update GetSupportedNodeClasses to return []status.Object #6414 (jigisha620)
- Deprecate functional library in favor of lo and operatorpkg (#6467) #6467 (Ellis Tarn)
- Add status conditions on nodeClass #6455 (jigisha620)
- deps: bump actions/upload-artifact from 4.3.3 to 4.3.4 in the actions-deps group (#6460) #6460 (dependabot[bot])
- deps: bump github.com/aws/aws-sdk-go from 1.54.11 to 1.54.15 in the go-deps group (#6458) #6458 (dependabot[bot])
- update node permissions (#6469) #6469 (Reed Schalo)
- bump go.mod (#6478) #6478 (Reed Schalo)
- Fix spurious discovered lines in logging (#6481) #6481 (Jonathan Innis)
- Update data from AWS APIs (#6485) #6485 (github-actions[bot])
- Fix CI failures for golang-ci (#6479) #6479 (Jonathan Innis)
- Add operator status condition metrics (#6482) #6482 (Jonathan Innis)
- drop validation webhooks and enable conversion webhooks (#6488) #6488 (Jason Deal)
- migrate to v1 APIs (#6502) #6502 (Jason Deal)
- deps: bump actions/setup-go from 5.0.1 to 5.0.2 in /.github/actions/install-deps in the action-deps group (#6512) #6512 (dependabot[bot])
- deps: bump actions/setup-go from 5.0.1 to 5.0.2 in /.github/actions/e2e/cleanup in the action-deps group (#6513) #6513 (dependabot[bot])
- deps: bump the go-deps group with 2 updates (#6511) #6511 (dependabot[bot])
- deps: bump actions/setup-go from 5.0.1 to 5.0.2 in the actions-deps group (#6514) #6514 (dependabot[bot])
- update kubelet hash annotation on nodeclaim #6508 (jigisha620)
- change error returned by ResolveNodePoolFromNodeClaim #6517 (jigisha620)
- Bump upstream Karpenter (#6518) #6518 (Amanuel Engeda)
- Update preview docs to refer to the main branch (#6522) #6522 (Amanuel Engeda)
- Bump upstream Karpenter (#6526) #6526 (Amanuel Engeda)
- fix nodeClaim tests failure #6532 (jigisha620)
- Bump upstream karpenter (#6536) #6536 (Amanuel Engeda)
- Reconfigure metrics that are duplicated #6524 (jigisha620)
- deps: bump github.com/aws/aws-sdk-go from 1.54.19 to 1.54.20 in the go-deps group (#6565) #6565 (dependabot[bot])
- deps: bump the k8s-go-deps group with 4 updates (#6564) #6564 (dependabot[bot])
- update mod (#6583) #6583 (Nick Tran)
- deps: bump docker/login-action from 3.2.0 to 3.3.0 in the actions-deps group (#6600) #6600 (dependabot[bot])
- deps: bump the go-deps group with 3 updates (#6599) #6599 (dependabot[bot])
- deps: bump docker/login-action from 3.2.0 to 3.3.0 in /.github/actions/e2e/run-tests-private-cluster in the action-deps group (#6598) #6598 (dependabot[bot])
- deps: bump docker/login-action from 3.2.0 to 3.3.0 in /.github/actions/e2e/install-karpenter in the action-deps group (#6596) #6596 (dependabot[bot])
- Scope down the IAM permission on instance profiles for Karpenter (#6605) #6605 (Amanuel Engeda)
- remove support for 1.23 and 1.24 (#6468) #6468 (Nick Tran)
- upgrade go mod and update tests (#6614) #6614 (Nick Tran)
- Cleanup extra parenthesis in chart (#6624) #6624 (Jonathan Innis)
- reconfigure metrics and update docs #6592 (jigisha620)
- Add post install hook (#6626) #6626 (Amanuel Engeda)
- Update GHA Karpenter helm chart diff action (#6631) #6631 (Amanuel Engeda)
- Add templated Values for the CRDs (#6630) #6630 (Amanuel Engeda)
- Include post upgrade on Helm Hook (#6636) #6636 (Amanuel Engeda)
- update to nodeclaimtemplate in nodepool (#6639) #6639 (Nick Tran)
- deps: bump the go-deps group with 4 updates (#6644) #6644 (dependabot[bot])
- deps: bump actions/upload-artifact from 4.3.4 to 4.3.5 in the actions-deps group (#6645) #6645 (dependabot[bot])
- Update so NodeClassNotReady is returned when NodeClass is failing (#6629) #6629 (Jonathan Innis)
- Bump deps to latest for
karpenter
andaws-sdk-go-prometheus
(#6658) #6658 (Jonathan Innis) - Add tagging for migration to
eks:eks-cluster-name
(#6641) #6641 (Jonathan Innis) - Add validation rules for v1beta1 (#6662) #6662 (Amanuel Engeda)
- Update post install hook naming (#6661) #6661 (Amanuel Engeda)
- enforce Windows version via CEL (#6653) #6653 (Jason Deal)
- Bump
sigs.k8s.io/karpenter
to latest (#6667) #6667 (Jonathan Innis) - consistent userData override ordering (#6664) #6664 (Jason Deal)
- update go mod (#6680) #6680 (Nick Tran)
- reduce SSM GetParametersByPath Call rate (#6683) #6683 (Nick Tran)
- Bump
sigs.k8s.io/karpenter
to latest (#6684) #6684 (Jonathan Innis) - deps: bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0 in the go-deps group (#6706) #6706 (dependabot[bot])
- deps: bump actions/upload-artifact from 4.3.5 to 4.3.6 in the actions-deps group (#6707) #6707 (dependabot[bot])
- make image configurable (#6716) #6716 (Nick Tran)
- update karpenter install script for private cluster testing (#6719) #6719 (Jigisha Patil)
- revert to using ssm:GetParameter (#6725) #6725 (Jason Deal)
- Limit parallelism on our GitHub action for version compatibility (#6727) #6727 (Amanuel Engeda)
Commits
- e46b16b: Bump sigs.k8s.io/karpenter to staging/v1 (Jonathan Innis) #6413
- cf53189: Merge remote-tracking branch 'upstream/main' into bump-main (Jason Deal) #6215
- update sigs.k8s.io/karpenter to staging/v1 #6215 (Jason Deal)
- bump go to 1.22.4 (#6332) #6332 (Jason Deal)
- 51a4d13: Modification of Karpenter Grafana dashboards capacity and performance (#5935) (Jens-Uwe Walther) #5935
- d8455ac: Update tests to use amdAMIID (#6384) (sarthakkothari) #6384
- 03e49d9: Drop old redirects and old path checking (Jonathan Innis) #6406
- d42916b: Merge branch 'main' into staging/v1 (Jonathan Innis) #6409
- 7ccbc89: Fix private cluster test failure (jigisha620) #6403
- bump sigs.k8s.io/karpenter (#6412) #6412 (Jason Deal)
- b3e3f90: Deny http access aws sqs queue (#6395) (Pradeep Ashwathanarayan) #6395
- fix expiration test #6546 (jigisha620)
- e18413a: Added v1 features to nodeclasses and nodepools (#6549) (Chris Negus) #6549
- Move tests under termination test suite #6541 (jigisha620)
- bump sigs.k8s.io/karpenter (#6705) #6705 (Jason Deal)
- 75222a1: Small fixes to v1 Migration Guide (#6709) (Chris Negus) #6709
- bump sigs.k8s.io/karpenter to v1.0.0 (#6753) #6753 (Jason Deal)