AEP-8026: per-vpa-component-configuration #8026
Signed-off-by: Omer Aplatony <[email protected]>
/assign @raywainman
This PR may require API review. If so, when the changes are ready, complete the pre-review checklist and request an API review. Status of requested reviews is tracked in the API Review project.
I really like this @omerap12, thanks for putting this together!
vertical-pod-autoscaler/enhancements/8026-per-vpa-component-configuration/README.md
* recommendationMarginFraction
* Other parameters that benefit from workload-specific tuning

### Validation via CEL
Will we still do some basic validating in the code?
Yeah, e2e tests will be included: 5b89607
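To illustrate the kind of "basic validating in the code" asked about above, here is a minimal Go sketch. The type `containerOverrides`, the function `validateOverrides`, the byte-valued `OOMMinBumpUpBytes` field, and the bounds themselves (ratio of at least 1, non-negative minimum bump-up) are all assumptions made for illustration; they are not taken from the AEP or its CEL rules.

```go
package main

import (
	"errors"
	"fmt"
)

// containerOverrides is a hypothetical stand-in for the per-container fields
// discussed in this PR. A nil pointer means "not set, use the global flag".
type containerOverrides struct {
	OOMBumpUpRatio    *float64
	OOMMinBumpUpBytes *int64 // assumption: plain bytes instead of a resource quantity
}

// validateOverrides collects every violation instead of stopping at the first,
// so a user sees all problems in one pass. The bounds are illustrative only.
func validateOverrides(o containerOverrides) error {
	var errs []error
	if o.OOMBumpUpRatio != nil && *o.OOMBumpUpRatio < 1 {
		errs = append(errs, fmt.Errorf("oomBumpUpRatio must be >= 1, got %v", *o.OOMBumpUpRatio))
	}
	if o.OOMMinBumpUpBytes != nil && *o.OOMMinBumpUpBytes < 0 {
		errs = append(errs, fmt.Errorf("oomMinBumpUp must be >= 0, got %d", *o.OOMMinBumpUpBytes))
	}
	// errors.Join returns nil when there is nothing to join.
	return errors.Join(errs...)
}

func main() {
	ratio := 0.5
	if err := validateOverrides(containerOverrides{OOMBumpUpRatio: &ratio}); err != nil {
		fmt.Println("rejected:", err)
	}
}
```

A check like this would complement, not replace, CEL rules on the CRD: CEL rejects bad objects at admission time, while an in-code check protects components that read objects created before the rules existed.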
Do these API changes need to be feature-gated and go through the process of:
I don't think the VPA project really promises a downgrade path, but Kubernetes does. If we use what Kubernetes has defined, then we may need to do this.
I thought you were on vacation!
I'm easing back into open source over the next week or so, and playing catch up.
I think it depends on what our promise is to the user. If we promise the ability to downgrade by 1 version, then we need to do the feature-gating. In the past it seems we have loosely adopted what Kubernetes promises (the ability to downgrade), but that doesn't seem to have been a formal decision.
What does "downgrade" mean in this context? You are just adding fields.
Something like this is what I'm thinking:
Their previously-set VPA setting (oomBumpUpRatio, for example) no longer works. It's not a big deal, but it's how Kubernetes rolls out new features.
I understand. Right now, if someone updates the VPA and sets a different |
Yeah, the downgrade isn't a big problem with this change. I think the VPA is lower risk than Kubernetes, so we could get away with not worrying about a downgrade, but someone may expect the same from the VPA as they do from Kubernetes. So I'm on the fence with this one.
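The downgrade concern in this thread can be sketched in Go as follows. The function `effectiveOOMBumpUpRatio`, its parameters, and the values used in `main` are hypothetical and only illustrate the idea: when the feature gate is off (or the component is downgraded to a version that does not know the field), a per-VPA value is ignored and the global setting applies again.

```go
package main

import "fmt"

// effectiveOOMBumpUpRatio returns the ratio a component would use.
// gateEnabled models a feature gate for the new per-VPA fields (assumption);
// perVPA is nil when the VPA object does not set the field.
func effectiveOOMBumpUpRatio(gateEnabled bool, perVPA *float64, global float64) float64 {
	if gateEnabled && perVPA != nil {
		return *perVPA
	}
	// Gate disabled or field unset: fall back to the global flag value.
	return global
}

func main() {
	perVPA := 2.0 // illustrative per-VPA override
	global := 1.2 // illustrative global flag value

	fmt.Println(effectiveOOMBumpUpRatio(true, &perVPA, global))
	fmt.Println(effectiveOOMBumpUpRatio(false, &perVPA, global))
}
```

The second call shows the user-visible effect being debated: the stored per-VPA value is still present on the object, but it silently stops taking effect.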
@adrianmoisey PTAL :)
/lgtm I'd really love more eyes on this, hence holding it
cc @kubernetes/sig-autoscaling-leads
Extend `ContainerResourcePolicy` with:
* `oomBumpUpRatio`
* `oomMinBumpUp`
* `memoryAggregationInterval`
I'm curious why you want to expose this and not also expose the count?
Do we just assume that 8 intervals is good and we can just adjust the time interval instead to suit the need?
I'm curious how much success you've had in tweaking this in your own deployment.
There's no specific reason — the list above just includes what I thought were the most useful fields to expose. I can also expose the count field since it seems straightforward. We'll continue exposing new fields based on user requirements, as mentioned here: https://github.com/kubernetes/autoscaler/pull/8026/files#diff-b666c5599d02971edb0c08090acb86761b7c726c97c4c6c1bbf00ff5883a2364R190
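To make the interval-versus-count question concrete, here is a small Go sketch of how the two settings combine, assuming the memory history window is simply the interval multiplied by the interval count. The function name `memoryHistoryWindow` is hypothetical, and the 24-hour interval in `main` is an illustrative value rather than something stated in this thread; the count of 8 comes from the review comment above.

```go
package main

import (
	"fmt"
	"time"
)

// memoryHistoryWindow returns the total span of memory history covered,
// assuming it equals interval * count.
func memoryHistoryWindow(interval time.Duration, count int64) time.Duration {
	return interval * time.Duration(count)
}

func main() {
	// Same count, different intervals: shrinking the interval shrinks the window.
	fmt.Println(memoryHistoryWindow(24*time.Hour, 8).Hours())
	fmt.Println(memoryHistoryWindow(1*time.Hour, 8).Hours())
}
```

Under that assumption, exposing only the interval lets a workload shorten or lengthen the window, but not change how finely the window is divided, which is what exposing the count would add.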
Overall LGTM, just a couple comments. Apologies for the delay here team...
/lgtm
/lgtm Modulo the discussion on Quantity vs String for the |
/lgtm
Ping @raywainman for final approval.
/approve
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: omerap12, raywainman The full list of commands accepted by this bot can be found here. The pull request process is described here
/unhold
What type of PR is this?
/kind documentation
/kind feature
/kind api-change
What this PR does / why we need it:
AEP for #7650