Support scaling HPA to/from zero pods for object/external metrics #74526


Merged
merged 2 commits on Jul 16, 2019

Conversation

DXist
Contributor

@DXist DXist commented Feb 25, 2019

What type of PR is this?

/kind feature

What this PR does / why we need it:
This PR targets worker deployments that consume from queues, where scaling is based on an object or external metric that depends on queue size. When workers are idle, the corresponding deployment can be scaled to zero replicas to save resources.

This technique is especially useful when workers request GPU resources and the number of distinct idle worker types exceeds the number of available GPUs.
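For illustration, a minimal HPA using this feature might look like the sketch below (autoscaling/v2beta2); the metric name, selector, and target value are hypothetical, assuming a queue-length metric exposed through an external metrics adapter:

    apiVersion: autoscaling/v2beta2
    kind: HorizontalPodAutoscaler
    metadata:
      name: queue-worker
    spec:
      scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: queue-worker
      minReplicas: 0        # allowed only when the HPAScaleToZero feature gate is enabled
      maxReplicas: 10
      metrics:
        # Hypothetical external metric: ready messages in the worker's queue.
        - type: External
          external:
            metric:
              name: queue_messages_ready
              selector:
                matchLabels:
                  queue: worker_tasks
            target:
              type: AverageValue
              averageValue: "10"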

Which issue(s) this PR fixes:

Fixes #69687

Special notes for your reviewer:
The PR is based on changes made in #61423

  1. Scale to/from zero changes are made.
  2. Applied the changes from #61423 (In Horizontal Pod Autoscaler Controller, when scaling on multiple metrics, handle invalid metrics).
  3. HPA continues to scale as long as there is at least one metric value available.
     The conservative scale-down behaviour introduced in #61423 is not applied here: scaling down works even if we have just one metric value.
  4. The HPA tolerance set through the --horizontal-pod-autoscaler-tolerance flag is ignored when scaling up from zero pods.

Does this PR introduce a user-facing change?:

When the HPAScaleToZero feature gate is enabled, HPA supports scaling to zero pods based on object or external metrics. HPA remains active as long as at least one metric value is available.

To downgrade the cluster to a version that does not support the scale-to-zero feature:
1. Make sure there are no HPA objects with minReplicas=0. Here is a one-liner to update them to 1:
    $ kubectl get hpa --all-namespaces  --no-headers=true | awk  '{if($6==0) printf "kubectl patch hpa/%s --namespace=%s -p \"{\\\"spec\\\":{\\\"minReplicas\\\":1}}\"\n", $2, $1 }' | sh
2. Disable the HPAScaleToZero feature gate.
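For reference, a sketch of how the gate might be toggled on a kubeadm-managed cluster; this ClusterConfiguration fragment is illustrative and not part of this PR (setting the gate on both the API server and the controller manager is an assumption on the safe side):

    apiVersion: kubeadm.k8s.io/v1beta2
    kind: ClusterConfiguration
    apiServer:
      extraArgs:
        # Allows minReplicas: 0 on HPA objects; set to "false" (or drop) to disable.
        feature-gates: "HPAScaleToZero=true"
    controllerManager:
      extraArgs:
        feature-gates: "HPAScaleToZero=true"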

@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. release-note Denotes a PR that will be considered when it comes time to generate release notes. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Feb 25, 2019
@k8s-ci-robot
Contributor

Hi @DXist. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Feb 25, 2019
@k8s-ci-robot k8s-ci-robot added sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/autoscaling Categorizes an issue or PR as relevant to SIG Autoscaling. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Feb 25, 2019
@spiffxp
Member

spiffxp commented Feb 25, 2019

/ok-to-test

@k8s-ci-robot k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Feb 25, 2019
@Rajat-0

Rajat-0 commented Feb 25, 2019

@DXist I am also working on this issue. I have discussed the proposal below in the SIG weekly meeting; per that discussion, @mwielgus will discuss it with the networking team, and based on that we can modify the PR. I'll be happy to coordinate with you on this.

https://docs.google.com/document/d/1p_Xlk8e5V32WOBVeRbJqbxhEKcJy_mDu9JgVp5ZNDxs/

@DXist
Contributor Author

DXist commented Feb 26, 2019

@Rajat-0

@DXist I am also working on this issue. I have discussed the proposal below in the SIG weekly meeting; per that discussion, the SIG owner will discuss it with the networking team, and based on that we can modify the PR. I'll be happy to coordinate with you on this.

https://docs.google.com/document/d/1p_Xlk8e5V32WOBVeRbJqbxhEKcJy_mDu9JgVp5ZNDxs/

This PR handles a more general case and is not specific to HTTP workloads.

If an HTTP load balancer/Ingress could buffer requests and export the number of queued requests for a given service as a custom metric, then HPA could use this metric for scaling. There could also be some signaling about metric changes, or a push-based model of metric delivery to HPA, to speed up the scaling process.
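Sketched as an HPA manifest, that idea might look like the following; the queued_requests metric, the Ingress object, and the target value are all hypothetical, since no such metric exporter exists in this PR:

    apiVersion: autoscaling/v2beta2
    kind: HorizontalPodAutoscaler
    metadata:
      name: frontend
    spec:
      scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: frontend
      minReplicas: 0
      maxReplicas: 10
      metrics:
        # Hypothetical metric exported by the load balancer/Ingress controller.
        - type: Object
          object:
            metric:
              name: queued_requests
            describedObject:
              apiVersion: networking.k8s.io/v1beta1
              kind: Ingress
              name: frontend
            target:
              type: Value
              value: "1"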

Contributor

@mwielgus mwielgus left a comment


A couple of high-level items:

  1. How was it tested?
  2. Why didn't you handle: https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/podautoscaler/horizontal.go#L567
  3. Did you consider explicitly validating that there is an Object/External metric configured?
  4. Maybe we should have a comment update in the API.

@mwielgus mwielgus self-assigned this Feb 28, 2019
@DXist
Contributor Author

DXist commented Feb 28, 2019

A couple of high-level items:

  1. How was it tested?

There are unit tests that cover scaling up from zero pods and scaling down to zero for custom object and external metrics.

For end-to-end testing I:

  • built K8S binaries
  • ran them using https://github.com/kubernetes-sigs/kubeadm-dind-cluster
  • set up RabbitMQ, Prometheus Operator, and the Prometheus adapter for custom metrics
  • set up the service I wanted to scale and configured Prometheus, the Prometheus adapter, and HPA.
    HPA used two Object metrics: worker utilization and queue ingress/egress ratio
  • ran scripts to generate test load: a single message, or a series of messages at a rate of ~5X the processing throughput of a single pod

Only one metric value (worker utilization) was available when there were no pods.
Metrics used a 1m window to calculate the averaged value, so I delayed the start of pod readiness probes via initialDelaySeconds: 60. Due to this delay, the next scaling decision was based on a full metric window since the previous rescale. The delay worked in the same way as the replica calculator code for resource metrics.
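As a rough sketch of that test setup (the metric names, target values, and the Service used as the describedObject are assumptions, not the exact manifests from the test):

    apiVersion: autoscaling/v2beta2
    kind: HorizontalPodAutoscaler
    metadata:
      name: worker
    spec:
      scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: worker
      minReplicas: 0
      maxReplicas: 5
      metrics:
        # Hypothetical utilization metric; stays available even at zero pods.
        - type: Object
          object:
            metric:
              name: worker_utilization
            describedObject:
              apiVersion: v1
              kind: Service
              name: worker
            target:
              type: Value
              value: "500m"
        # Hypothetical queue ingress/egress ratio.
        - type: Object
          object:
            metric:
              name: queue_ingress_egress_ratio
            describedObject:
              apiVersion: v1
              kind: Service
              name: worker
            target:
              type: Value
              value: "1"

The delayed readiness probe on the worker pods would look roughly like this (path and port are assumptions):

    readinessProbe:
      httpGet:
        path: /healthz
        port: 8080
      initialDelaySeconds: 60   # one full metric window since the previous rescale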

  2. Why didn't you handle: https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/podautoscaler/horizontal.go#L567

GitHub collapsed the diff for this module. I've added an extra condition to enable scaling when minReplicas==0.

  3. Did you consider explicitly validating that there is an Object/External metric configured?

No, I didn't. It seems like a good protection against HPA misconfiguration.

  4. Maybe we should have a comment update in the API.

Should the comment update go near MinReplicas?

@k8s-ci-robot k8s-ci-robot added the kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API label Mar 1, 2019
@DXist
Contributor Author

DXist commented Mar 1, 2019

Added metrics validation for the minReplicas=0 case and a comment to the API.

@fejta-bot

This PR may require API review.

If so, when the changes are ready, complete the pre-review checklist and request an API review.

Status of requested reviews is tracked in the API Review project.

@thockin
Member

thockin commented Mar 1, 2019

API LGTM

@mwielgus ping me when LGTM and I will approve

@DXist DXist mentioned this pull request Mar 5, 2019
@liggitt
Member

liggitt commented Jul 16, 2019

/lgtm
/retest

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jul 16, 2019
@DXist
Contributor Author

DXist commented Jul 16, 2019

/retest

@DXist
Contributor Author

DXist commented Jan 21, 2020

@liggitt , @mwielgus What is the next step to make the feature GA?

@numbsafari

Just stopping by after six months to see what, if anything, can be done to move this otherwise perfectly reasonable and working feature from Alpha to GA.

This would be great to have on hosted K8S environments like GKE or AKS or EKS or... basically anywhere.

@numbsafari

Was just reading the release notes for 1.19 and noticed the section on avoiding permanent beta. It would be great if there was some clarity on how to foster this work through the process.

@jeffreybrowning

@liggitt , @mwielgus What is the next step to make the feature GA?

Would love an answer to this. It's been half a year or so since it was asked.

Very useful feature for shutting down workers processing jobs from queues when the queues have no tasks.

@lavalamp
Member

Sounds like a good topic for the SIG Autoscaling meeting :)

@jeffreybrowning

jeffreybrowning commented Sep 5, 2020

@lavalamp were there results from the meeting?

@lavalamp
Member

lavalamp commented Sep 8, 2020

Apologies, that was super unclear of me-- I realize now it reads like I was going to take it to the SIG meeting, but I was actually only trying to suggest that it was the logical next thing to do. I don't personally have time to contribute to this other than the occasional review.

@jeffreybrowning

jeffreybrowning commented Sep 8, 2020

@lavalamp no worries. Is there someone more central to the planning we can poke to move things along? I think this should be a pretty innocent bump to GA. I'll go ahead and tag others who were on this ticket for the Alpha release.

@liggitt @mwielgus

@johanneswuerbach
Contributor

We are also using this feature heavily and I would be happy to contribute time driving this towards GA.

While I have contributed some changes to k8s, I've never been part of an API graduation, so I would need some pointers on how to do this if help is needed.

@liggitt
Member

liggitt commented Sep 10, 2020

I think this should be a pretty innocent bump to GA. I'll go ahead and tag others who were in this ticket for Alpha release.

@liggitt @mwielgus

API changes need an associated doc describing the change and graduation requirements, called a KEP (Kubernetes Enhancement Proposal). The key things we'd be looking for are evidence of sufficient test coverage, any performance implications of the change, and upgrade/downgrade/skew compatibility implications.

It looks like #69687 (comment) might have been the original proposal associated with the alpha change, but before graduating to beta and being enabled by default, it should be ported to a KEP, making sure the test/performance/upgrade questions are answered. A template is available at https://github.com/kubernetes/enhancements/tree/master/keps/NNNN-kep-template and it would be placed in https://github.com/kubernetes/enhancements/tree/master/keps/sig-autoscaling

@jeffreybrowning

@johanneswuerbach @DXist is this something one of you can lead? This would be my first contribution, and it's probably not a great thing to take a first bite out of.

If not, I can attempt a first pass at the proposal.

@DXist
Contributor Author

DXist commented Sep 16, 2020

@jeffreybrowning go ahead. I've changed projects and now work in a different context.

@johanneswuerbach
Contributor

I planned to get something drafted on Friday, but I've also never worked on a KEP before. If you have more time, feel free to pick it up, @jeffreybrowning :-)

@jeffreybrowning

I am embroiled in an update to another package right now -- it would likely be a sizeable delay until I get to this.

@johanneswuerbach
Contributor

I started the official proposal here: kubernetes/enhancements#2021. Looking forward to any of your comments or feedback on the KEP itself here: kubernetes/enhancements#2022.

Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API kind/feature Categorizes issue or PR as related to a new feature. lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. ok-to-test Indicates a non-member PR verified by an org member that is safe to test. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/autoscaling Categorizes an issue or PR as relevant to SIG Autoscaling. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
Projects
Status: API review completed, 1.16
Development

Successfully merging this pull request may close these issues.

Allow HPA to scale to 0