
DaemonSet should respect Pod Affinity and Pod AntiAffinity #29276

Closed · lukaszo opened this issue Jul 20, 2016 · 26 comments
Labels: area/workload-api/daemonset · lifecycle/rotten · sig/apps · sig/scheduling
Projects: Workloads

Comments

@lukaszo (Contributor) commented Jul 20, 2016

This is a split from #22205, where only node affinity was added to DaemonSets. Pod affinity and pod anti-affinity are still missing from DaemonSets.

It could be implemented in two ways:

  1. Add an InterPodAffinityMatches predicate check to nodeShouldRunDaemonPod in daemoncontroller.go.
  2. Add the InterPodAffinityMatches predicate to GeneralPredicates, which the DaemonSet controller already uses.

cc @bgrant0607
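
For reference, a minimal sketch (using today's apps/v1 API and made-up names) of the kind of spec this issue asks the DaemonSet controller to honor; at the time of this issue the podAffinity section below was accepted by the API but ignored when deciding where daemon pods run:

```yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: example-agent                    # made-up name
spec:
  selector:
    matchLabels:
      app: example-agent
  template:
    metadata:
      labels:
        app: example-agent
    spec:
      affinity:
        podAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            - labelSelector:
                matchLabels:
                  app: monitored-app     # hypothetical label on the pods to co-locate with
              topologyKey: kubernetes.io/hostname
      containers:
        - name: agent
          image: registry.example.com/agent:latest   # placeholder image
```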

@0xmichalis (Contributor)

@lukaszo is this issue fixed?

@lukaszo (Contributor, Author) commented May 21, 2017

@Kargakis nope

@kow3ns (Member) commented Jul 14, 2017

@lukaszo @Kargakis @davidopp

A DaemonSet is meant to run one copy of a Pod on every node that matches its node selector. What would PodAffinity and PodAntiAffinity do here?

I can imagine that hard PodAntiAffinity might be used to mean "put a Pod on every Node that matches your NodeSelector except for Nodes that have Pod x", but users could achieve the same functionality with a more restrictive NodeSelector and a more granular labeling scheme.

Have we put thought into the semantics of the other forms of Pod Affinity/AntiAffinity?
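
To make the hard anti-affinity case above concrete, this is roughly the pod-template stanza it would take, assuming "Pod x" carries a hypothetical app: x label; whether the DaemonSet controller honors it is exactly what this issue is about:

```yaml
# Pod template affinity stanza only (hypothetical label app: x on the pods to avoid)
affinity:
  podAntiAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: x
        topologyKey: kubernetes.io/hostname   # "not on the same node as Pod x"
```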

@lukaszo (Contributor, Author) commented Jul 14, 2017

@kow3ns some use cases are described in PR #31136.

@davidopp (Member)

@kow3ns it's not an unreasonable question. Thinking out loud here, I could imagine you might want to run one daemon per rack, e.g. to control some rack-level hardware resource. (Pod affinity is harder to justify.)
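
A hedged sketch of the "one daemon per rack" idea: self anti-affinity scoped to a custom rack label (example.com/rack and app: rack-daemon are assumptions, not standard labels). If DaemonSets honored it, only one daemon pod per rack could schedule and the rest would stay pending, so this captures the intent rather than a polished design:

```yaml
# Pod template stanza; app: rack-daemon is the DaemonSet's own pod label
affinity:
  podAntiAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: rack-daemon
        topologyKey: example.com/rack   # hypothetical node label identifying the rack
```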

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with a /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Dec 31, 2017
@bgrant0607 (Member)

/remove-lifecycle stale

@bsalamat has been looking at this

@k8s-ci-robot removed the lifecycle/stale label Jan 23, 2018
@dblackdblack commented Jan 23, 2018

+1. This used to work back when affinity was specified via the scheduler.alpha.kubernetes.io/affinity pod annotation. We used this feature for DaemonSets, but it is now broken.

In other words, this is a regression.
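
For context, in that alpha era affinity was expressed as a JSON blob inside a pod annotation, roughly like the fragment below; the annotation key comes from the comment above, but the exact JSON schema shown is a best-effort reconstruction, not authoritative:

```yaml
# Pod template metadata fragment (sketch of the pre-GA annotation form)
metadata:
  annotations:
    scheduler.alpha.kubernetes.io/affinity: |
      {
        "podAffinity": {
          "requiredDuringSchedulingIgnoredDuringExecution": [{
            "labelSelector": {"matchLabels": {"app": "monitored-app"}},
            "topologyKey": "kubernetes.io/hostname"
          }]
        }
      }
```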

@kow3ns added this to In Progress in Workloads Feb 27, 2018
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Apr 30, 2018
@dblackdblack

/remove-lifecycle stale

@k8s-ci-robot removed the lifecycle/stale label Apr 30, 2018
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Jul 29, 2018
@dblackdblack commented Jul 29, 2018 via email

@k8s-ci-robot removed the lifecycle/stale label Jul 29, 2018
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label May 7, 2019
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot (Contributor)

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Workloads automation moved this from In Progress to Done Jun 6, 2019
@dblackdblack

/reopen

@k8s-ci-robot (Contributor)

@dblackdblack: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@mukgupta commented Jun 12, 2020

> I wanted to mention my use case for this feature. We use New Relic APM for monitoring some of our applications. NR APM charges per-host, not per-pod/per-container. To reduce costs, we would like to run the APM agent only on the nodes where the relevant pods are running (i.e. the pods running applications that use NR APM). One (theoretical) way of doing this would be to add a pod affinity to the New Relic APM DaemonSet to match pods labeled as using the APM; however, that feature obviously isn't supported.
>
> A workaround that I thought of (but have not yet implemented) would be to have a custom controller that labels nodes with new-relic-apm: active whenever relevant pods are scheduled on that node, and removes the label after all the relevant pods are removed from the node. Then, we would add a node selector to the DaemonSet to match that label. The DaemonSet would therefore still hold the responsibility for creating the APM agent pods and performing updates, and we wouldn't need to implement our own pod controller.

@rabbitfang I have the exact same requirement but for datadog. Did you manage to solve it?
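
For illustration, the quoted workaround reduces to an ordinary node selector on the DaemonSet; the new-relic-apm: active label, the names, and the image below are assumptions, and the custom controller that adds and removes the node label is not shown:

```yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: newrelic-apm-agent                 # hypothetical name
spec:
  selector:
    matchLabels:
      app: newrelic-apm-agent
  template:
    metadata:
      labels:
        app: newrelic-apm-agent
    spec:
      nodeSelector:
        new-relic-apm: "active"            # label managed by the custom controller
      containers:
        - name: agent
          image: registry.example.com/newrelic-agent:latest   # placeholder image
```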

@dblackdblack commented Jun 12, 2020 via email

@andyxning (Member)

> (Quoting @rabbitfang's New Relic APM use case above.)

We have much the same requirement: schedule the DaemonSet pods with affinity to the app pods, and keep them off nodes that no longer run the app pods once those pods are deleted. Any suggestions?

ijsong added a commit to kakao/varlog that referenced this issue Aug 5, 2022
**change resources for `deploy/k8s/dev`**
To test Varlog in a Krane-based cluster, the cluster should have at least
20 nodes. Recommended numbers are as follows: three MRs, three or four SNs,
and a replication factor of 3.

**daemonset doesn't respect podAntiAffinity**
See kubernetes/kubernetes#29276.
We should attach the label `varlog-type=telemetry` to nodes running
jaeger, prometheus, otel-collector, and grafana. The DaemonSets for MR and SN
won't be deployed to those nodes. The e2e testing module ignores nodes
labeled with `varlog-type=telemetry`.

Resolves [#VARLOG-509](https://jira.daumkakao.com/browse/VARLOG-509).
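
The exclusion described in this commit message can be expressed with node affinity, which DaemonSets do honor; a rough sketch of the MR/SN pod-template stanza, assuming the telemetry nodes are labeled varlog-type=telemetry as above:

```yaml
# NotIn also matches nodes that do not carry the varlog-type label at all
affinity:
  nodeAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      nodeSelectorTerms:
        - matchExpressions:
            - key: varlog-type
              operator: NotIn
              values: ["telemetry"]
```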