CLOSE_WAIT connections on master node when ELBs point there #43212

Closed

dkapanidis opened this issue Mar 16, 2017 · 6 comments

Is this a request for help? (If yes, you should use our troubleshooting guide and community support channels, see http://kubernetes.io/docs/troubleshooting/.):
No.

What keywords did you search in Kubernetes issues before filing this one? (If you have found any duplicates, you should instead reply there.):

CLOSE_WAIT


Is this a BUG REPORT or FEATURE REQUEST? (choose one):

BUG REPORT

Kubernetes version (use kubectl version):

Client Version: version.Info{Major:"1", Minor:"5", GitVersion:"v1.5.4", GitCommit:"7243c69eb523aa4377bce883e7c0dd76b84709a1", GitTreeState:"clean", BuildDate:"2017-03-08T02:48:58Z", GoVersion:"go1.8", Compiler:"gc", Platform:"darwin/amd64"}
Server Version: version.Info{Major:"1", Minor:"5", GitVersion:"v1.5.2", GitCommit:"08e099554f3c31f6e6f07b448ab3ed78d0520507", GitTreeState:"clean", BuildDate:"2017-01-12T04:52:34Z", GoVersion:"go1.7.4", Compiler:"gc", Platform:"linux/amd64"}

Environment:

  • Cloud provider or hardware configuration: AWS
  • OS (e.g. from /etc/os-release): Debian GNU/Linux 8 (jessie)
  • Kernel (e.g. uname -a): Linux ip-172-31-64-14 4.4.41-k8s #1 SMP Mon Jan 9 15:34:39 UTC 2017 x86_64 GNU/Linux
  • Install tools: kops 1.5.3
  • Others:

What happened:

The problem is triggered when a Service of type=LoadBalancer is left without ready Pods. This reproducibly causes a wave of CLOSE_WAIT connections on the master node(s).

What you expected to happen:

There should not be any flooding of CLOSE_WAIT connections.

How to reproduce it (as minimally and precisely as possible):

  • Start a cluster with kops v1.5.3 (Kubernetes v1.5.2) on AWS
  • Create a Service of type=LoadBalancer (with no attached Pods)

This should trigger the CLOSE_WAIT connections on the master. A sketch of such a Service follows.
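
For illustration (not part of the original report), a minimal client-go sketch of such a Service: the selector matches no Pods, so the Service never gets endpoints while the cloud provider still provisions an ELB for it. This uses the current client-go API rather than the 1.5-era client, and the Service name, namespace, and selector label are made up.

package main

import (
    "context"
    "log"

    corev1 "k8s.io/api/core/v1"
    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    "k8s.io/apimachinery/pkg/util/intstr"
    "k8s.io/client-go/kubernetes"
    "k8s.io/client-go/tools/clientcmd"
)

func main() {
    // Load credentials from the default kubeconfig location.
    config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
    if err != nil {
        log.Fatal(err)
    }
    clientset, err := kubernetes.NewForConfig(config)
    if err != nil {
        log.Fatal(err)
    }

    svc := &corev1.Service{
        ObjectMeta: metav1.ObjectMeta{Name: "close-wait-repro"}, // hypothetical name
        Spec: corev1.ServiceSpec{
            Type:     corev1.ServiceTypeLoadBalancer,
            Selector: map[string]string{"app": "does-not-exist"}, // matches no Pods
            Ports: []corev1.ServicePort{
                {Port: 80, TargetPort: intstr.FromInt(80)},
            },
        },
    }

    // The cloud provider creates an ELB for this Service even though it has
    // no ready endpoints, which is the condition that triggers the issue.
    if _, err := clientset.CoreV1().Services("default").Create(
        context.TODO(), svc, metav1.CreateOptions{}); err != nil {
        log.Fatal(err)
    }
    log.Println("created LoadBalancer Service with no matching Pods")
}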

Note that because kops v1.5.3 uses taints instead of SchedulingDisabled (kubernetes/kops#639), the master nodes are also registered under the ELB on AWS.

Anything else we need to know:

  • As a workaround the master can be marked unschedulable. This causes the ELB to exclude the master, and the CLOSE_WAIT count stops rising:
kubectl patch node MASTER_NAME -p "{\"spec\":{\"unschedulable\":true}}"
  • If Pods are added to the LoadBalancer Service, the CLOSE_WAIT count stops rising once they become ready.

  • The CLOSE_WAIT connections start accumulating on the master only once the ELB marks the master node as "InService", not before.

  • Once too many CLOSE_WAIT connections have accumulated, the following error appears, the master is marked NotReady, and SSH becomes unresponsive. Logs were gathered from "AWS > Instance Settings > Get System Log":

TCP: out of memory… consider tuning tcp_mem
  • The issue has been reproduced in a different cluster and a different AWS account.

Reported together with @mikim83.

justinsb commented Mar 18, 2017

Thank you for the excellent report!

So my theory is this:

  1. CLOSE_WAIT happens when the other end closes the connection (we receive its FIN), but we never close our side of the socket.
  2. For normal kube-proxy traffic using iptables, this won't happen: it is iptables packet manipulation, not a "real" local connection.
  3. But when there are no pods in a service, the nodeport iptables rules are simply not there; connections to the nodeport are not rewritten.
  4. So during this time period, connections go to kube-proxy on the NodePort. kube-proxy is listening on the NodePort (because it wants to make sure the port is not in use; see https://github.com/kubernetes/kubernetes/blob/master/pkg/proxy/iptables/proxier.go#L1349), but it never actually does anything with the socket: technically we don't even accept the connections, and we certainly don't actively close them (see the sketch after this list).
  5. Connections that hit the kube-proxy nodeport socket will be in CLOSE_WAIT until kube-proxy is restarted; effectively forever on a healthy node.
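
To make points 4 and 5 concrete, here is a rough stand-in (an illustration, not kube-proxy's actual code) for a process that opens a listening socket purely to reserve the port and never accepts:

package main

import (
    "log"
    "net"
)

func main() {
    // 31445 stands in for the NodePort from the report; any free port works.
    ln, err := net.Listen("tcp", ":31445")
    if err != nil {
        log.Fatal(err)
    }
    log.Printf("holding %s open without ever calling Accept()", ln.Addr())

    // Never accept. Incoming connections complete their handshake in the
    // kernel's accept queue; when the peer closes its side, the queued
    // connection moves to CLOSE_WAIT and stays there, because no user-space
    // code ever accepts and closes it. Observable with netstat, as in the
    // output further down.
    select {}
}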

For 3: I confirmed that during the time the service had no pods, the KUBE-SVC-UV4XIKEQLMZEPCEV chain (in my case) was removed entirely, along with these KUBE-NODEPORTS rules that reference it:

-A KUBE-NODEPORTS -p tcp -m comment --comment "default/ingress-nginx:http" -m tcp --dport 31445 -j KUBE-MARK-MASQ
-A KUBE-NODEPORTS -p tcp -m comment --comment "default/ingress-nginx:http" -m tcp --dport 31445 -j KUBE-SVC-UV4XIKEQLMZEPCEV

And kube-proxy was listening on 31445 (the NodePort):

tcp6       0      0 :::31445                :::*                    LISTEN      1333/kube-proxy 
tcp6      53      0 172.20.122.50:31445     172.20.108.40:38369     CLOSE_WAIT  -               
tcp6      54      0 172.20.122.50:31445     172.20.104.161:2100     CLOSE_WAIT  -               
tcp6      54      0 172.20.122.50:31445     172.20.104.161:2092     CLOSE_WAIT  -               
tcp6      55      0 172.20.122.50:31445     172.20.104.161:44177    CLOSE_WAIT  -               
tcp6      53      0 172.20.122.50:31445     172.20.108.40:16199     CLOSE_WAIT  -               

Also, the number of CLOSE_WAIT connections went up during the time period when I restarted the pod in my service.

I confirmed that 172.20.104.161 and 172.20.108.40 are the IP addresses of my ELBs. They are doing TCP health checks every 5s (IIRC). It is also possible that the health check is an unusual TCP pattern (because it is not an HTTP health check; it merely opens and closes the connection).
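
To make that pattern concrete, here is a minimal sketch (an illustration, not the ELB's actual implementation) of a probe that simply opens a TCP connection and closes it; the target address is taken from the netstat output above:

package main

import (
    "log"
    "net"
    "time"
)

func main() {
    target := "172.20.122.50:31445" // node IP and NodePort from the report
    for {
        conn, err := net.DialTimeout("tcp", target, 2*time.Second)
        if err != nil {
            log.Printf("health check failed: %v", err)
        } else {
            conn.Close() // no payload, just open and close
        }
        // Against a port whose owner never accepts (as in the sketch above),
        // each successful probe leaves one more CLOSE_WAIT socket behind on
        // the target host.
        time.Sleep(5 * time.Second) // the ELB checks roughly every 5s
    }
}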

For this particular issue, which was about the master, my suspicion is that the same happens on the nodes, since the kube-proxy configuration should be the same. If we actually know that this does not happen on the nodes, that would be interesting information.

Two possible fixes spring to mind:

A) When the service behind a NodePort has no Pods, add an iptables rule that rejects connections to that NodePort. Efficient, but iptables is never easy. I also don't know whether this would cause health checks to fail, which isn't wrong but would slow down recovery.
B) Have kube-proxy accept connections and immediately close them (see the sketch below). This feels correct, though I think it will consume a goroutine per nodeport. But goroutines are cheap.
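
For illustration, a rough sketch of what option B could look like (a sketch only, not actual kube-proxy code; the merged change referenced later in this thread instead took the iptables REJECT route of option A):

package main

import (
    "log"
    "net"
)

// drainPort holds a port open and actively drains it: every accepted
// connection is closed immediately, so the kernel finishes the TCP teardown
// and nothing lingers in CLOSE_WAIT. One goroutine per held port.
func drainPort(addr string) error {
    ln, err := net.Listen("tcp", addr)
    if err != nil {
        return err
    }
    go func() {
        for {
            conn, err := ln.Accept()
            if err != nil {
                return // listener closed
            }
            conn.Close() // accept and close straight away
        }
    }()
    return nil
}

func main() {
    // Hypothetical NodePort used for illustration.
    if err := drainPort(":31445"); err != nil {
        log.Fatal(err)
    }
    select {}
}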

cc @felipejfc as this looks similar to what you are reporting in #41640

cc @thockin for kube-proxy guru-ness and advice on which option to pursue

felipejfc commented Mar 18, 2017

I actually had 2 namespaces with 3 services each (ELB type) that had no pods associated, because someone forgot to delete them after deleting the pods, and they still had health checks configured. After deleting those services today we saw a massive networking performance boost (thousands of sockets in the CLOSE_WAIT state were closed).

I'll look for other services with no pods associated, delete them, and keep an eye on the cluster.

Thanks for helping, @justinsb!

k8s-github-robot pushed a commit that referenced this issue Mar 21, 2017
Automatic merge from submit-queue

Install a REJECT rule for nodeport with no backend

Rather than actually accepting the connection, REJECT.  This will avoid
CLOSE_WAIT.

Fixes #43212


thockin commented May 11, 2017

@justinsb do you recall if we ported this back to 1.6?

justinsb commented May 11, 2017

@thockin looks like we got the first one into 1.6, but not the second one :-(

These are the two commits (for some reason github only shows the branches in this view, not the PR view...)

2ec8799

9a423b6

I did reopen the cherry-pick of the first one to 1.5 this morning (I've been getting pings on this issue): #43858

Looks like we should get #43858 in, and then cherry-pick 9a423b6 to 1.5 and 1.6.

I do recommend that people hitting this in the real world remove services without endpoints; it is almost always just an error or oversight. I don't think it's a huge problem to leak a few connections on a restart if you happen to end up with no pods for a minute or so. Also, removing them typically saves the cost of an extra ELB.

exarkun commented Jul 17, 2017

In which version of Kubernetes is this issue expected to be resolved? The problem still manifests on my Kubernetes 1.6.3 deployment.

0xMadao commented Sep 28, 2017

Still seeing this on Kubernetes 1.6.4 on AWS: the Services of type LoadBalancer in our cluster all have Pods associated, but we still have thousands of CLOSE_WAIT connections.
