
Not split nodes when searching for nodes but doing it all at once #67555

Merged

Conversation

@wgliang (Contributor) commented Aug 18, 2018

What this PR does / why we need it:
Do not split the node list when searching for feasible nodes; search it all at once and stop as soon as enough feasible nodes have been found.


Special notes for your reviewer:
@bsalamat

This is a follow-up to #66733.

#66733 (comment)

Release note:

Do not split the node list when searching for feasible nodes; search it all at once and stop as soon as enough feasible nodes have been found.

@k8s-ci-robot added the release-note, size/M, and cncf-cla: yes labels Aug 18, 2018
@k8s-ci-robot added the sig/scheduling label Aug 18, 2018
@wgliang (Contributor, Author) commented Aug 18, 2018

/sig scheduling

@wgliang (Contributor, Author) commented Aug 22, 2018

/assign @bsalamat
PTAL

@misterikkit left a comment

I don't think we should try too hard to mimic the Parallelize API that is being replaced here. If there's a pattern that works better for us here, let's go with it.


// ParallelizeUntilFeasible is a very simple framework that allow for parallelizing
// N independent pieces of work until get feasible solution.
func ParallelizeUntilFeasible(workers, pieces int, doWorkPiece DoWorkPieceFunc, feasible *int32) {


The normal way to accomplish something like this in golang would be to use a context.Context for the work that you are dispatching, then cancelling the context when you want to abandon the rest of the work.

It would also make sense to have the workers fan in their results to a goroutine that handles results and decides when to cancel.
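For illustration, here is a minimal, self-contained sketch of that pattern: a cancellable context shared by the workers, plus a single consumer that fans in results and decides when to cancel. All names are hypothetical; this is not code from this PR.

package main

import (
	"context"
	"fmt"
)

// runUntilEnough dispatches piece indices to workers, fans their results in to
// a single consumer, and cancels the shared context once 'enough' hits are found.
func runUntilEnough(pieces, workers, enough int, check func(int) bool) []int {
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	toProcess := make(chan int, pieces)
	for i := 0; i < pieces; i++ {
		toProcess <- i
	}
	close(toProcess)

	// Buffered so workers never block, even after the consumer stops reading.
	results := make(chan int, pieces)
	for w := 0; w < workers; w++ {
		go func() {
			for i := range toProcess {
				select {
				case <-ctx.Done():
					return // abandon the rest of the work
				default:
				}
				if check(i) {
					results <- i
				} else {
					results <- -1 // report "no hit" so the consumer can count pieces
				}
			}
		}()
	}

	// Fan in: one goroutine (the caller) owns the result slice and decides when to cancel.
	var hits []int
	for seen := 0; seen < pieces && len(hits) < enough; seen++ {
		if i := <-results; i >= 0 {
			hits = append(hits, i)
		}
	}
	cancel()
	return hits
}

func main() {
	fmt.Println(runUntilEnough(1000, 16, 5, func(i int) bool { return i%7 == 0 }))
}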

wgliang (Contributor, Author):

Indeed, I understand what you mean; maybe a callback function can do the same, as bsalamat said. And using Context may be too complicated for a public function.


// ParallelizeUntilFeasible is a very simple framework that allow for parallelizing
// N independent pieces of work until get feasible solution.
func ParallelizeUntilFeasible(workers, pieces int, doWorkPiece DoWorkPieceFunc, feasible *int32) {
Member:

Given that this is in client-go, I think we should make this a more generic function. More specifically, I think we should:

  1. Rename it to ParallelizeUntil.
  2. Change feasible *int32 to a function that returns a bool and an error, similar to ConditionFunc. When the return value is true, it stops.
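Roughly the shape being suggested, as a sketch only: ConditionFunc follows the definition quoted later in this thread, DoWorkPieceFunc follows the existing Parallelize helper, and the PR later replaced the condition argument with a context.Context.

package workqueue

import "sync"

// DoWorkPieceFunc processes one piece of work, identified by its index.
type DoWorkPieceFunc func(piece int)

// ConditionFunc returns true if the condition is satisfied, or an error
// if the loop should be aborted.
type ConditionFunc func() (done bool, err error)

// ParallelizeUntil runs doWorkPiece over 'pieces' indices with 'workers'
// goroutines and stops handing out new pieces once condition returns true.
func ParallelizeUntil(workers, pieces int, doWorkPiece DoWorkPieceFunc, condition ConditionFunc) {
	toProcess := make(chan int, pieces)
	for i := 0; i < pieces; i++ {
		toProcess <- i
	}
	close(toProcess)

	var wg sync.WaitGroup
	wg.Add(workers)
	for w := 0; w < workers; w++ {
		go func() {
			defer wg.Done()
			for piece := range toProcess {
				// Return and abort if the condition is satisfied.
				if done, err := condition(); done && err == nil {
					return
				}
				doWorkPiece(piece)
			}
		}()
	}
	wg.Wait()
}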

wgliang (Contributor, Author):

Thanks for your kind review. :) I've updated it.

@k8s-ci-robot added the sig/api-machinery label Aug 23, 2018
@misterikkit left a comment

> I don't think we should try too hard to mimic the Parallelize API that is being replaced here. If there's a pattern that works better for us here, let's go with it.

I apologize. I didn't notice that you were adding the util in client-go. Given that, it makes lots of sense to mimic the Parallelize API.

> Indeed, I understand what you mean; maybe a callback function can do the same, as bsalamat said. And using Context may be too complicated for a public function.

I don't believe that Context should be considered "too complicated" for a public function. In this case, it offers features that we don't need, so a stop channel would also work. (Passing a stop channel around is a very common pattern in k8s, but it's an old pattern that really ought to be replaced with Context.)


// ParallelizeUntil is a very simple framework that allow for parallelizing
// N independent pieces of work until the condition function return true.
func ParallelizeUntil(workers, pieces int, doWorkPiece DoWorkPieceFunc, condition ConditionFunc) {


I suggest replacing ConditionFunc with a context.Context or stop channel. Then each worker just selects on toProcess and the done/stop channel.

// if the loop should be aborted.
type ConditionFunc func() (done bool, err error)

// ParallelizeUntil is a very simple framework that allow for parallelizing


s/allow/allows/

and I would appreciate it if you could fix the same typo on Parallelize.

wgliang (Contributor, Author):

No problem.


// ParallelizeUntil is a very simple framework that allow for parallelizing
// N independent pieces of work until the condition function return true.
func ParallelizeUntil(workers, pieces int, doWorkPiece DoWorkPieceFunc, condition ConditionFunc) {


Since there is a lot of duplicated code here, I suggest rewriting Parallelize to just call this function.
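With the context-based signature that the thread converges on below, that delegation can be as small as the following sketch (it assumes the ParallelizeUntil and DoWorkPieceFunc discussed in this thread):

// Parallelize processes all pieces with no early-stop condition; it simply
// delegates to ParallelizeUntil with a nil context, which never cancels.
func Parallelize(workers, pieces int, doWorkPiece DoWorkPieceFunc) {
	ParallelizeUntil(nil, workers, pieces, doWorkPiece)
}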

wgliang (Contributor, Author):

A good idea.

wgliang (Contributor, Author):

After this PR merges, I will submit a separate PR to replace all uses of Parallelize.


I think there was a slight misunderstanding. We should only replace the definition of Parallelize so that existing callers won't see any change. Parallelize would just call ParallelizeUntil.

I think that is a small change that should be included in this PR.


// Stops searching for more nodes once the configured number of feasible nodes are found(
// once the remainingNodesNumber is less than or equal to 0).
workqueue.ParallelizeUntil(16, int(allNodes), checkNode, condition)


I know a lot of this code was here before your change, but with this new ParallelizeUntil function, I think we can make this code much cleaner and easier to read.

Rather than having each worker write to an array and atomically increment an index variable, workers should send *v1.Node objects on a result channel. Then in this function, we can read from the result channel and build our filtered slice. Since only one goroutine would access filtered, we could remove the synchronization around it.
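A self-contained sketch of that alternative, using generic stand-in types and hypothetical names. Note this is not the approach the PR ultimately took (which kept a preallocated slice and an atomic index), and cancelling the remaining work once enough nodes are found is omitted for brevity.

package main

import (
	"fmt"
	"sync"
)

type node struct{ name string }

// filterNodes runs check over all nodes with 'workers' goroutines. Workers send
// passing nodes on a channel; only the single consumer below touches 'filtered',
// so no mutex or atomic counter is needed around it.
func filterNodes(nodes []node, workers, want int, check func(node) bool) []node {
	toProcess := make(chan int, len(nodes))
	for i := range nodes {
		toProcess <- i
	}
	close(toProcess)

	// Buffered so workers never block, even if the consumer stops early.
	results := make(chan node, len(nodes))
	var wg sync.WaitGroup
	wg.Add(workers)
	for w := 0; w < workers; w++ {
		go func() {
			defer wg.Done()
			for i := range toProcess {
				if check(nodes[i]) {
					results <- nodes[i]
				}
			}
		}()
	}
	go func() { wg.Wait(); close(results) }()

	var filtered []node
	for n := range results {
		filtered = append(filtered, n)
		if len(filtered) >= want {
			break
		}
	}
	return filtered
}

func main() {
	nodes := []node{{"node-a"}, {"node-b"}, {"node-c"}, {"node-d"}}
	fmt.Println(filterNodes(nodes, 2, 2, func(n node) bool { return n.name != "node-b" }))
}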

wgliang (Contributor, Author):

I have a question and need your guidance. If we pass a channel instead of using synchronization, as you said, is the same also needed for errs and failedPredicateMap?

wgliang (Contributor, Author):

Also, the size of the channel buffer is a problem that would need to be determined.


// Stops searching for more nodes once the configured number of feasible nodes are found(
// once the remainingNodesNumber is less than or equal to 0).
workqueue.ParallelizeUntil(16, int(allNodes), checkNode, condition)


I have to say, it is a little weird that we are building a new util which passes integers to the worker funcs when our specific use case ignores those integers.

Since golang doesn't have generics, the more "canonical" way to solve that would be to have the worker funcs capture their work channel in a closure so that the utility doesn't have to manage that. What do you think?
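For example, the caller could own the work channel and hand the utility a closure that reads from it, so the utility never has to translate integer indices into work items. The runWorkers helper below is hypothetical; this is a sketch of the idea, not what was merged.

package main

import (
	"fmt"
	"sync"
)

// runWorkers is a hypothetical utility: it only starts n goroutines running
// the given closure and waits for them; it knows nothing about the work items.
func runWorkers(n int, worker func()) {
	var wg sync.WaitGroup
	wg.Add(n)
	for i := 0; i < n; i++ {
		go func() {
			defer wg.Done()
			worker()
		}()
	}
	wg.Wait()
}

func main() {
	names := []string{"node-a", "node-b", "node-c"}

	// The caller builds the work channel itself...
	work := make(chan string, len(names))
	for _, n := range names {
		work <- n
	}
	close(work)

	// ...and the worker closure captures it, so no integer indices are needed.
	results := make(chan string, len(names))
	runWorkers(2, func() {
		for n := range work {
			results <- "checked " + n
		}
	})
	close(results)
	for r := range results {
		fmt.Println(r)
	}
}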

wgliang (Contributor, Author):

Yes, I agree with you. We could pass in a channel, and then each worker function would only take its piece of data (here, one *v1.Node) from it. But would we need to build this channel up front? It might be a channel with a very large buffer; that is my doubt.

defer wg.Done()
for piece := range toProcess {
	// Return and abort if the condition is satisfied.
	if done, err := condition(); done && err == nil {
Member:

What happens if err is NOT nil?

defer wg.Done()
for piece := range toProcess {
	// Return and abort if the condition is satisfied.
	if done, err := condition(); done && err == nil {
Member:

btw, as a util, do we accept a nil ConditionFunc?

wgliang (Contributor, Author):

Yes, we should handle it. I'll follow misterikkit's comment and update it later.

@wgliang force-pushed the opt/improve-performance branch 6 times, most recently from 21aa2c1 to 6b26ab5 on August 23, 2018 04:15
var (
	predicateResultLock  sync.Mutex
	filteredLen          int32
	remainingNodesNumber int32
Member:

Do we need remainingNodesNumber? I think we can use filteredLen and stop searching for more nodes once filteredLen >= numNodesToFind.

@@ -413,24 +419,23 @@ func (g *genericScheduler) findNodesThatFit(pod *v1.Pod, nodes []*v1.Node) ([]*v
}
if fits {
Member:

If we re-write this if statement as below, we can change line 377 to filtered = make([]*v1.Node, numNodesToFind). This saves us some memory.

if fits {
	len := atomic.AddInt32(&filteredLen, 1)
	if len > numNodesToFind {
		cancel()
	} else {
		filtered[len - 1] = g.cachedNodeInfoMap[nodeName].Node()		
	}
}

wgliang (Contributor, Author):

DONE.
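Putting the suggestions from this thread together, the call site ends up with roughly the following shape. This is a simplified, generic sketch with stand-in values and a toy predicate, not the actual generic_scheduler.go code; error and failed-predicate handling are omitted.

package main

import (
	"context"
	"fmt"
	"sync/atomic"

	"k8s.io/client-go/util/workqueue"
)

func main() {
	// Stand-ins for the scheduler's node count, target, and predicate checks.
	allNodes := 1000
	numNodesToFind := int32(50)
	fits := func(i int) bool { return i%2 == 0 }

	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	// Preallocate exactly numNodesToFind slots; slots are claimed with an
	// atomic counter, and the context is cancelled once they are all used.
	filtered := make([]int, numNodesToFind)
	var filteredLen int32

	checkNode := func(i int) {
		if fits(i) {
			length := atomic.AddInt32(&filteredLen, 1)
			if length > numNodesToFind {
				cancel() // enough feasible nodes found; stop handing out work
			} else {
				filtered[length-1] = i
			}
		}
	}

	// Stops searching for more nodes once numNodesToFind feasible nodes are found.
	workqueue.ParallelizeUntil(ctx, 16, allNodes, checkNode)

	if filteredLen > numNodesToFind {
		filteredLen = numNodesToFind // counts past the target never fill a slot
	}
	filtered = filtered[:filteredLen]
	fmt.Printf("found %d feasible nodes\n", len(filtered))
}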

@@ -373,16 +374,19 @@ func (g *genericScheduler) findNodesThatFit(pod *v1.Pod, nodes []*v1.Node) ([]*v

	// Create filtered list with enough space to avoid growing it
	// and allow assigning.
-	filtered = make([]*v1.Node, 2*numNodesToFind)
+	filtered = make([]*v1.Node, numNodesToFind+16)
Member:

We will not add to filtered if the index is larger than numNodesToFind, so we should be fine with allocating only numNodesToFind entries. If you still insist, please create a const, like numWorkers, and use it here and in ParallelizeUntil. In a few years, nobody would know why this +16 is here.

wgliang (Contributor, Author):

I'm sorry, this is something I was testing and forgot to delete.

@bsalamat (Member) commented

/retest

@wgliang (Contributor, Author) commented Aug 27, 2018

/test pull-kubernetes-kubemark-e2e-gce-big

@wgliang (Contributor, Author) commented Aug 28, 2018

/test pull-kubernetes-bazel-build

func ParallelizeUntil(ctx context.Context, workers, pieces int, doWorkPiece DoWorkPieceFunc) {
	// If the passed ctx is nil, then the default context.TODO() is used.
	if ctx == nil {
		ctx = context.TODO()
Member:

Maybe change this to return Parallelize(workers, pieces, doWorkPiece), so that we don't need to check whether the context is done in the select{} for this "ctx == nil" case.


Per other comments, I think the plan is for Parallelize to call this func with a nil context to eliminate duplicate code.

context.TODO() implies that we need further code changes to get an appropriate context. Since your goal is to create a context that never cancels, it is more appropriate to use context.Background().

However, I think it would be cleaner to write it as,

var stop <-chan struct{}
if ctx != nil {
    stop = ctx.Done()
}

wgliang (Contributor, Author):

This is not a good idea; ParallelizeUntil should not rely on Parallelize, because ParallelizeUntil may replace Parallelize.

Member:

That's fair.
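Assembled from the snippets quoted in this thread, the utility ends up looking roughly like this; a sketch, not necessarily the exact merged code. Note that receiving from a nil channel blocks forever, so a nil ctx simply means the stop case never fires and every piece is processed.

package workqueue

import (
	"context"
	"sync"
)

// DoWorkPieceFunc processes one piece of work, identified by its index.
type DoWorkPieceFunc func(piece int)

// ParallelizeUntil runs doWorkPiece over 'pieces' indices using 'workers'
// goroutines and stops handing out new pieces once ctx is cancelled.
// A nil ctx behaves like Parallelize: all pieces are processed.
func ParallelizeUntil(ctx context.Context, workers, pieces int, doWorkPiece DoWorkPieceFunc) {
	var stop <-chan struct{}
	if ctx != nil {
		stop = ctx.Done()
	}

	toProcess := make(chan int, pieces)
	for i := 0; i < pieces; i++ {
		toProcess <- i
	}
	close(toProcess)

	var wg sync.WaitGroup
	wg.Add(workers)
	for i := 0; i < workers; i++ {
		go func() {
			defer wg.Done()
			for piece := range toProcess {
				select {
				case <-stop: // ctx cancelled: abandon the remaining pieces
					return
				default:
					doWorkPiece(piece)
				}
			}
		}()
	}
	wg.Wait()
}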

@misterikkit left a comment

As a side note, it would be nice to rewrite the commit description and PR description to more clearly explain the change.

@@ -50,3 +51,37 @@ func Parallelize(workers, pieces int, doWorkPiece DoWorkPieceFunc) {
	}
	wg.Wait()
}

// ParallelizeUntil is a framework that allows for parallelizing N
// independent pieces of work until the context done or canceled.


Suggest rewording "until the context done or canceled" to "until done or the context is canceled"


// Stops searching for more nodes once the configured number of feasible nodes are found(
// once the remainingNodesNumber is less than or equal to 0 and cancel the ctx).


This comment needs updating, since remainingNodesNumber is no longer used.

wgliang (Contributor, Author):

Done, thanks.

@wgliang (Contributor, Author) commented Aug 30, 2018

@sttts @bsalamat @misterikkit
Ready for further review. :)

@bsalamat (Member) commented Sep 1, 2018

/assign @sttts

for another round of review and approval

toProcess := make(chan int, pieces)
for i := 0; i < pieces; i++ {
	toProcess <- i
}
close(toProcess)

Contributor:

nit: would leave these lines.

@sttts (Contributor) commented Sep 3, 2018

/lgtm
/approve

For client-go changes.

@k8s-ci-robot added the lgtm label Sep 3, 2018
@k8s-ci-robot commented

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: bsalamat, sttts, wgliang

The full list of commands accepted by this bot can be found here.

The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot added the approved label Sep 3, 2018
@bsalamat (Member) commented Sep 3, 2018

/retest

@k8s-ci-robot commented

New changes are detected. LGTM label has been removed.

@k8s-ci-robot removed the lgtm label Sep 4, 2018
@wgliang (Contributor, Author) commented Sep 4, 2018

/retest

@sttts added the lgtm label Sep 4, 2018
@k8s-ci-robot commented

@wgliang: You must be a member of the kubernetes/kubernetes-milestone-maintainers github team to set the milestone.

In response to this:

> /milestone v1.12

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@wgliang (Contributor, Author) commented Sep 4, 2018

/ping @bsalamat
for milestone v1.12

@bsalamat added this to the v1.12 milestone Sep 4, 2018
@bsalamat added the status/approved-for-milestone, priority/important-soon, and kind/feature labels Sep 4, 2018
@k8s-github-robot commented

Automatic merge from submit-queue (batch tested with PRs 67555, 68196). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md.
