This repository was archived by the owner on Feb 25, 2025. It is now read-only.

Add a complexity scoring class for Metal and OpenGL #31417

Merged

gw280 merged 15 commits into flutter:main from gw280:gwright-complexity-score

Feb 18, 2022

+2,358 −72

Contributor

gw280 commented Feb 11, 2022

This adds an initial implementation of a complexity scoring class for Metal. Some notes:

There are a lot of magic numbers in this file. These come from me processing benchmark data, and I have written long descriptions in the comments to give the reader an idea of where these numbers came from.
drawTextBlob and drawVertices are incomplete right now, because we are unable to get the glyph count or the vertex count respectively. In their place, I've put in estimates where I can.
The scores assigned are based on a baseline of 0.0005ms being a score of 100. This is a very rough estimate. With a 32-bit unsigned integer, this will allow us to score up to approximately 21 seconds before we overflow.
Throughout the file I reference the constants m and c. These come from y=mx+c, the line of best fit through the benchmark data for that particular use case. Important: these constants are the values before dividing the benchmark data by the number of draw calls made.
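The score-to-time mapping and the overflow headroom described above can be checked with a few lines of arithmetic. This is only a sketch: `ScoreToNanoseconds` and the constant names are hypothetical and do not come from the PR; the only input taken from the description is that a score of 100 corresponds to roughly 0.0005ms (500ns).

```cpp
#include <cstdint>
#include <limits>

// Assumption from the PR description: a score of 100 is roughly 0.0005 ms,
// which is 500 nanoseconds.
constexpr std::uint64_t kNanosPerHundredPoints = 500;

// Hypothetical helper (not part of the PR): converts a complexity score into
// an estimated time in nanoseconds. The math is done in 64 bits so that the
// conversion itself cannot overflow.
constexpr std::uint64_t ScoreToNanoseconds(std::uint32_t score) {
  return static_cast<std::uint64_t>(score) * kNanosPerHundredPoints / 100;
}

// Largest time that a 32-bit unsigned score can represent under this mapping.
constexpr std::uint64_t kMaxRepresentableNanos =
    ScoreToNanoseconds(std::numeric_limits<std::uint32_t>::max());
```

Under this assumption the maximum 32-bit score maps to 21,474,836,475ns, which is where the "approximately 21 seconds" figure comes from.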

flutter/flutter#86728

Tests to come.

Pre-launch Checklist

I read the Contributor Guide and followed the process outlined there for submitting PRs.
I read the Tree Hygiene wiki page, which explains my responsibilities.
I read and followed the Flutter Style Guide and the C++, Objective-C, Java style guides.
I listed at least one issue that this PR fixes in the description above.
I added new tests to check the change I am making or feature I am adding, or Hixie said the PR is test-exempt. See testing the engine for instructions on writing and running engine tests.
I updated/added relevant documentation (doc comments with ///).
I signed the CLA.
All existing and new tests are passing.

If you need help, consider asking for advice on the #hackers-new channel on Discord.

gw280 requested a review from flar

February 11, 2022 23:10

flutter-dashboard bot commented Feb 11, 2022

It looks like this pull request may not have tests. Please make sure to add tests before merging. If you need an exemption to this rule, contact Hixie on the #hackers channel in Chat (don't just cc him here, he won't see it! He's on Discord!).

If you are not sure if you need tests, consider this rule of thumb: the purpose of a test is to make sure someone doesn't accidentally revert the fix. Ask yourself, is there anything in your PR that you feel it is important we not accidentally revert back to how it was before your fix?

Reviewers: Read the Tree Hygiene page and make sure this patch meets those guidelines before LGTMing.

flutter-dashboard bot added the needs tests label

zanderso reviewed


display_list/display_list_complexity_metal.cc

+                  return;
+                }
+                // The performance penalties seem fairly consistent percentage-wise
+                float non_hairline_penalty = 1.0f;

Member

zanderso Feb 11, 2022

Right now this suggestion is a premature optimization, but just want to throw it out there in case it's useful later:

Consider avoiding floating point. You can do that by multiplying these factors by 10, and then dividing by 10 later when you need the final result.

Other strategies might be needed below, but we can delay thinking about them until they're needed.
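A minimal sketch of the integer-scaling idea suggested above, assuming penalty factors are stored in tenths. The function name and the example factor of 1.3 are hypothetical and are not taken from the PR.

```cpp
#include <cstdint>

// Penalty factors are stored multiplied by 10, so 13 means a factor of 1.3
// and 10 means "no penalty".
constexpr std::uint32_t kPenaltyDenominator = 10;

// Hypothetical helper (not part of the PR): applies a penalty expressed in
// tenths using only integer arithmetic. The product is computed in 64 bits
// so that the intermediate value cannot overflow; the division truncates
// toward zero. The final narrowing cast assumes the result fits in 32 bits.
constexpr std::uint32_t ApplyPenaltyTenths(std::uint32_t base_score,
                                           std::uint32_t penalty_tenths) {
  return static_cast<std::uint32_t>(
      static_cast<std::uint64_t>(base_score) * penalty_tenths /
      kPenaltyDenominator);
}
```

For example, `ApplyPenaltyTenths(200, 13)` yields 260. The trade-off is that precision is limited to one decimal place and small scores lose their fractional part.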


flar reviewed


Contributor

flar left a comment

A bunch of questions asked, so I'll leave this as a "Comment" review rather than an approve or request changes review.


display_list/display_list_complexity_metal.cc Outdated

+                  // m = 1/2
+                  // c = 1
+                  save_layer_complexity = (save_layer_count_ + 2) * 100000;
+                }

Contributor

flar Feb 12, 2022

So if I have 200 saveLayers then I can save time by adding one more? If the slope changes at 200, then perhaps this can be handled in saveLayer by doing something like:

  if (++count > 200) {
    accumulate(M1 * x + B1);
  } else {
    accumulate(M2 * x + B2);
  }

Also, is this depth based or sequential based or both?

Contributor

flar Feb 12, 2022

Plugging in 200 for the count in both equations shows a huge difference in the values. That doesn't sound right. Is my comment about 201 saveLayers taking less time than 199 saveLayers true?
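One way to read the per-call accumulation suggestion is sketched below, simplified to a constant cost per saveLayer on each side of the threshold. All constants and names here are hypothetical placeholders, not values from the PR or the benchmark data. Because every call adds a positive amount, the total can never decrease when one more saveLayer is added, which avoids the "201 is cheaper than 199" problem.

```cpp
#include <cstdint>

// Hypothetical values for illustration only.
constexpr std::uint32_t kSaveLayerThreshold = 200;
constexpr std::uint32_t kCostPerLayerUpToThreshold = 50000;
constexpr std::uint32_t kCostPerLayerPastThreshold = 100000;

// Hypothetical helper: total cost after `save_layer_count` saveLayer calls,
// accumulated one call at a time. The per-call cost changes once the running
// count exceeds the threshold, so only the slope changes there and the total
// stays continuous.
constexpr std::uint64_t AccumulatedSaveLayerCost(
    std::uint32_t save_layer_count) {
  std::uint64_t total = 0;
  for (std::uint32_t count = 1; count <= save_layer_count; ++count) {
    total += (count > kSaveLayerThreshold) ? kCostPerLayerPastThreshold
                                           : kCostPerLayerUpToThreshold;
  }
  return total;
}
```

In a real calculator the addition would presumably happen inside the saveLayer handler rather than in a loop; the loop here only makes the sketch testable in isolation.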

Contributor Author

gw280 Feb 12, 2022

Depth vs. sequential might make a difference but there's no clear trend. Looking at the benchmark data, it varies from -30% to +18%.

Bizarrely, the benchmarking data shows a decrease in overall time at around 128 saveLayer calls, then it starts to spike upwards at a much higher rate starting at around 256 calls. I see this dip in both the nested and the unnested benchmark runs. See the attached screenshot - for the blue line, the X axis values are the saveLayer count; for the orange line, multiply them by 8. That being said, the data is very hard to actually fit to a trend so this is really just a very rough approximation.

With all that being said, any saveLayer right now is going to hit the threshold for caching, and when we're talking about 200 saveLayer calls, whether we're talking about a cost of 20,200,000 (very roughly a time cost of 101 milliseconds) or 8,200,000 (roughly 41ms), they both far exceed the threshold for caching. I think we're overthinking this.
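The figures quoted above can be reproduced as follows. The formula `(save_layer_count_ + 2) * 100000` is the one shown in the diff hunk; the 8,200,000 value is taken directly from the comment because the formula that produces it is not visible here. The helper names are hypothetical, and the conversion assumes the baseline from the PR description (a score of 100 is roughly 0.0005ms, so one point is roughly 5ns).

```cpp
#include <cstdint>

// Formula from the diff hunk above (m = 1/2, c = 1 after scaling).
constexpr std::uint64_t SaveLayerScoreFromDiff(std::uint32_t save_layer_count) {
  return (static_cast<std::uint64_t>(save_layer_count) + 2) * 100000;
}

// Assumption from the PR description: 100 points ~ 500 ns, so 1 point ~ 5 ns.
constexpr std::uint64_t kNanosPerPoint = 5;

// Hypothetical helper: converts a score into estimated microseconds.
constexpr std::uint64_t ScoreToMicroseconds(std::uint64_t score) {
  return score * kNanosPerPoint / 1000;
}

// Score quoted in the comment for the other branch at 200 saveLayers; the
// formula behind it is not shown in this thread, so it is kept as a literal.
constexpr std::uint64_t kOtherBranchScoreAt200 = 8200000;

// Size of the jump between the two branches at 200 saveLayers.
constexpr std::uint64_t kGapAt200 =
    SaveLayerScoreFromDiff(200) - kOtherBranchScoreAt200;
```

With these inputs the two branches come out at 101,000µs and 41,000µs at a count of 200, matching the 101ms and 41ms quoted above, with a 60ms jump between them.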


display_list/display_list_complexity_metal.cc

+                // one and a less expensive one. Both scale linearly with area.
+                //
+                // Expensive: All filled style, symmetric w/AA
+                bool expensive =

Contributor

flar Feb 12, 2022

symmetric RRects are more expensive than non-symmetric versions? Did you mention this to Skia?

Contributor Author

gw280 Feb 12, 2022

Not yet, but yes, symmetric does seem to be more expensive.

gw280 changed the title from "Add a complexity scoring class for Metal" to "Add a complexity scoring class for Metal and OpenGL"

Contributor Author

gw280 commented Feb 15, 2022

Big update:

I've added a base class called DisplayListComplexityHelper where a lot of the common code between the two complexity calculators lives
Added an OpenGL implementation now
Fixed some minor bugs that were present before

Tests to come.

gw280 requested review from zanderso and flar

February 15, 2022 03:09

gw280 force-pushed the gwright-complexity-score branch from 79d7cc6 to 49f792c Compare

February 15, 2022 03:11

zanderso reviewed


Member

zanderso left a comment

small nit: Comments should end with a '.'.

The license check is just asking for the new files to be listed in the golden file.


Member

zanderso commented Feb 15, 2022

/cc @iskakaushik

gw280 force-pushed the gwright-complexity-score branch from 82b2e54 to 51e92a6 Compare

February 17, 2022 23:26

zanderso reviewed


George Wright added 14 commits

February 17, 2022 16:50


          Add a Metal complexity score calculator

b2e943f


          Metal Complexity Scorer

9a2dbc3


          Review updates, some bug fixes

a49d36c


          Redo drawLine, drawRect and drawOval to be calculated similarly to the other ops.

dd92f1c

Add some comments on the rationale behind the decisions made.
Remove some floating point arithmetic.


          Add Complex & Filled w/AA case to DrawDRRect for Metal

0e5d2e5


          Add a ComplexityCalculatorHelper base class that both the Metal and GL calculators use

066516b


          Licences

1672df4


          Rename should_be_cached -> ShouldBeCached, compute -> Compute

86069e0


          Add unittests, some API name changes to confirm with coding style guidelines, fix minor bug in GL calculator's drawLine

b082e94


          Address minor comments nit, fix deps issue in flow tests

9812f87


          licences

98920a0


          Add a worked example to the comments

7abf727


          Initialise complexity_score_ to 0

d11cebd


          Remove todo!

c4ba8ee

gw280 removed the needs tests label


          update setColorFilter

d822f46

gw280 force-pushed the gwright-complexity-score branch from d359980 to d822f46 Compare

February 18, 2022 01:08

gw280 requested a review from zanderso

February 18, 2022 01:56

zanderso approved these changes


gw280 merged commit d476e7a into flutter:main

engine-flutter-autoroll mentioned this pull request

Roll Engine from 605454e20143 to 0fd7713f2b5f (3 revisions) flutter/flutter#98727

Merged

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request


          d476e7a Add a complexity scoring class for Metal and OpenGL (flutter/engine#31417)

1456e25

gw280 mentioned this pull request

Tweak saveLayer complexity scoring on Metal #31616

Merged

jonahwilliams mentioned this pull request

Performance regression with canvas.drawPoints in stable releases after 2.10.5 flutter/flutter#104985

Open
