(improvement)[bucket] Add auto bucket implement #15250

JackDrogon · 2022-12-21T11:25:06Z

Proposed changes

Problem summary

用户经常设置不合适的bucket，导致各种问题，这里提供一种方式，来自动设置分桶数。暂时而言只对olap表生效

实现思路

根据数据量，计算分桶数。
对于分区表，可以根据历史分区的数据量、机器数、盘数，确定一个分桶。
主要问题是初始桶数不好确定。
这里提供两种方式：

根据机器数、盘数，确定一个分桶数
用户可以提供一个数据量的经验值，根据这个值，确定分桶数。

详细设计

建表

create table tbl1
(...)
[PARTITION BY RANGE(...)]
DISTRIBUTED BY HASH(k1) BUCKETS AUTO
properties(
    "estimate_partition_size" = "100G"
)

BUCKETS AUTO 表示自动设定buckets
estimate_partition_size：可选参数，提供一个单分区初始数据量。

分桶计算逻辑
初始分桶计算

没有给 estimate_partition_size
这种基本上不太靠谱。使用了default_bucket_num(10)。
给了 estimate_partition_size

这里我们先假设给的是单副本文本格式的数据量

先根据数据量得出一个桶数：N
首先数据量除以5（按5比1的压缩比算）
< 100MB : 1
< 1G: 2

1G: 每1G一个分桶。
根据桶数和盘数的乘机得出一个桶数 M
每个BE节点算1
磁盘容量，每50G算1
min(M, N, 128)，如果这个值小于N，也小于机器数。取机器数。

举例：

1. 先根据数据量得出一个桶数：N
    首先数据量除以5（按5比1的压缩比算）
    < 100MB : 1
    < 1G: 2
    > 1G:  每1G一个分桶。

2. 根据BE数和盘数的乘机得出一个桶数 M
    每个BE节点算1
    磁盘容量，每50G算1
    
3. min(M, N, 128)，如果这个值小于N，也小于机器数。
取机器数。这里就是

举例：
1. 100MB，10台机器，2T * 3盘 = 1
数据量: 1
BE磁盘: 10 * 40 * 3 = 1200
min计算: 1
最终: 1

2. 1G, 3台机器，500GB * 2盘 = 2
数据量: 2
BE磁盘: 3 * 10 * 2 = 60
min计算: 2
最终: 2

3. 100G，3台机器，500GB * 2盘 = 20
数据量: 20
BE磁盘: 3 * 10 * 2 = 60
min计算: 20
最终: 20

4. 500G，3台机器，1T * 1盘 = 60
数据量: 100
BE磁盘: 3 * 21 * 1 = 63
min计算: 63
最终: 63

5. 500G，10台机器，2T * 3盘 = 128
数据量: 500
BE磁盘: 10 * 41 * 3 = 1230
min计算: 100
最终: 100

6. 1T，10台机器，2T * 3盘 = 128 
数据量: 200
BE磁盘: 10 * 41 * 3 = 1230
min计算: 128
最终: 128

7. 500G，1台机器，100TB * 1盘 = 128
数据量: 100
BE磁盘: 1 * 2048 * 1 = 2048
min计算: 100
最终: 100

8. 1TB, 200台机器，4T * 7盘 = 200
数据量: 205
BE磁盘: 200 * 80 * 7 = 112000
min计算: 128
最终: 200

计算未来分桶
仅针对分区表。
根据最多前7个分区的数据量的指数平均值，作为estimate_partition_size，进行评估。
需要判断历史分区的趋势：
比如前五个分区，每个都比前一个大，说明数据再增长，则此时不能求平均值，而应该取趋势值。
仅考虑递增和递减的情况。其他情况，求平均。

Checklist(Required)

Does it affect the original behavior:
- Yes
- No
- I don't know
Has unit tests been added:
- Yes
- No
- No Need
Has document been added or modified:
- Yes
- No
- No Need
Does it need to update dependencies:
- Yes
- No
Are there any changes that cannot be rolled back:
- Yes (If Yes, please explain WHY)
- No

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

hello-stephen · 2022-12-21T15:06:56Z

TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 34.96 seconds
load time: 498 seconds
storage size: 17120651505 Bytes
https://doris-community-test-1308700295.cos.ap-hongkong.myqcloud.com/tmp/20230113131233_clickbench_pr_79952.html

JackDrogon · 2022-12-22T03:48:01Z

Need to add show create table with autobucket && estimate_partition_size settings

morningman · 2022-12-22T14:11:52Z

Better using BUCKETS AUTO instead of BUCKETS 0.
You can return 0 internally.

morningman

Please add some unit tests and regression tests for this

fe/fe-core/src/main/java/org/apache/doris/common/util/PropertyAnalyzer.java

morningman · 2022-12-22T14:16:52Z

fe/fe-core/src/main/java/org/apache/doris/common/util/AutoBucketUtils.java

+
+        int buckets = 0;
+        for (Backend backend : backends.values()) {
+            if (!backend.isLoadAvailable()) {


Why judge isLoadAvailable?

If backend is not loadAvailable，it would not be treated as a machine that could take on data.

fe/fe-core/src/main/java/org/apache/doris/common/util/AutoBucketUtils.java

fe/fe-core/src/main/java/org/apache/doris/clone/DynamicPartitionScheduler.java

… property auto_bucket to _auto_bucket (apache#15250)

fe/fe-core/src/main/java/org/apache/doris/analysis/CreateTableStmt.java

fe/fe-core/src/main/java/org/apache/doris/analysis/DistributionDesc.java

fe/fe-core/src/main/java/org/apache/doris/catalog/DistributionInfo.java

fe/fe-core/src/main/java/org/apache/doris/catalog/OlapTable.java

table.getPartitions() in DynamicPartitionScheduler::getBucketsNum (apache#15250)

…apache#15250)

…ions sort by PartitionItem (apache#15250)

… property auto_bucket to _auto_bucket (apache#15250)

table.getPartitions() in DynamicPartitionScheduler::getBucketsNum (apache#15250)

…apache#15250)

…ions sort by PartitionItem (apache#15250)

…heck (apache#15250)

…ons sort by PartitionItem (apache#15250)

…eck (apache#15250)

…ition (apache#15250)

morningman

LGTM

morningman · 2023-01-05T14:52:15Z

fe/fe-core/src/main/java/org/apache/doris/common/util/AutoBucketUtils.java

+        int buckets = 0;
+        for (Backend backend : backends.values()) {
+            if (!backend.isLoadAvailable()) {
+                break;


Suggested change

break;

continue;

github-actions · 2023-01-16T06:22:46Z

PR approved by at least one committer and no changes requested.

github-actions · 2023-01-16T06:22:48Z

PR approved by anyone and no changes requested.

enterwhat · 2023-02-17T06:16:06Z

how can it support Colocation Join ？

perid007 · 2023-05-22T11:28:17Z

how can it support Colocation Join ？

+1 same question

github-actions bot added the area/planner label Dec 21, 2022

JackDrogon force-pushed the feature/autobucket branch 2 times, most recently from 5ad9d10 to 62117bf Compare December 21, 2022 12:05

JackDrogon force-pushed the feature/autobucket branch 2 times, most recently from 5ae2d53 to e136c9d Compare December 21, 2022 20:16

morningman self-requested a review December 22, 2022 01:13

morningman added the dev/1.2.1 label Dec 22, 2022

morningman reviewed Dec 22, 2022

View reviewed changes

dataroaring changed the title ~~[feature] Add auto bucket implement~~ (improvement)[bucket] Add auto bucket implement Dec 24, 2022

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Dec 25, 2022

[feature] Update auto bucket syntax to BUCKETS AUTO (apache#15250)

041d5ec

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Dec 25, 2022

[feature] Add show create with BUCKETS AUTO implement (apache#15250)

bfe5be9

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Dec 25, 2022

[fix] Fix DistributionInfo reload with autoBucket setting && Refactor…

3216b97

… property auto_bucket to _auto_bucket (apache#15250)

morningman reviewed Dec 26, 2022

View reviewed changes

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Dec 27, 2022

[feature] Refactor by FeConstants.default_bucket_num && Fix used old

8bf5893

table.getPartitions() in DynamicPartitionScheduler::getBucketsNum (apache#15250)

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 5, 2023

[feature] Add AutoBucketUtils ut (apache#15250)

85085f7

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 5, 2023

[feature] Add regression test (apache#15250)

1b689ef

github-actions bot added the kind/test label Jan 5, 2023

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 5, 2023

[feature] Fix AutoBucketUtilsTest by use mocked Env/SystemInfoService (…

a813bec

…apache#15250)

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 9, 2023

[feature] Remove DistributionDesc write (apache#15250)

ab6438f

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 9, 2023

[feature] Fix DynamicPartitionScheduler.getBucketsNum history partit…

025ca63

…ions sort by PartitionItem (apache#15250)

morningman added dev/1.2.2 and removed dev/1.2.1 labels Jan 10, 2023

JackDrogon added 5 commits January 13, 2023 16:41

[feature] Add auto bucket implement

6d06e23

[feature] Update auto bucket syntax to BUCKETS AUTO (apache#15250)

56bde6e

[feature] Add show create with BUCKETS AUTO implement (apache#15250)

b1c8fc6

[fix] Fix DistributionInfo reload with autoBucket setting && Refactor…

12ae545

… property auto_bucket to _auto_bucket (apache#15250)

[feature] Refactor by FeConstants.default_bucket_num && Fix used old

c193595

table.getPartitions() in DynamicPartitionScheduler::getBucketsNum (apache#15250)

JackDrogon added 4 commits January 13, 2023 16:44

[feature] Add AutoBucketUtils ut (apache#15250)

15946fa

[feature] Add regression test (apache#15250)

496e969

[feature] Fix AutoBucketUtilsTest by use mocked Env/SystemInfoService (…

e09dd5d

…apache#15250)

[feature] Remove DistributionDesc write (apache#15250)

20ed5ea

JackDrogon force-pushed the feature/autobucket branch from 025ca63 to 1f3ae47 Compare January 13, 2023 08:53

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 13, 2023

[feature] Fix DynamicPartitionScheduler.getBucketsNum history partit…

1f3ae47

…ions sort by PartitionItem (apache#15250)

JackDrogon added a commit to JackDrogon/doris that referenced this pull request Jan 13, 2023

[feature] Fix AutoBucketUtilsTest show create table BUCKETS AUTO c…

e425a96

…heck (apache#15250)

JackDrogon added 3 commits January 13, 2023 18:51

[feature] Fix DynamicPartitionScheduler.getBucketsNum history partiti…

764ebe3

…ons sort by PartitionItem (apache#15250)

[feature] Fix AutoBucketUtilsTest show create table BUCKETS AUTO ch…

0f9c89a

…eck (apache#15250)

[feature] Fix DynamicPartitionScheduler.getBucketsNum skip empty part…

3b3e76d

…ition (apache#15250)

JackDrogon force-pushed the feature/autobucket branch from e425a96 to 3b3e76d Compare January 13, 2023 10:52

morningman approved these changes Jan 16, 2023

View reviewed changes

github-actions bot added the approved label Jan 16, 2023

github-actions bot added the reviewed label Jan 16, 2023

morningman merged commit 3407536 into apache:master Jan 18, 2023

morningman added dev/1.2.2-merged and removed dev/1.2.2 labels Jan 23, 2023

morningman pushed a commit that referenced this pull request Jan 23, 2023

(improvement)[bucket] Add auto bucket implement (#15250)

d078112

dutyu pushed a commit to dutyu/doris that referenced this pull request Feb 1, 2023

(improvement)[bucket] Add auto bucket implement (apache#15250)

962f645

(improvement)[bucket] Add auto bucket implement #15250

(improvement)[bucket] Add auto bucket implement #15250

Conversation

JackDrogon commented Dec 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed changes

Problem summary

实现思路

详细设计

Checklist(Required)

Further comments

Uh oh!

hello-stephen commented Dec 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JackDrogon commented Dec 22, 2022

Uh oh!

morningman commented Dec 22, 2022

Uh oh!

morningman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

morningman Dec 22, 2022

Choose a reason for hiding this comment

Uh oh!

JackDrogon Dec 23, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

morningman left a comment

Choose a reason for hiding this comment

Uh oh!

morningman Jan 5, 2023

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jan 16, 2023

Uh oh!

github-actions bot commented Jan 16, 2023

Uh oh!

enterwhat commented Feb 17, 2023

Uh oh!

perid007 commented May 22, 2023

Uh oh!

JackDrogon commented Dec 21, 2022 •

edited

Loading

hello-stephen commented Dec 21, 2022 •

edited

Loading