feat(storage): improve optimize and recluster #11850

zhyass · 2023-06-25T05:06:28Z

I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/

Summary

Add limit for recluster.

ALTER TABLE [IF EXISTS] <name> RECLUSTER [FINAL] [WHERE condition] [LIMIT <segment_count>]

The LIMIT option sets the maximum number of segments to recluster. Databend selects the newest segments first. The default segment_count limit is max_threads * 4.
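
As a rough illustration of the selection rule above (newest segments first, capped at LIMIT or at max_threads * 4 when no LIMIT is given), here is a minimal Rust sketch. SegmentRef, created_seq, and select_segments are hypothetical names made up for this example; they are not Databend's types or functions, and the real selection logic may differ.

```rust
/// Hypothetical stand-in for a segment reference; not Databend's type.
#[derive(Debug, Clone, PartialEq)]
struct SegmentRef {
    location: String,
    /// Assumption: a larger value means the segment was written more recently.
    created_seq: u64,
}

/// Default cap described in the summary: max_threads * 4.
fn default_segment_limit(max_threads: usize) -> usize {
    max_threads * 4
}

/// Pick at most `limit` segments, newest first.
/// When no LIMIT is given, fall back to the default cap.
fn select_segments(
    mut segments: Vec<SegmentRef>,
    limit: Option<usize>,
    max_threads: usize,
) -> Vec<SegmentRef> {
    let cap = limit.unwrap_or_else(|| default_segment_limit(max_threads));
    // Sort descending by sequence so the newest segments come first.
    segments.sort_by(|a, b| b.created_seq.cmp(&a.created_seq));
    segments.truncate(cap);
    segments
}
```

With max_threads = 16 and no LIMIT clause, this sketch would consider at most 64 segments per recluster run.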

Add a memory usage limit for recluster.
Sort the block metas by cluster_statistics during compact.
After optimize compact, run recluster.
Optimize recluster: serialize blocks in parallel during recluster.
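
To make the "sort by cluster_statistics" item concrete, here is a small Rust sketch of ordering block metadata by the min/max range of its cluster key. The ClusterStats and BlockMeta structs below are simplified, hypothetical shapes (cluster key values reduced to i64), and placing blocks without statistics last is an assumption of this example, not something the PR states.

```rust
use std::cmp::Ordering;

/// Hypothetical, simplified cluster statistics: the min and max cluster key
/// values of a block, one entry per cluster key column.
#[derive(Debug, Clone, PartialEq)]
struct ClusterStats {
    min: Vec<i64>,
    max: Vec<i64>,
}

/// Hypothetical, simplified block metadata.
#[derive(Debug, Clone, PartialEq)]
struct BlockMeta {
    location: String,
    /// None when the block carries no cluster key statistics.
    cluster_stats: Option<ClusterStats>,
}

/// Order blocks by (min, max) of their cluster key range. Blocks without
/// statistics go last and keep their relative order (the sort is stable).
fn sort_by_cluster_stats(blocks: &mut [BlockMeta]) {
    blocks.sort_by(|a, b| match (&a.cluster_stats, &b.cluster_stats) {
        (Some(x), Some(y)) => x.min.cmp(&y.min).then_with(|| x.max.cmp(&y.max)),
        (Some(_), None) => Ordering::Less,
        (None, Some(_)) => Ordering::Greater,
        (None, None) => Ordering::Equal,
    });
}
```

Sorting this way places blocks with neighbouring key ranges next to each other, which is presumably why it helps a later recluster pass.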

create table test_order (
    id bigint,
    id1 bigint,
    id2 bigint,
    id3 bigint,
    id4 bigint,
    id5 bigint,
    id6 bigint,
    id7 bigint,
    
    s1 varchar,
    s2 varchar,
    s3 varchar,
    s4 varchar,
    s5 varchar,
    s6 varchar,
    s7 varchar,
    s8 varchar,
    s9 varchar,
    s10 varchar,
    s11 varchar,
    s12 varchar,
    s13 varchar,
    
    d1 DECIMAL(20, 8),
    d2 DECIMAL(20, 8),
    d3 DECIMAL(20, 8),
    d4 DECIMAL(20, 8),
    d5 DECIMAL(20, 8),
    d6 DECIMAL(30, 8),
    d7 DECIMAL(30, 8),
    d8 DECIMAL(30, 8),
    d9 DECIMAL(30, 8),
    d10 DECIMAL(30, 8),
    
    insert_time datetime,
    insert_time1 datetime,
    insert_time2 datetime,
    insert_time3 datetime,
    
    i int
) CLUSTER BY(id,insert_time);


create table random_source (
    id bigint,
    id1 bigint,
    id2 bigint,
    id3 bigint,
    id4 bigint,
    id5 bigint,
    id6 bigint,
    id7 bigint,
    
    s1 varchar,
    s2 varchar,
    s3 varchar,
    s4 varchar,
    s5 varchar,
    s6 varchar,
    s7 varchar,
    s8 varchar,
    s9 varchar,
    s10 varchar,
    s11 varchar,
    s12 varchar,
    s13 varchar,
    
    d1 DECIMAL(20, 8),
    d2 DECIMAL(20, 8),
    d3 DECIMAL(20, 8),
    d4 DECIMAL(20, 8),
    d5 DECIMAL(20, 8),
    d6 DECIMAL(30, 8),
    d7 DECIMAL(30, 8),
    d8 DECIMAL(30, 8),
    d9 DECIMAL(30, 8),
    d10 DECIMAL(30, 8),
    
    insert_time datetime,
    insert_time1 datetime,
    insert_time2 datetime,
    insert_time3 datetime,
    
    i int
) Engine = Random;

insert into test_order select * from random_source limit 50000000;

Before this change:

mysql> optimize table test_order compact;
Query OK, 32000000 rows affected (3 min 10.62 sec)

After this change:

mysql> optimize table test_order compact;
Query OK, 32000000 rows affected (55.94 sec)

The recluster Pipeline is as follows:
┌──────────┐     ┌───────────────┐     ┌─────────┐
│FuseSource├────►│CompoundBlockOp├────►│SortMerge├────┐
└──────────┘     └───────────────┘     └─────────┘    │
┌──────────┐     ┌───────────────┐     ┌─────────┐    │     ┌──────────────┐     ┌─────────┐
│FuseSource├────►│CompoundBlockOp├────►│SortMerge├────┤────►│MultiSortMerge├────►│Resize(N)├───┐
└──────────┘     └───────────────┘     └─────────┘    │     └──────────────┘     └─────────┘   │
┌──────────┐     ┌───────────────┐     ┌─────────┐    │                                        │
│FuseSource├────►│CompoundBlockOp├────►│SortMerge├────┘                                        │
└──────────┘     └───────────────┘     └─────────┘                                             │
┌──────────────────────────────────────────────────────────────────────────────────────────────┘
│         ┌──────────────┐
│    ┌───►│SerializeBlock├───┐
│    │    └──────────────┘   │
│    │    ┌──────────────┐   │    ┌─────────┐    ┌────────────────┐     ┌─────────────────┐     ┌──────────┐
└───►│───►│SerializeBlock├───┤───►│Resize(1)├───►│SerializeSegment├────►│TableMutationAggr├────►│CommitSink│
     │    └──────────────┘   │    └─────────┘    └────────────────┘     └─────────────────┘     └──────────┘
     │    ┌──────────────┐   │
     └───►│SerializeBlock├───┘
          └──────────────┘
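
The two central stages of the diagram (MultiSortMerge combining already-sorted inputs, then several SerializeBlock workers running in parallel) can be sketched in plain Rust as below. This is only an illustration under simplifying assumptions: rows are bare i64 keys, "serialization" is just little-endian encoding, and one thread is spawned per block. The function names are hypothetical and do not correspond to Databend's pipeline processors.

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;
use std::thread;

/// Merge already-sorted runs into one sorted run (the role MultiSortMerge
/// plays in the diagram). Rows are plain i64 keys here for brevity.
fn multi_sort_merge(runs: &[Vec<i64>]) -> Vec<i64> {
    // Min-heap entries: (next key, run index, offset within that run).
    let mut heap = BinaryHeap::new();
    for (run_idx, run) in runs.iter().enumerate() {
        if let Some(&first) = run.first() {
            heap.push(Reverse((first, run_idx, 0usize)));
        }
    }
    let mut merged = Vec::with_capacity(runs.iter().map(Vec::len).sum());
    while let Some(Reverse((key, run_idx, offset))) = heap.pop() {
        merged.push(key);
        // Refill the heap from the run that just produced a row.
        if let Some(&next) = runs[run_idx].get(offset + 1) {
            heap.push(Reverse((next, run_idx, offset + 1)));
        }
    }
    merged
}

/// Split the merged rows into blocks and serialize each block on its own
/// thread (the SerializeBlock fan-out). Results are returned in block order.
fn serialize_blocks_parallel(rows: &[i64], rows_per_block: usize) -> Vec<Vec<u8>> {
    assert!(rows_per_block > 0, "rows_per_block must be positive");
    thread::scope(|scope| {
        let handles: Vec<_> = rows
            .chunks(rows_per_block)
            .map(|block| {
                scope.spawn(move || {
                    block.iter().flat_map(|v| v.to_le_bytes()).collect::<Vec<u8>>()
                })
            })
            .collect();
        handles
            .into_iter()
            .map(|h| h.join().expect("serializer thread panicked"))
            .collect()
    })
}
```

In the diagram, Resize(N) and Resize(1) handle the fan-out and fan-in around the serializers; the sketch replaces them with scoped threads and an ordered join.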

Closes #11799

tests/sqllogictests/suites/base/09_fuse_engine/09_0008_fuse_optimize_table

ZhiHanZ · 2023-06-26T02:24:41Z

I wonder about the mechanism: does the purge still need to store candidates in memory and delete all candidates in an old segment?

zhyass · 2023-06-27T15:51:32Z

I wonder about the mechanism: does the purge still need to store candidates in memory and delete all candidates in an old segment?

It will select and purge the oldest snapshots.

BohuTANG · 2023-06-28T03:00:31Z

Summary (by llmchain.rs)

Added support for "limit" parameter in OPTIMIZE TABLE and ALTER TABLE RECLUSTER statements
The code changes added support for the "limit" parameter in the ReclusterTable action, OptimizeTableStmt struct, and Display implementation of OptimizeTableStmt. The changes also added support for parsing the "limit" parameter in the OptimizeTableStmt and added new test cases for OPTIMIZE TABLE statement with different options and limits.
Changed "dry_run_limit" parameter to "dry_run" boolean parameter
The code changes modified the dry_run_limit parameter to a dry_run boolean parameter in the vacuum_handler.rs and handler.rs files. The changes also added a constant DRY_RUN_LIMIT and modified the function signature of do_vacuum to take a boolean dry_run instead of an optional dry_run_limit.
Modified function signatures to take a Vec instead of a slice &[DataBlock]
The code changes modified the function signatures of compact_final in transform_block_compact.rs, transform_block_compact_for_copy.rs, and transform_compact.rs to take a Vec<DataBlock> instead of a slice &[DataBlock]. The changes also modified the consume_event function to check if the output port can push before pushing the next data block.
Added support for TransformSerializeBlock and TransformSerializeSegment
The code changes added support for TransformSerializeBlock and TransformSerializeSegment in various files, including transform_append.rs, transform_serialize_data.rs, and replace.rs. The changes also removed the unused AppendTransform and added a new module mutation_meta.
Added limit parameter to various functions
The code changes added a limit parameter to the compact method in table.rs, the do_purge function in gc.rs, and the do_recluster function in recluster.rs. The changes also modified the implementation of the do_recluster function to use the limit parameter to limit the number of segment locations processed at a time.
Changed various function parameters and return types
The code changes changed the dry_run_limit parameter to a boolean dry_run parameter in various files, including vacuum_handler.rs, handler.rs, and gc.rs. The changes also changed the return type of apply_delete function in merge_into_mutator.rs and the function signature of compact_table function in hive_table.rs.

flaneur2020 · 2023-06-29T06:27:30Z

I wonder about the mechanism: does the purge still need to store candidates in memory and delete all candidates in an old segment?

IMO purge does not need to scan the world; it only needs to tail the older snapshot, segment, and block files.

If I understand correctly, the block files have some kind of time ordering: if a block file's creation time is earlier than the earliest active snapshot and the file is not included in that snapshot, then it can be safely purged.

zhyass · 2023-06-29T09:57:17Z

IMO purge does not need to scan the world; it only needs to tail the older snapshot, segment, and block files.

If I understand correctly, the block files have some kind of time ordering: if a block file's creation time is earlier than the earliest active snapshot and the file is not included in that snapshot, then it can be safely purged.

This can be used to purge orphan blocks in vacuum table.

However, we cannot guarantee the accuracy of a file's creation time, and a block's creation time is generated earlier than the creation time of the snapshot.
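
For reference, the rule proposed in the quoted comment could look roughly like the Rust sketch below. BlockFile, orphan_candidates, and the safety_margin_secs parameter are all hypothetical; the margin is an assumption added for this example to reflect the concern that creation times may be inaccurate, and nothing here describes how Databend's vacuum actually works.

```rust
use std::collections::HashSet;

/// Hypothetical description of a block file found in storage.
struct BlockFile {
    location: String,
    /// Creation time in seconds since the epoch, as reported by storage.
    created_at: u64,
}

/// Apply the proposed rule: a block file is an orphan candidate when it was
/// created before the earliest active snapshot and that snapshot does not
/// reference it. `safety_margin_secs` moves the cutoff earlier because
/// creation times may be inaccurate (an assumption of this sketch).
fn orphan_candidates<'a>(
    files: &'a [BlockFile],
    earliest_snapshot_time: u64,
    referenced: &HashSet<String>,
    safety_margin_secs: u64,
) -> Vec<&'a str> {
    let cutoff = earliest_snapshot_time.saturating_sub(safety_margin_secs);
    files
        .iter()
        .filter(|f| f.created_at < cutoff && !referenced.contains(&f.location))
        .map(|f| f.location.as_str())
        .collect()
}
```

A larger margin trades slower reclamation for a lower risk of deleting a block whose reported creation time is off.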

github-actions · 2023-07-12T01:57:48Z

Docker Image for PR

tag: pr-11850-f689982

note: this image tag is only available for internal use,
please check the internal doc for more details.

zhyass marked this pull request as draft June 25, 2023 05:06

mergify bot added the pr-feature this PR introduces a new feature to the codebase label Jun 25, 2023

vercel bot deployed to Preview June 25, 2023 13:54

zhyass requested review from lichuang and dantengsky June 25, 2023 14:30

zhyass marked this pull request as ready for review June 25, 2023 14:32

zhyass requested a review from flaneur2020 June 25, 2023 14:33

BohuTANG reviewed Jun 25, 2023

tests/sqllogictests/suites/base/09_fuse_engine/09_0008_fuse_optimize_table

BohuTANG reviewed Jun 25, 2023

tests/sqllogictests/suites/base/09_fuse_engine/09_0008_fuse_optimize_table (outdated)

zhyass force-pushed the improve_clustering branch from 4e26c8c to 9430699 Compare June 27, 2023 10:06

zhyass added the ci-benchmark Benchmark: run all test label Jun 27, 2023

databendlabs deleted a comment from github-actions bot Jun 27, 2023

vercel bot deployed to Preview June 27, 2023 15:48

zhyass added ci-benchmark Benchmark: run all test and removed ci-benchmark Benchmark: run all test labels Jun 27, 2023

databendlabs deleted a comment from github-actions bot Jun 28, 2023

zhyass removed the ci-benchmark Benchmark: run all test label Jun 28, 2023

zhyass requested a review from sundy-li June 28, 2023 06:27

zhyass changed the title ~~feat(storage): add limit for optimize~~ feat(storage): improve optimize and reclusters Jun 28, 2023

zhyass changed the title ~~feat(storage): improve optimize and reclusters~~ feat(storage): improve optimize and recluster Jun 28, 2023

dantengsky added the ci-cloud Build docker image for cloud test label Jun 28, 2023

vercel bot deployed to Preview June 29, 2023 11:41

vercel bot deployed to Preview July 1, 2023 11:42

vercel bot deployed to Preview July 3, 2023 07:32

vercel bot deployed to Preview July 3, 2023 14:41

zhyass removed request for flaneur2020 and lichuang July 3, 2023 15:37

zhyass force-pushed the improve_clustering branch from b461133 to ab80bc7 Compare July 4, 2023 07:56

This was referenced Jul 4, 2023

chore: modify compact_final and add block count in clustering_history #11969

Merged

refactor: append transform and mutation log #11971

Merged

vercel bot deployed to Preview July 5, 2023 09:49

zhyass force-pushed the improve_clustering branch from d9f461c to 2cc0705 Compare July 5, 2023 12:57

improve optimize and recluster

94f4377

zhyass force-pushed the improve_clustering branch from 2cc0705 to 94f4377 Compare July 5, 2023 13:29

BohuTANG added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels Jul 12, 2023

vercel bot deployed to Preview July 12, 2023 01:42

dantengsky mentioned this pull request Jul 14, 2023

[DO NOT MERGE]: testing replace + re-cluster #12059

Closed

Merge remote-tracking branch 'upstream/main' into improve_clustering

cbdbee5

zhyass force-pushed the improve_clustering branch from a74b739 to cbdbee5 Compare July 14, 2023 08:56

vercel bot deployed to Preview July 14, 2023 08:58

dantengsky approved these changes Jul 17, 2023

BohuTANG merged commit cec8404 into databendlabs:main Jul 17, 2023

dantengsky mentioned this pull request Jul 17, 2023

feat: auto optimize table during execution of replace into statement #12100

Merged
