Refactor lake compaction scheduler to keep scheduling unrelated partitions on fe restart #54883

wxl24life · 2025-01-09T09:01:37Z

Enhancement

In the previous implementation, if some lake compaction transaction was not finished, and FE restarted, Lake compaction scheduler in FE would fail to schedule at all. If that compaction txn takes long time to finish (published), or even worse it was unable to publish at all, this will block the compaction scheduler forever, and will introduce a nightmare as all partition's compaction score continuously to increase.

In my new design, I will remove the global min active txn id checker logic, which is the main reason that blocks all compaction jobs from scheduling. Instead, I will rebuild the running compaction jobs on FE restart, and give the scheduler another chance to control the unfinished transactions.

wxl24life added the type/enhancement Make an enhancement to StarRocks label Jan 9, 2025

wxl24life linked a pull request Jan 9, 2025 that will close this issue

[Enhancement] Lake compaction scheduler optimize in fe restart scenarios #54881

Open

24 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor lake compaction scheduler to keep scheduling unrelated partitions on fe restart #54883

Refactor lake compaction scheduler to keep scheduling unrelated partitions on fe restart #54883

wxl24life commented Jan 9, 2025 •

edited

Loading

Refactor lake compaction scheduler to keep scheduling unrelated partitions on fe restart #54883

Refactor lake compaction scheduler to keep scheduling unrelated partitions on fe restart #54883

Comments

wxl24life commented Jan 9, 2025 • edited Loading

Enhancement

wxl24life commented Jan 9, 2025 •

edited

Loading