-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[fix] [broker] topics failed to delete after remove cluster from replicated clusters set and caused OOM #23360
[fix] [broker] topics failed to delete after remove cluster from replicated clusters set and caused OOM #23360
Conversation
...ker/src/main/java/org/apache/pulsar/broker/service/SystemTopicBasedTopicPoliciesService.java
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please check the review comments.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, good work @poorbarcode
Triggering CI with fix #23431 |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #23360 +/- ##
============================================
+ Coverage 73.57% 74.36% +0.78%
- Complexity 32624 34983 +2359
============================================
Files 1877 1953 +76
Lines 139502 147162 +7660
Branches 15299 16199 +900
============================================
+ Hits 102638 109430 +6792
- Misses 28908 29305 +397
- Partials 7956 8427 +471
Flags with carried forward coverage won't be shown. Click here to find out more.
|
I wonder if this is causing #23474 which currently blocks Pulsar CI. There's PR #23478 to disable OneWayReplicatorTestBase tests the problem is addressed. @poorbarcode do you have a chance to fix the problem? |
Self-assigned #23474, I will fix it next week |
@poorbarcode Please also handle cherry-picking to branch-3.0 and branch-3.3 |
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
It appears that this PR fails the test in branch-3.3. https://github.com/heesung-sn/pulsar/actions/runs/11535492430/job/32111441443
|
Sure, it will be fixed by the PR: #23522 |
@poorbarcode Do you have a chance to take a look at #23543 ? It's a deadlock in SystemTopicBasedTopicPoliciesService which happens also after #23522 has been merged. The deadlock might be completely unrelated to the recent changes, but since you know SystemTopicBasedTopicPoliciesService well, I'd appreciate it if you can take a look at the thread dump and the deadlocks. |
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
…icated clusters set and caused OOM (apache#23360)
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af)
…rom replicated clusters set and caused OOM (apache#23360)" This reverts commit cab51f4.
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
…icated clusters set and caused OOM (apache#23360) (cherry picked from commit d9bc7af) (cherry picked from commit 187e6d4)
Motivation
Background
namespace-level replicatedClusters
__change_events
being deleted first.delete topic-level policies
can not be executed successfully anymore.topic.checkReplication
and other operations will retry again and again, as a result, the broker will crash of OOMModifications
__change_event
has been deleted.Documentation
doc
doc-required
doc-not-needed
doc-complete
Matching PR in forked repository
PR in forked repository: x