Only scale up zone after all leader zone replicas are ready #164

jhesketh · 2024-08-12T08:28:40Z

No description provided.

charleskorn

tl;dr: if I understand this correctly, I don't believe it will solve the problem we're hoping it will. I think we need to wait until all replicas in the leader StatefulSet are ready.

If I'm following the change here correctly, this will scale up the follower StatefulSet based on the number of ready replicas in the leader StatefulSet.

However, I don't think this will work for the problem we're trying to solve in the context of Mimir's store-gateways, for two reasons.

First: I think it can still lead to a read outage for a tenant, as it's still possible to get into a state where all store-gateways for a tenant aren't ready. This is easiest illustrated with an example:

Imagine we currently have one store-gateway per zone and three zones, and a tenant configured with a replication factor of three (one per zone). Let's say we've just initiated a scale up to 10 store-gateways per zone, and zone B is configured to follow zone A, and zone C follows zone B.

Let's say the tenant is sharded to new store-gateway a-7 in zone A, and all other store-gateways in zone A are ready.

Zone B will then scale up to 9 replicas. Let's say the tenant is sharded to new store-gateway b-3 in zone B.

After some time, all store-gateways except a-7 and b-3 are ready. Zone C will then scale up to 8 replicas. Let's say the tenant is sharded to new store-gateway c-5 in zone C.

At this point, all three store-gateways for the tenant aren't ready, so queries for this tenant will fail.

Second: this will cause a lot of churn in zones B and C. For example, as more and more replicas in zone A come ready, more replicas will be added to zone B. However, each time the number of replicas change, this causes reshuffling of tenants and blocks between store-gateways. So if we gradually scale up zone B as new store-gateways in zone A become ready, zone B will experience quite a bit of churn, and the same will happen in zone C as zone B's new store-gateways become ready.

I think the solution to this is to only scale up the follower zones once all desired replicas of the leader zone are up and ready - this ensures we don't end up in a state with all store-gateways for a tenant in a non-ready state, and minimises the amount of churn in follower zones.

jhesketh · 2024-08-12T10:09:19Z

Good catch, thank you. I'll make it so scale-up only occurs once all pods are ready.

pkg/config/config.go

pkg/controller/replicas.go

pkg/controller/replicas_test.go

charleskorn

LGTM

CHANGELOG.md

jhesketh added 2 commits August 12, 2024 18:28

Only scale up zone after leader zone replicas are ready

6ef8b83

Update CHANGELOG

8b07584

charleskorn reviewed Aug 12, 2024

View reviewed changes

jhesketh changed the title ~~Only scale up zone after leader zone replicas are ready~~ Only scale up zone after all leader zone replicas are ready Aug 12, 2024

Change to only scaling once all replicas are ready

8c95b10

charleskorn reviewed Aug 13, 2024

View reviewed changes

pkg/config/config.go Outdated Show resolved Hide resolved

pkg/controller/replicas.go Show resolved Hide resolved

pkg/controller/replicas_test.go Outdated Show resolved Hide resolved

jhesketh added 3 commits August 13, 2024 13:48

Rename config annotation

45dbb5d

Add log line

b09348f

remove redundant test

da73e39

charleskorn approved these changes Aug 13, 2024

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Update changelog

3b182f6

jhesketh enabled auto-merge (squash) August 13, 2024 04:09

jhesketh merged commit fa52527 into main Aug 13, 2024
6 checks passed

jhesketh deleted the jhesketh/ready-replicas branch August 13, 2024 04:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only scale up zone after all leader zone replicas are ready #164

Only scale up zone after all leader zone replicas are ready #164

jhesketh commented Aug 12, 2024

charleskorn left a comment

jhesketh commented Aug 12, 2024

charleskorn left a comment

Only scale up zone after all leader zone replicas are ready #164

Only scale up zone after all leader zone replicas are ready #164

Conversation

jhesketh commented Aug 12, 2024

charleskorn left a comment

Choose a reason for hiding this comment

jhesketh commented Aug 12, 2024

charleskorn left a comment

Choose a reason for hiding this comment