Fix: Do not cache native resources created without CommonLabels #1818

Draft · Baarsgaard wants to merge 1 commit into master from reduce_cache_size

Conversation

@Baarsgaard (Contributor) commented Jan 11, 2025

I read a blog post on operator memory pitfalls that mentions Owns() being a footgun; Owns() is used in the grafana_reconciler SetupWithManager.

TL;DR: By declaring Owns() or using Get/List, you tell controller-runtime to watch and cache all instances of that client.Object, which on large clusters can mean a lot of cached ConfigMaps, Secrets, and Deployments in the Grafana-Operator's case.

I suspected this was a problem based on the pprof profiles uploaded in #1622, and verified it by following the steps outlined below.

The post linked to an Operator SDK trick for restricting the client.Object cache with label selectors:

mgr, err := ctrl.NewManager(ctrl.GetConfigOrDie(), ctrl.Options{
  Cache: cache.Options{
    ByObject: map[client.Object]cache.ByObject{
      &corev1.Secret{}: {
        // Only Secrets carrying this label are watched and cached.
        Label: labels.SelectorFromSet(labels.Set{"app": "app-name"}),
      },
    },
  },
})
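
For reference, this is roughly the pattern that causes the broad caching in the first place. A minimal sketch of an Owns()-based SetupWithManager (imports elided as in the snippet above; the grafanav1beta1 type and the exact set of Owns() calls are assumptions, not the operator's verbatim code):

func (r *GrafanaReconciler) SetupWithManager(mgr ctrl.Manager) error {
  // Every Owns() call makes controller-runtime watch (and by default cache)
  // ALL objects of that type visible to the manager, not just the ones the
  // operator created, unless the cache is filtered as shown above.
  return ctrl.NewControllerManagedBy(mgr).
    For(&grafanav1beta1.Grafana{}).
    Owns(&appsv1.Deployment{}).
    Owns(&corev1.ConfigMap{}).
    Owns(&corev1.Secret{}).
    Complete(r)
}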

I remembered that #1661 added common labels to resources created by the operator to reduce memory consumption.

Verifying cache issues:

  1. Start a local kind cluster with some default resources (one-liner):
     make start-kind && \
     kind export kubeconfig --name kind-grafana && \
     make ko-build-kind && \
     IMG=ko.local/grafana/grafana-operator make deploy && \
     kubectl patch deploy -n grafana-operator-system grafana-operator-controller-manager-v5 --type='json' -p='[{"op": "replace", "path": "/spec/template/spec/containers/0/imagePullPolicy", "value":"IfNotPresent"}]'
  2. Get a baseline heap reading:
     kubectl port-forward -n grafana-operator-system deploy/grafana-operator-controller-manager-v5 8888 &
     go tool pprof -top -nodecount 20 http://localhost:8888/debug/pprof/heap
  3. Create an empty test file: fallocate -l 393216 large_file
  4. Create a couple hundred ConfigMaps:
     for i in {0..200}; do kubectl create cm test-cm-$i --from-file=./large_file ; done
  5. Get an updated heap reading:
     go tool pprof -top -nodecount 20 http://localhost:8888/debug/pprof/heap
# Output on master branch
File: v5
Type: inuse_space
Time: Jan 11, 2025 at 8:34pm (CET)
Showing nodes accounting for 54.72MB, 100% of 54.72MB total
Showing top 20 nodes out of 107
      flat  flat%   sum%        cum   cum%
   46.91MB 85.72% 85.72%    46.91MB 85.72%  k8s.io/api/core/v1.(*ConfigMap).Unmarshal # <--- this one
       2MB  3.66% 89.38%        2MB  3.66%  runtime.malg
       1MB  1.83% 91.20%        1MB  1.83%  encoding/json.typeFields
    0.75MB  1.37% 92.58%     0.75MB  1.37%  go.uber.org/zap/zapcore.newCounters
    0.54MB  0.99% 93.56%     0.54MB  0.99%  github.com/gogo/protobuf/proto.RegisterType
    0.52MB  0.94% 94.51%     0.52MB  0.94%  k8s.io/apimachinery/pkg/watch.(*Broadcaster).Watch.func1
    0.50MB  0.92% 95.43%     0.50MB  0.92%  unicode.map.init.1
    0.50MB  0.92% 96.34%     0.50MB  0.92%  k8s.io/apimachinery/pkg/runtime.(*Scheme).AddKnownTypeWithName
    0.50MB  0.91% 97.26%     0.50MB  0.91%  github.com/go-openapi/swag.(*indexOfInitialisms).sorted.func1
    0.50MB  0.91% 98.17%     0.50MB  0.91%  go.mongodb.org/mongo-driver/bson/bsoncodec.(*kindDecoderCache).Clone
....

Current progress

Watching and caching has been limited to resources controlled by the operator, for the following Kinds:

  • Deployment
  • Ingress
  • Service
  • ServiceAccount
  • PersistentVolumeClaim
  • Route (if IsOpenShift)

This is done with the existing CommonLabels selector introduced in #1661:
app.kubernetes.io/managed-by: "grafana-operator"
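
As a rough sketch of what that can look like (imports elided; the exact wiring in this PR may differ, and a Route entry would only be added when IsOpenShift is true):

managedSelector := labels.SelectorFromSet(labels.Set{
  "app.kubernetes.io/managed-by": "grafana-operator",
})
cacheOptions := cache.Options{
  ByObject: map[client.Object]cache.ByObject{
    // Only objects created by the operator (carrying the CommonLabels)
    // are kept in the informer cache.
    &appsv1.Deployment{}:            {Label: managedSelector},
    &networkingv1.Ingress{}:         {Label: managedSelector},
    &corev1.Service{}:               {Label: managedSelector},
    &corev1.ServiceAccount{}:        {Label: managedSelector},
    &corev1.PersistentVolumeClaim{}: {Label: managedSelector},
  },
}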

Memory consumption in an empty kind cluster after ~1 minute [1]:

| Change                          | Heap (kB) | % reduction [2] | Note                  |
| ------------------------------- | --------- | --------------- | --------------------- |
| None (master)                   | 8579.22   | 0%              |                       |
| Limited resources listed above  | 5743.40   | -33%            | New default           |
| Limited ConfigMaps and Secrets  | 3585.48   | -58%            | An experiment for now |

TODO:

  • Allow users to tune the informer cache for ConfigMaps/Secrets using label selectors, similar to WATCH_NAMESPACE_SELECTOR (see the sketch after this list).
    Potentially with a way to tune them individually.
  • Somehow test this? (Input is welcome)
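
A purely hypothetical sketch of what such a knob could look like (the environment variable name and helper are illustrative only, not part of this PR; imports elided):

// configMapCacheSelector returns the label selector used for the ConfigMap
// informer cache, falling back to the CommonLabels selector when unset.
func configMapCacheSelector() (labels.Selector, error) {
  raw := os.Getenv("WATCH_CONFIGMAP_SELECTOR") // hypothetical variable name
  if raw == "" {
    return labels.SelectorFromSet(labels.Set{
      "app.kubernetes.io/managed-by": "grafana-operator",
    }), nil
  }
  return labels.Parse(raw)
}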

Footnotes

  1. Heap will increase over time as the operator stabilizes.

  2. The reduction is by no means representative of real deployments.
    For clusters mixing the Grafana-Operator and other workloads in cluster-scoped mode, the reduction is likely significantly higher.
    Even if the Grafana-Operator were the only Deployment in a cluster, this should still reduce memory, as it won't cache itself 😉

@Baarsgaard changed the title from "feat(internal): Ignore deployments/Configmaps missing CommonLabels" to "WIP: Ignore deployments/Configmaps missing CommonLabels" on Jan 11, 2025
@Baarsgaard force-pushed the reduce_cache_size branch 2 times, most recently from 389e8d6 to e4ed220 on January 11, 2025 at 23:56
@Baarsgaard changed the title from "WIP: Ignore deployments/Configmaps missing CommonLabels" to "Fix: Do not cache native resources created without CommonLabels" on Jan 12, 2025