Refactor code for the move to Kubernetes #5

nikodemas · 2024-10-25T14:16:34Z

This PR together with dmwm/CMSKubernetes#1559 prepares the udp-collector for the move to the Kubernetes infrastructure.

FYI @vkuznet @leggerf @brij01

vkuznet · 2024-10-25T16:26:10Z

@nikodemas, I doubt that the changes you propose are valid. Let me break down the k8s requirements for you:

- The /health endpoint should report the status of the underlying service. With your changes, the UDP server serves UDP requests while a separate goroutine (thread) runs the HTTP server with the health endpoint. If the UDP server experiences an issue, the HTTP server knows nothing about it. Therefore, the proposed implementation does not capture the health of the UDP server.
- Removing the shell scripts has nothing to do with k8s. I suggest keeping them around for reference. You may not need to pack them into the final image.
- You removed the code with the Prometheus metrics, which are useful for understanding the performance of the underlying service. In k8s you can rely on them to determine the limits required to run the service.

I suggest refactoring the code in the following way:

- You may keep your HTTP server goroutine, but it should send a ping request to the UDP server and receive a reply back. This way it will probe the status of the UDP server.
- You may keep the code under one main which runs both the UDP and the HTTP server, but you must expose the proper ports for them in k8s: one to serve incoming UDP requests, and another to expose HTTP for the health probes.
- I suggest putting the Prometheus metrics back and using them within the UDP server. What we should measure is the performance of the UDP server, not of the HTTP one. You may create a new UDP request for that, similar to how the original code does the ping/pong exchange. The new UDP request will return the UDP server's Prometheus metrics, and your HTTP server will need another endpoint, /metrics, which will send this UDP request to the UDP server to get its metrics.
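
To illustrate the first suggestion, here is a minimal sketch of a /health handler that probes the UDP server with a ping and only reports success when a pong comes back. It is not the udp-collector code: the `ping`/`pong` payloads, the 500 ms timeout, and the names `startUDPServer`, `probeUDP`, `healthHandler`, and `healthStatus` are all assumptions made for the example, and the toy server merely stands in for the real one.

```go
package main

import (
	"fmt"
	"net"
	"net/http"
	"net/http/httptest"
	"time"
)

// startUDPServer starts a toy UDP server on a free localhost port that
// answers "ping" with "pong". It is a stand-in for the real UDP server.
func startUDPServer() (*net.UDPConn, error) {
	conn, err := net.ListenUDP("udp", &net.UDPAddr{IP: net.IPv4(127, 0, 0, 1), Port: 0})
	if err != nil {
		return nil, err
	}
	go func() {
		buf := make([]byte, 1024)
		for {
			n, addr, err := conn.ReadFromUDP(buf)
			if err != nil {
				return // the connection was closed
			}
			if string(buf[:n]) == "ping" {
				conn.WriteToUDP([]byte("pong"), addr)
			}
		}
	}()
	return conn, nil
}

// probeUDP sends one "ping" and waits up to timeout for a "pong" reply.
func probeUDP(addr string, timeout time.Duration) error {
	conn, err := net.Dial("udp", addr)
	if err != nil {
		return err
	}
	defer conn.Close()
	if err := conn.SetDeadline(time.Now().Add(timeout)); err != nil {
		return err
	}
	if _, err := conn.Write([]byte("ping")); err != nil {
		return err
	}
	buf := make([]byte, 16)
	n, err := conn.Read(buf)
	if err != nil {
		return err
	}
	if string(buf[:n]) != "pong" {
		return fmt.Errorf("unexpected reply %q", buf[:n])
	}
	return nil
}

// healthHandler reports 200 only when the UDP server answers the probe,
// and 503 otherwise (hypothetical timeout of 500 ms).
func healthHandler(udpAddr string) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		if err := probeUDP(udpAddr, 500*time.Millisecond); err != nil {
			http.Error(w, "udp server unreachable: "+err.Error(), http.StatusServiceUnavailable)
			return
		}
		fmt.Fprintln(w, "ok")
	}
}

// healthStatus calls the handler in-process and returns the HTTP status code.
func healthStatus(udpAddr string) int {
	rec := httptest.NewRecorder()
	healthHandler(udpAddr)(rec, httptest.NewRequest(http.MethodGet, "/health", nil))
	return rec.Code
}

func main() {
	srv, err := startUDPServer()
	if err != nil {
		panic(err)
	}
	addr := srv.LocalAddr().String()
	fmt.Println("while the UDP server is up:", healthStatus(addr))
	srv.Close()
	fmt.Println("after the UDP server stops:", healthStatus(addr))
}
```

Because the probe travels over the same UDP path that clients use, a hung or stopped UDP server makes /health fail, which is the property the k8s probe needs. A /metrics endpoint could follow the same request/reply pattern with a different payload.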

nikodemas · 2024-11-19T15:47:59Z

@vkuznet thanks for your comments. I have adjusted everything accordingly:

- The monitoring service now constantly sends ping messages to the UDP server, and the /health endpoint checks the time elapsed since the last pong was received.
- The Prometheus metrics are added back to the UDP server monitor script.
- udp_server and udp_server_monitor are now started from one main, since I couldn't start them with a shell script from a distroless image. Moreover, having a shell script no longer makes sense because everything can now be started with a single Go executable, so I would like to keep the shell script deleted.
- I also updated the version in the GitHub Actions workflow, as it was failing because of a package mismatch.
- After this is merged, I should update go.mod and main.go to reference github.com/dmwm/udp-collector/ again instead of the local folder.
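
The "time since the last pong" check described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the merged code: the `pongTracker` type, the 10-second threshold, and the helper names are hypothetical, and the loop that actually sends pings to the UDP server is left out so that only the staleness logic is shown.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"sync"
	"time"
)

// pongTracker remembers when the last pong arrived from the UDP server.
type pongTracker struct {
	mu       sync.Mutex
	lastPong time.Time     // zero value means no pong has been seen yet
	maxAge   time.Duration // hypothetical staleness threshold
}

// recordPong is what the ping loop would call whenever a pong is received.
func (p *pongTracker) recordPong(t time.Time) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.lastPong = t
}

// healthy reports whether a pong was seen no longer than maxAge before now.
func (p *pongTracker) healthy(now time.Time) bool {
	p.mu.Lock()
	defer p.mu.Unlock()
	return !p.lastPong.IsZero() && now.Sub(p.lastPong) <= p.maxAge
}

// ServeHTTP implements the /health endpoint: 200 if healthy, 503 otherwise.
func (p *pongTracker) ServeHTTP(w http.ResponseWriter, r *http.Request) {
	if !p.healthy(time.Now()) {
		http.Error(w, "no recent pong from udp server", http.StatusServiceUnavailable)
		return
	}
	fmt.Fprintln(w, "ok")
}

// status calls the handler in-process and returns the HTTP status code.
func status(p *pongTracker) int {
	rec := httptest.NewRecorder()
	p.ServeHTTP(rec, httptest.NewRequest(http.MethodGet, "/health", nil))
	return rec.Code
}

// demoStatuses walks through three situations: no pong yet, a fresh pong,
// and a pong that is older than the threshold.
func demoStatuses() []int {
	p := &pongTracker{maxAge: 10 * time.Second}
	codes := []int{status(p)}
	p.recordPong(time.Now())
	codes = append(codes, status(p))
	p.recordPong(time.Now().Add(-time.Minute))
	codes = append(codes, status(p))
	return codes
}

func main() {
	fmt.Println(demoStatuses()) // prints [503 200 503]
}
```

Compared with probing on every /health request, this design keeps the HTTP handler cheap and decouples the probe frequency from how often k8s calls the endpoint; the trade-off is that a failure is only noticed once the last pong becomes older than the threshold.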

nikodemas added 3 commits October 23, 2024 12:34

Update health checks

5f38000

delete udp_server_monitor from makefile

0a6dd32

Cleanup repository and update README

fc05bcb

nikodemas requested a review from vkuznet October 25, 2024 15:48

nikodemas added 6 commits November 11, 2024 15:08

add back udp_server_monitor and adjust the code

df43447

revert the updates on Makefile

76541c8

update command to get pid

ae6a9b5

another pid retrieval update

60ee294

Add prometheus metrics export

90b94cd

update prom metrics names and README

fba6c5c

nikodemas force-pushed the master branch from ff1f275 to fba6c5c Compare November 14, 2024 14:12

nikodemas added 3 commits November 18, 2024 16:47

Refactor code to stop using script for starting

83c08cd

Update monitoring script

0cfaa79

update README, GH actions and execution command

7f58261

nikodemas mentioned this pull request Nov 19, 2024

Prepare udp-collector for Kubernetes dmwm/CMSKubernetes#1559

Merged

vkuznet requested changes Nov 19, 2024

Update build details

2c94c26

nikodemas force-pushed the master branch from 7a2f04f to 2c94c26 Compare November 25, 2024 16:18

nikodemas requested a review from vkuznet November 25, 2024 16:20

vkuznet approved these changes Nov 25, 2024

vkuznet merged commit 551b960 into dmwm:master Nov 25, 2024
1 check passed
