-
Notifications
You must be signed in to change notification settings - Fork 163
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fence_scsi_check is leaving zombie processes #535
Comments
Is this on physical servers, VMs or cloud? Can you provide some more info e.g. distro, distro version, watchdog type (might be logged in /var/log/watchdog and/or /var/log/messages)? |
It is currently on VM (VMware). Disto: Rocky 8.6 /etc/watchdog.conf ;
|
For unresponsive block-devices (e.g. you can reproduce that by suspending via dm) you usually get a zombie when trying to address. And I think there is no real way out of it. |
Thanks a lot for the explanation. Actually the fence and rebooting functionality is working as expected as far as I could test. I was checking the documentation of watchdog and I have found this part;
So should I suppose that this is a problem only with the watchdog process. |
I wasn't aware of this mechanism in watchdog-daemon - that btw seem totally unrelated to the fence-scsi-script being called. |
I have a cluster with enabled watchdog. I have realized that fence_scsi_check is creating zombie processes continuously on both node.
I have tried to increase the "test-timeout" in /etc/watchdog.conf but it didn't solve the problem.
Honestly, it can be a problem by watchdog, I am not sure.
Here it is the output of watchdog;
The text was updated successfully, but these errors were encountered: