OH 4.3.0/4.3.1 is never getting past startlevel 30 for some users #4547
Comments
This issue has been mentioned on openHAB Community. There might be relevant details there: |
This issue has been mentioned on openHAB Community. There might be relevant details there: https://community.openhab.org/t/rule-cron-blocking-openhab/161469/42 |
This issue has been mentioned on openHAB Community. There might be relevant details there: https://community.openhab.org/t/rule-engine-not-running-although-bundle-active/161517/10 |
I was able to reproduce the invalid cron expression issue by adding this rule in a file:
rule "Test"
when
    Time cron "0 0 5 31 2 ? ?"
    // s m hh DD MM WW YYYY
then
    // Nothing
end
The rule will stay in state |
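The trigger above asks for day 31 of month 2, a date that never occurs. As a rough illustration of why that matters (a sketch only, not the scheduler code used by openHAB), the hypothetical helper `next_matching_date` below searches forward for the next date with a given day and month. The `max_days` cap is an assumption added for the sketch; without some bound like it, a search for a date that never comes would never end.

```python
from datetime import date, timedelta
from typing import Optional


def next_matching_date(day: int, month: int, start: date,
                       max_days: int = 366 * 8) -> Optional[date]:
    """Return the first date on or after `start` with the given day and month.

    Returns None if no such date is found within `max_days` steps.
    The cap is what keeps an impossible day/month pair from looping forever.
    """
    current = start
    for _ in range(max_days):
        if current.day == day and current.month == month:
            return current
        current += timedelta(days=1)
    return None


if __name__ == "__main__":
    # 29 February exists in leap years, so the search finds a date.
    print(next_matching_date(29, 2, date(2025, 1, 1)))
    # 31 February never exists, so the capped search gives up and returns None.
    print(next_matching_date(31, 2, date(2025, 1, 1)))
```

The eight-year default for `max_days` is chosen so that even 29 February is found across a skipped century leap year.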
You might want to add this as well: https://community.openhab.org/t/rules-no-longer-executing-after-upgrade-to-oh-4/161174/4 |
I'm inclined to believe it's related to memory usage. Let's collect information here to help with diagnosis. Anyone who has the problem that file-based DSL rules do not execute, please: Check If you know of any specific log setting that would help with further diagnosis, or of a log message that should appear but does not, let us know. |
Have started experiencing something quite similar after upgrading from 4.2 to 4.3. For me the issue occurs occasionally. I shut down OH via shell script every night to do some backup and maintenance, and then restart again after a few minutes. This has worked well for many years. Output of java --version:
Let me know in case you would want further info on the system. |
Out of curiosity, are you running bindings that use jupnp? I have an utterly enormous amount of memory usage because of my upnp environment. If I disable those bindings, memory usage goes way down. I ended up moving my OH off to a higher end Intel NUC11 to make sure I had plenty of memory to run everything. |
no jupnp, not that I'm aware |
FWIW I now have a 4.3.1 instance that says it's at runlevel 100 - but log output says the engine doesn't run @rkoshak pls change this issue's title as the runlevel IS 100 |
I'm not sure that there aren't two, if not three, different issues going on here. One issue has been identified and I think it's been fixed already: giving Quartz a bogus cron expression leads to an infinite loop. However, others are stuck at start level 30, and now you are reporting getting past that, with no cron expressions involved. The only real commonality I can see is that everyone with the problem is using file-based Rules DSL and a lot of RAM (though is it an unusual amount of RAM? 🤷). I don't know if it still makes sense to have these all combined in one issue, or if it makes sense to split them up, as they might not be related problems after all. |
Are we sure that in the original reports we couldn't get past runlevel 30, or did you only indirectly assume that because no rule was executing? On RAM, I agree. I have 64-bit installations that work (with more or less the same config) and are of equal size (4+ G), which is huge but apparently not unusual and not in itself a symptom or problem. |
Two of the three threads I linked to above confirmed that the runlevel was stuck at 30 before I created the issue. Your thread actually popped up while I was creating the issue and I didn't ask for confirmation there. Checking the runlevel was one of the first things I asked about on the other threads though. Your report is the first that shows a problem with OH making it past runlevel 30, so that kind of indicates it might be a different issue with the same symptom. We know for sure that one issue was the cron expression issue. Those experiencing the problem without a bogus cron trigger remain without an explanation or mitigation. Overall, it would be very useful if those who have cron-triggered rules and are able to can test with the snapshots (I know that's a bigger lift now with OH 5 than usual) and confirm their problem is fixed. Then maybe we can tell whether those who got stuck at start level 30 were stuck because of the infinite loop, and separate them from users experiencing the same symptom for different reasons. |
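For anyone who wants to check whether their own rules files contain a cron trigger like the one that reproduced the loop, here is a rough Python sketch. It is not part of openHAB; the function name `impossible_cron_triggers` is made up for illustration, and it assumes the field order shown in the reproduction rule (seconds, minutes, hours, day of month, month, day of week, optional year). It only catches the simple case of a plain numeric day that cannot occur in a plain numeric month.

```python
import re
from typing import List

# Largest possible day number per month (February counted as 29 for leap years).
MAX_DAY = {1: 31, 2: 29, 3: 31, 4: 30, 5: 31, 6: 30,
           7: 31, 8: 31, 9: 30, 10: 31, 11: 30, 12: 31}

# Matches Rules DSL triggers of the form: Time cron "<expression>"
CRON_TRIGGER = re.compile(r'Time\s+cron\s+"([^"]+)"')


def impossible_cron_triggers(rules_text: str) -> List[str]:
    """Return cron expressions whose fixed day of month cannot occur in the fixed month."""
    flagged = []
    for expression in CRON_TRIGGER.findall(rules_text):
        fields = expression.split()
        if len(fields) < 6:
            continue  # not enough fields to contain day of month and month
        day, month = fields[3], fields[4]
        # Only plain numbers are checked; wildcards, ranges, lists and names are skipped.
        if day.isdigit() and month.isdigit():
            if int(month) in MAX_DAY and int(day) > MAX_DAY[int(month)]:
                flagged.append(expression)
    return flagged


if __name__ == "__main__":
    sample = '''
    rule "Test"
    when
        Time cron "0 0 5 31 2 ? ?"
    then
    end

    rule "Daily"
    when
        Time cron "0 0 5 * * ?"
    then
    end
    '''
    for expression in impossible_cron_triggers(sample):
        print("Suspicious cron trigger:", expression)
```

To scan real files, read each `.rules` file as text and pass its contents to the function. An empty result does not prove a file is safe, since ranges, lists, and month names are not inspected.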
@rkoshak All bugfixes should be backported to 4.3.x branch, and will thus be included in our 4.3.2 patch release (expected later today). This should be easier to test for users who have not yet moved to 64-bit/Java 21. |
I was hoping it was going to be backported, and I didn't know 4.3.2 was coming out so soon. |
Expected Behavior
When OH starts the rule engine starts and begins executing rules.
Current Behavior
It's been reported at least three times now on the forum:
The first link has the most thorough investigation. I'm not positive that it's the same root problem for all three as they differ in details.
In all three cases the problem appeared when upgrading to 4.3.1, but backporting to 4.3.0 does not fix the problem. So it might be a regression from before 4.3.0 release.
File based Rules DSL rules seem to be involved in all three cases.
Possible Solution
Steps to Reproduce (for Bugs)
This is tricky because clearly it's not happening for everyone.
Context
Users who face this have no rules running.
Mitigation is to go back to 4.2.
Your Environment