-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Blocks failing to be inserted into DBS - submit4 #10887
Comments
@yuyiguo Yuyi, is there any easy way to find out why these blocks are being rejected by the server (or maybe even by the dbs3-client?)? I see something strange in that json dump:
vs
Could this be the reason why this block fails to be inserted (I did not check whether the other blocks have the same issue though). |
Which DBS server were these blocks inserted ? |
"Merged" is in multiple configs in prod. |
I did not see any problems from the data. I was able to upload your example block to int db |
I need more info from you in order to debug the problem:
|
Hi @yuyiguo The logs are in |
@yuyiguo thanks for the prompt feedback. I understand that the json construction is okay then, and that that Answering your questions:
is actually failing to be inserted into DBS since 15/Oct (!!!). The same operation fails over and over, every few minutes (sometimes every hour or two). Here is the last 2 timestamps from the log (local time, thus FNAL time):
|
@muhammadimranfarooqi thanks for the quick reply. I am looking for the log for dbs server on VMs, not the front end. |
@muhammadimranfarooqi I found them. |
You can find logs in |
@amaltaro
The block was inserted into DBS Friday, October 15, 2021 12:21:39 PM GMT-05:00 DST. |
Haaa, my guess was correct then. I do not think there were any changes on the DBS Server side over the last 6 months or more, right? The component code expects this error: but it is no longer returned. Which likely means that that error report is no longer properly handled either in WMCore, or in the dbs3-client itself. Anyhow, now I should have all the necessary information to properly debug and provide a fix for this. Thanks again, Yuyi! |
Issue identified and fix provided in: dmwm/DBS#660 And here is the command to patch dbs3-client in our agents:
followed by a restart of the DBS3Upload component:
|
All the production and some of the testbed agents have been patched. This issue can be closed as soon as the DBS pull request gets merged. |
Impact of the bug
WMAgent
Describe the bug
Before shutting down cmsgwms-submit4, which is running 1.5.2 and is fully drained, I noticed that there is a handful of blocks that are failing to be inserted into DBS:
error in the DBS3Upload is
400 Bad Request
.It is not clear whether this same issue is also happening in other agents or not. We should verify that.
How to reproduce it
Not sure
Expected behavior
It would be helpful to know further details of why the bulk insert call fails. Besides that, we should of course make sure that the block dump information is correct, and if so, it should be successfully inserted into DBS Server.
Additional context and error message
A dump of one of those blocks can be found at: https://amaltaro.web.cern.ch/amaltaro/forWMCore/Issue_10887/dbsuploader_block.json
The text was updated successfully, but these errors were encountered: