-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ensure correct data-type values in DBS injection requests #10844
Comments
Here are full details of the flaw in DBS python server implementation. Let's start with example of using DBS client API:
As you can see the code above will use a string value of
Now, we can look-up results in DBS via curl call (scurl here refers to curl command with provided grid certs):
As you can see from above, even though we provided Now, let's confirm that it is an issue with DBS server and not DBS client. For that we'll verify that new value for processing era does not exists in DBS and inject it via curl call using JSON with string value of processing version.
With
The reported error says that we can't unmarshal (the marshal/unmarshal names used for encode/decode in Go) the provided JSON because ProcessingEras.processing_version attribute is type of int64. Using this example I conclude that there are two kind of issues with DBS python implementation:
Therefore, I expect that due to issue one none of the Python clients check data-types of JSON request and may pass incorrectly wrong data-type to DBS server. Since DBS server is casting under the hood values we never saw this problem. But if this is the case and when we move to |
@yuyiguo , @klannon @KatyEllis @amaltaro @todor-ivanov I provided concrete example (sorry it is lengthy technical one), but it shows the flaw in DBS python server implementation which we should address in all python based clients before we'll move on to use Go implementation of DBS server. The example is based on DBS unit tests, but it affects almost all int based attributes used in JSON requests for data injection into DBS. There are different ways to resolve this problem, e.g.
|
@belforte this issue affects CRAB as well, therefore please pay attention to it and ensure that CRAB client will take care of proper data-type for injection DBS APIs. |
@vkuznet instead of extracting what needs to be adapted from your PR. Could you please provide a full list of what those parameters are and what is the expected data type to be provided for them? I understand it's mostly for write APIs, so if you happen to know what are the exact API names concerned here, it would be quite helpful as well. |
The APIs in questions are all write APIs we use. These APIs uses JSON payload to send data to DBS, and therefore all data you send should have proper data-types. I already provided full documentation here along with exact structure of JSON payload such that you should not guess and only can have at posted JSON examples over there. For DBS GET APIs there is no issue since they rely on HTTP URI ( If you need to know exact data type used by DBS DB I suggest to use DBS DB schema directly. For instance, the dbs2go post apis documentation provides example for
which matches DB schema definition for PROCESSING_VERSION as |
and if you'll pay closer look to dbs2go documentation it also provides pointer to Go code structures used to parse input JSON, e.g. in case of processingeras it points to ProcessingEras where you can find that the ProcessingEra struct has all data-types, in this case:
So, it defines |
Okay, we will have to go through all the write APIs and ensure that our application is defining those data types according to what you documented for the DBS2Go server. By the way, I'm updating the initial description here providing a link to consult this full documentation. |
can't we assume that existing code sends correct data until/unless the new server complains ? |
yes you can, and when Alan provide me the pointer to FwkJobReport code I checked that majority of data are send correctly, e.g. https://github.com/dmwm/WMCore/blob/master/src/python/WMCore/FwkJobReport/Report.py#L811 and other lines indicate that this code correctly cast data-types before producing Fwjr document. But since I don't know all places in WMCore/CRAB code I can't say for sure. |
@vkuznet as we discussed today, this is mostly related to the write APIs - thus relying on DBSWriter - and given that we have not yet started the Go-based DBSWriter validation (using the default server URI), I am moving it back to the ToDo list. We still have to review our write APIs and the data type/format of the data we post to the DBS server through the Writer instance. Once we can start testing the Go-based writer instance, we can resume this activity. Please let me know if I missed anything. Thanks! |
@amaltaro everything is correct. And, I assume that DBSWriter tests are scheduled for Q2. I only want to point out that from server point of view everything is ready for tests and DBSWriter is available on testbed under |
@vkuznet is there anything else that should be considered in this ticket? There is one other piece of WMCore code that writes to DBS, the @belforte Stefano, just a FYI. With all the tests that Valentin did against Go-based DBSWriter, the only issue we had was with |
Alan, I think in terms of development everything is in place. But per our previous discussion it would be good to install new WMAgent with this code in place and re-do integration tests again. Once this will be done we can be ready for migration to new DBS Writer. |
Sounds good! How about we start discussing a transition of the DBSWriter in cmsweb-testbed then? Even though the patch you made provides database schema change, the DAOs do the right data type casting, so we can easily apply those patches to our "old" testbed WMAgents to move on with further tests. |
Alan, we're ready to make a switch. For that we will only need to perform change of redirect rules in cmsweb-testbed and point DBSWriter from python based code to Go implementation. I suggest that we discuss this in upcoming WMCore meeting. |
@vkuznet Valentin, while updating the Oracle preprod password, I happen to find an Oracle unit test that started failing with these changes. We still do not test PRs against Oracle backend, one of the reasons is because it has to be executed in a single node to avoid database concurrent access (create/destroy) and it takes ~1.5h to run. Anyhow, I managed to separate the daily builds we have on oracle (actually twice a day). Here is the last build without your changes to processing_version: so, there is a total of 56 new failures in Oracle-based unit tests. Here is one example: It's likely that those failures share a common root cause, so there is no need to investigate all of the 56 errors. |
Alan, turns out WMCore has plenty of processingVer defaults pointed to None. I scan WMCore code for them, based on jenkins test failures and I think I identified the final set, unless the default values of processingVer at a different lines. Please include this PR #11059 in jenkins tests and we will see how it is going. |
Impact of the new feature
I found through DBS unit tests that DBS client allows to pass wrong data-types to DBS server, mostly represent integer as string, e.g.
'123'
which then is casted to int data-type in DBS server. This is not allowed in Go implementation since it follows strict data-type. The issue here is Python itself which does not enforce strict data-typing. As such a special care should be done on a client side to avoid sending wrong data-type data via HTTP requests.Is your feature request related to a problem? Please describe.
Yes, in Go implementation it is impossible to send wrong data-type via HTTP request
Describe the solution you'd like
All WMCore tools should ensure to send proper data-types in DBS injection requests.
Full API documentation can be found at: https://github.com/vkuznet/dbs2go/blob/master/docs/apis.md#post-apis
Describe alternatives you've considered
None
Additional context
You may refer to DBS#657 which demonstrate the issue and provide a fix for DBS unit test client.
The text was updated successfully, but these errors were encountered: