-
Notifications
You must be signed in to change notification settings - Fork 273
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
For exists(), more precise error handling. #757
base: main
Are you sure you want to change the base?
Conversation
@martindurant this implements solution 2) mentioned in #750. I am happy to change it to another solution, as discussed. |
I think 2) is the right choice. |
Can we make a test which surfaces the ConnectionError? Perhaps just set the endpoint_url to something silly. |
I will try to think of something, and also attempt to add tests for the other changes. |
I created test coverage for two of the three new cases:
However, I cannot figure out how to create good test coverage for the third path, the
If this is good enough, it is already in the PR. But I am happy to improve this one. |
Yes, that test is fine - it is, of course, failing on permissions with real AWS in this case (moto does not enforce credentials, which is why you couldn't test with that) |
|
||
def test_exists_bucket_nonexistent(s3, caplog): | ||
# Ensure that NO warning is raised and False is returned if checking bucket | ||
# existance when we have full access. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well, the bucket might still exist, but it's not one we can see. We happen to know for the sake of the test that it doesn't exist.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure whether this is true for all S3 systems, but at least for our internal systems, I get a "PermissionError" when trying to access a bucket I cannot see (because I don't have permission to see it) but a FileNotFoundError
like in this test when I would have permission to see, but it simply doesn't exist. This is why in the PR change, the warning would only be raised in case of PermissionError
but not in case of FileNotFoundError
. Should I change it to return the warning in both cases?
I finally managed to set up pre-commit, I ran in SSL errors before. So hopefully the PR should not fail because of that any more. I also added changes as requested. My tests pass locally but I don't have the setup to test all Python versions currently. |
Co-authored-by: Martin Durant <[email protected]>
Merged PR but removed duplicate endpoint_url (this was raising a different error). In addition, the other test now clears warnings cache as well - this was the reason for the test failure. |
Forgot to run pre-commit, this is now fixed... But I am honestly not sure where the warnings are coming from, I even installed Python 3.8 locally and ran the tests there, they run fine... The problem is the |
The following may remove the warnings --- a/s3fs/core.py
+++ b/s3fs/core.py
@@ -541,6 +541,12 @@ class S3FileSystem(AsyncFileSystem):
@staticmethod
def close_session(loop, s3):
if loop is not None and loop.is_running():
+ try:
+ loop = asyncio.get_event_loop()
+ loop.create_task(s3.__aexit__(None, None, None))
+ return
+ except RuntimeError:
+ pass
try:
sync(loop, s3.__aexit__, None, None, None, timeout=0.1) |
Your failures look like genuine problems. For the client close warnings (which are not errors) #760 will fix this. |
If we don't manage to fix the warnings issue, I could also completely remove the warning from the PR. While I think it would be benefitial to warn users, it would already be an improvement if we manage to get rid of the generic |
This PR improves the error handling logic for
s3fs.S3FileSystem.exists()
and fixes #750First change is that there is no longer a generic
except Exception
block, meaning that e.g.ConnectionError
now correctly raises an error instead of returningFalse
when querying a bucket.Second change is that a warning is now logged when attempting to query the existence of a bucket that the user does not have access to - this is important since the bucket might actually exist. This is in line with Amazon's proposed behavior, see here.
For more information on this, see related issue #750.