Log when paths are rejected because of robots.txt #27

Open
mthuurne opened this issue Feb 28, 2021 · 0 comments
Labels
enhancement New feature or request

Comments

@mthuurne
Member

Currently, if a path is not fetched because a robots.txt file excludes it, the user is not notified. That makes it non-obvious why APE doesn't spider the entire site.

It would be useful if rejected paths were logged, either at the default level or with a single --verbose option. One possibility: log each rejected path with --verbose and log only a count at the default level. The count would at least hint to the user that something is going on that --verbose would explain in more detail.
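
A rough sketch of the two-level idea, not APE's actual code: the function name `filter_allowed`, the logger name, and the mapping of --verbose to `logging.INFO` are all assumptions made for illustration. It uses only the standard library's `urllib.robotparser` and `logging`.

```python
import logging
from urllib.robotparser import RobotFileParser

# Hypothetical logger name; APE's real logger setup may differ.
_LOG = logging.getLogger("robots_sketch")


def filter_allowed(urls, robots_lines, user_agent="*"):
    """Return the URLs permitted by the given robots.txt lines.

    Each rejected URL is logged at INFO (assumed to be what --verbose
    enables); a single summary count is logged at WARNING, which is
    visible at Python's default logging level.
    """
    parser = RobotFileParser()
    parser.parse(robots_lines)

    allowed = []
    rejected = 0
    for url in urls:
        if parser.can_fetch(user_agent, url):
            allowed.append(url)
        else:
            rejected += 1
            _LOG.info("Skipped because of robots.txt: %s", url)

    if rejected:
        _LOG.warning(
            "%d path(s) skipped because of robots.txt; "
            "use --verbose to list them",
            rejected,
        )
    return allowed
```

With the default configuration only the summary line would appear; calling `logging.basicConfig(level=logging.INFO)` when --verbose is given would also show each skipped path.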

@mthuurne mthuurne added the enhancement New feature or request label Feb 28, 2021