Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Logging rework #141

Merged
merged 1 commit into from
Sep 12, 2024
Merged

Conversation

m-appel
Copy link
Member

@m-appel m-appel commented Sep 10, 2024

All crawlers now log only using the logging module and not using stderr anymore. Also the IYP module now automatically logs node/relationship retrieval and creation. As a consequence, some crawlers are a bit chatty, which should be fixed in the future.

The general log level was changed to INFO and the level name is now included in the logs instead of the process name.

As parts of this also some crawlers got reworked:

  • alice_lg: Fixed unit test that checked for relationship types which are not created.
  • ihr:
    • Reference modification time is more precise, matching the exact timebin used
    • Data is handled in-memory, removing need for temporary files
    • Moved to batch creation, making the crawler faster and removing dependency on other crawlers

All crawlers now log only using the logging module and not using stderr
anymore. Also the IYP module now automatically logs node/relationship
retrieval and creation. As a consequence, some crawlers are a bit
chatty, which should be fixed in the future.

The general log level was changed to INFO and the level name is now
included in the logs instead of the process name.

As parts of this also some crawlers got reworked:

- alice_lg: Fixed unit test that checked for relationship types which
  are not created.
- ihr:
  - Reference modification time is more precise, matching the exact
    timebin used
  - Data is handled in-memory, removing need for temporary files
  - Moved to batch creation, making the crawler faster and removing
    dependency on other crawlers
Copy link
Member

@romain-fontugne romain-fontugne left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks, that looks good to me. That will run on the 15th

@romain-fontugne romain-fontugne merged commit 078e5de into InternetHealthReport:main Sep 12, 2024
1 check passed
@m-appel m-appel deleted the log-rework branch November 15, 2024 05:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants