Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DNS remodeling #119

Merged

Conversation

romain-fontugne
Copy link
Member

@romain-fontugne romain-fontugne commented Feb 2, 2024

Remodeling of DNS data

Description

After multiple discussions we decided to improve the way we model DNS data. The main changes include de-coupling the DomainName and HostName nodes.

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • [ x] Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.

@romain-fontugne
Copy link
Member Author

I have done most of the changes we discussed. Only things left are the cloudflare crawlers.

@romain-fontugne romain-fontugne changed the title Dns remodeling DNS remodeling Feb 2, 2024
@romain-fontugne
Copy link
Member Author

Forgot to lower umbrella in the config (readme should also be updated)

@romain-fontugne
Copy link
Member Author

I have adjusted all crawlers for the new DomainName and HostName node definition.

Still left: add (:IP)-[:SERVE]-(:Service) links.

@romain-fontugne
Copy link
Member Author

I did a full run and got two errors:

  • Couldn't get the MANRS data
  • flatdict exception in ripe.atlas_probes

These are completely unrelated to the modifications so I think we can now merge this PR. @m-appel are you planning any other changes here?

@m-appel
Copy link
Member

m-appel commented Feb 7, 2024

No, I think we can merge it (actually squash and merge, since it's many commits). I had the MANRS problem yesterday as well, but worked on rerun, I think they had some problems.

@romain-fontugne romain-fontugne marked this pull request as ready for review February 7, 2024 07:55
@romain-fontugne romain-fontugne merged commit 838c8be into InternetHealthReport:main Feb 7, 2024
1 check passed
MAVRICK-1 added a commit to MAVRICK-1/internet-yellow-pages that referenced this pull request Feb 7, 2024
…operty

DNS remodeling (InternetHealthReport#119)

* update url2domain to url2hostname

* remove iana root zone file and dns hierarchy from config file

* Atlas measurement targets are now hostnames

* update openintel crawlers to the new DNS model

* umbrella now ranks a mix of DomainName and HostName nodes and should be run after openintel.umbrella1m

* Add explanation for cloudflare DNS modeling

* lower umbrella crawler in config file

* update READMEs with the new DNS modeling

* add (:Service {name:'DNS'}) node and link it to authoritative name servers

* Nodes do not have reference properties

* Normalize IPv6 addresses

* Fix wrong crawler name

* Typos and formatting

* Remove infra_mx crawler since it does not do anything at the moment

* Update Cisco Umbrella crawler

- Batch create new nodes (happens more often than expected)
- Add logging output
- Do not use builtins as variable names

* Remove redundant set and parameters

* Remove Service node for now

We could not decide on a name, so we will deal with this later.

---------

Co-authored-by: Malte Tashiro <[email protected]>

Add OpenINTEL DNS dependency crawler

Integrate with existing files and remove some unnecessary stuff.

Co-authored-by: Raffaele Sommese <[email protected]>

precommit error rectified

Update __init__.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants