Narrative Service: Integrate Clinical Trials and Preprints from PubPharm #290

Open
HermannKroll opened this issue Jul 3, 2024 · 1 comment

Comments

@HermannKroll
Owner

HermannKroll commented Jul 3, 2024

Load preprints and clinical trials from PubPharm. We need to adjust the following script to crawl the collections and convert the updated document data into a JSON format. The script allows us to retrieve documents that have been modified since a specific date.

The attached script must be adjusted and integrated into NarrativeAnnotation:

  • integrate a proper main + argparse parser
  • argparse parameter: collections (string, required) is used to adjust the crawling URL
  • argparse parameter: output collection for our service (the name we want to give the collection)
  • argparse parameter: last_update, a date (YYYY-MM-DD), is used to adjust the crawling URL (only documents updated since this date should be retrieved)

#283 has to be resolved first (otherwise we cannot handle updated documents).

@HermannKroll
Owner Author

UB has to evaluate the data quality (as discussed in our last meeting).
