Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Easy MULTI_INPUT_JSON specification with for-each #95

Merged
merged 2 commits into from
Dec 4, 2024

Conversation

hannahbast
Copy link
Member

@hannahbast hannahbast commented Dec 3, 2024

So far, specifying the MULTI_INPUT_JSON was cumbersome when the input consists of many files that are assigned to few graphs via a pattern. A typical case is when all files in a directory or with a certain prefix should be assigned to the same graph. There is now a nice and simple syntax for this. Here are two example elements of a MULTI_INPUT_JSON array, which should be self-explanatory:

{ "cmd": "zcat {}", "graph": "http://example.org/graph1", "for-each": "dir1/*.ttl.gz" }
{ "cmd": "zcat {}", "graph": "http://example.org/graph2", "for-each": "dir2/*.ttl.gz" }

Update the Qleverfile for UniProt to use this new feature (and read the input as gzipped TTL files, and not N-Triples as before).

Hannah Bast added 2 commits December 2, 2024 21:26
So far, specifying the `MULTI_INPUT_JSON` was cumbersome when the input
consists of many files, assigned to few graphs via a pattern. For
example, when all files in a directory should be assigned to the same
graph. There is now a nice syntax for this typical case, here is an
example element of the `MULTI_INPUT_JSON`, which should be self-explanatory:

{ "cmd": "zcat {}", "graph": "http://example.org", "for-each": "ex/*.gz" }

Update the `Qleverfile` for UniProt to use this new feature (and read
the input as gzipped TTL files, and not N-Triples as before).
@hannahbast hannahbast merged commit 547add9 into main Dec 4, 2024
3 of 4 checks passed
hannahbast added a commit that referenced this pull request Dec 17, 2024
Since #77, the Qleverfile can specify multiple input streams, with an individual configuration per stream. Since #95, there is an easy way to assign a graph to all files matching a certain pattern. This is now used for the PubChem Qleverfile.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant