Description
The VoID vocabulary describes datasets, and the crawler should make use of this information. The following triples contain the URI http://dbpedia.org/sparql together with the information that this URI should be crawled as a SPARQL endpoint:
:DBpedia a void:Dataset ;
    void:sparqlEndpoint <http://dbpedia.org/sparql> .
Solution
At the moment, the RDF processing is not very intelligent: it simply stores all newly retrieved URIs. We would have to enhance the processing, e.g., by using the decorator pattern with decorator classes that handle special cases like this one. However, depending on the number of special cases we will have in the future, the decorator pattern might become too heavy.
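The decorator idea could look roughly like the following sketch. It is not the crawler's actual code: the class names `PlainHandler` and `SparqlEndpointDecorator`, the type labels `"dereference"` and `"sparql"`, and the subject URI `http://example.org/DBpedia` are assumptions made up for illustration. Triples are modelled as plain string tuples to keep the example self-contained.

```python
# Hypothetical sketch of the decorator approach; all names are illustrative.
VOID_SPARQL_ENDPOINT = "http://rdfs.org/ns/void#sparqlEndpoint"


class PlainHandler:
    """Baseline behaviour: report every HTTP URI in subject or object position."""

    def handle(self, triple):
        s, p, o = triple
        # Every URI gets the default crawl type (assumed label "dereference").
        return {uri: "dereference" for uri in (s, o) if uri.startswith("http")}


class SparqlEndpointDecorator:
    """Wraps another handler and marks void:sparqlEndpoint objects as SPARQL endpoints."""

    def __init__(self, inner):
        self.inner = inner

    def handle(self, triple):
        found = self.inner.handle(triple)
        s, p, o = triple
        if p == VOID_SPARQL_ENDPOINT:
            # Special case: this URI should be crawled as a SPARQL endpoint.
            found[o] = "sparql"
        return found


handler = SparqlEndpointDecorator(PlainHandler())
result = handler.handle(
    ("http://example.org/DBpedia", VOID_SPARQL_ENDPOINT, "http://dbpedia.org/sparql")
)
```

Each further special case would add one more wrapper around the handler chain, which is where the concern about the pattern becoming too heavy comes from.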
Additionally, the storage of newly found URIs might be an issue. In the example above, it would not be sufficient to add the type information, since the URI could already have been found before and the update might be rejected by the store. The same holds for the Frontier's queue, which might refuse to append the new URI because it is already known. In such cases, update strategies might be necessary.
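One possible update strategy is to let more specific type information overwrite less specific information instead of rejecting a known URI outright. The sketch below is only an assumption about how this might work: the class `UpgradingFrontier`, its `add` method, and the type ranking are hypothetical and do not reflect the real Frontier or store.

```python
from collections import deque


class UpgradingFrontier:
    """Hypothetical frontier that upgrades the type of an already known URI."""

    # Assumed ranking: a higher value means more specific type information.
    RANK = {"dereference": 0, "sparql": 1}

    def __init__(self):
        self.types = {}       # URI -> currently known crawl type
        self.queue = deque()  # URIs waiting to be crawled

    def add(self, uri, crawl_type):
        known = self.types.get(uri)
        if known is None:
            # First sighting: store the type and enqueue the URI.
            self.types[uri] = crawl_type
            self.queue.append(uri)
            return "added"
        if self.RANK[crawl_type] > self.RANK[known]:
            # Known URI, but the new information is more specific: update in place.
            self.types[uri] = crawl_type
            return "updated"
        # Known URI and nothing new to learn.
        return "ignored"


frontier = UpgradingFrontier()
first = frontier.add("http://dbpedia.org/sparql", "dereference")
second = frontier.add("http://dbpedia.org/sparql", "sparql")
third = frontier.add("http://dbpedia.org/sparql", "dereference")
```

Here the URI is queued only once, but its type ends up as `"sparql"` even though it was first seen as an ordinary URI. Whether a URI that has already been crawled under the old type should be re-queued is left open in this sketch.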