[Dataset]: BNE Source Needs Refactored #187
Labels
enhancement
New feature to add to the code
Medium
Medium priority task
v0.0.2
Change being developed for v.0.0.2
Priority Level
Medium
Dataset Name
BNE
Description
BNE is the national library of Spain. We already have it as a source, but it is set up from an old dump file. There are better files available here: https://datos.gob.es/en/catalogo?publisher_display_name=Biblioteca+Nacional+de+Espa%C3%B1a.
Data Access Method
Base URL: https://datos.gob.es/en/catalogo
Example URLs: https://www.bne.es/media/datosgob/catalogo-autoridades/entidad/entidad-JSON.zip
https://datos.gob.es/en/catalogo/ea0019768-catalogo-de-autoridades-geografico-JSON.zip
Data Format
Format: JSON-LD
JSON example:
Entity Matching
The data has fields like
otros_identificadores
andotros_codigos_identificacion
to store sameAs URIs. They map to LC and BNF.Technical Requirements
Known Limitations
No response
Example Integration
Add to the BNE config the URLs for the datasets from BNE that we would like to harvest, in the remote dump files block. Ensure these files can be downloaded via existing download code. Write a loader to load these files into the BNE datacache. Refactor the existing BNE mapper to use this new data structure.
The text was updated successfully, but these errors were encountered: