Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Dataset]: Refactor LC to Use ActivityStreams #195

Open
4 tasks
kkdavis14 opened this issue Dec 18, 2024 · 0 comments
Open
4 tasks

[Dataset]: Refactor LC to Use ActivityStreams #195

kkdavis14 opened this issue Dec 18, 2024 · 0 comments
Assignees
Labels
enhancement New feature to add to the code Medium Medium priority task v0.0.2 Change being developed for v.0.0.2

Comments

@kkdavis14
Copy link
Contributor

kkdavis14 commented Dec 18, 2024

Priority Level

Low

Dataset Name

LCNAF & LCSH

Description

This ticket is to research and possibly implement harvesting LC data via their ActivityStreams.

Data Access Method

https://id.loc.gov/authorities/subjects/activitystreams/feed/1.json
https://id.loc.gov/authorities/names/activitystreams/feed/1.json

Data Format

ActivityStreams format with JSON data available

{
"@context": [
"https://www.w3.org/ns/activitystreams#",
"https://emm-spec.org/0.1/context.json",
     {
       "madsrdf": "http://www.loc.gov/mads/rdf/v1#",
       "skos": "http://www.w3.org/2004/02/skos/core#",
       "bf": "http://id.loc.gov/ontologies/bibframe/"
     }
   ],
"summary": "Page 1 of 120223 pages in http://id.loc.gov/authorities/names/activitystreams/feed",
"type": "OrderedCollectionPage",
"id": "http://id.loc.gov/authorities/names/activitystreams/feed/1",
"partOf": "http://id.loc.gov/authorities/names/activitystreams/feed",
"next": "http://id.loc.gov/authorities/names/activitystreams/feed/2",
"orderedItems": [
     {
        "type": "Update",
        "published": "2024-12-18",
        "actor": "http://id.loc.gov/vocabulary/organizations/nlamu",
        "object": {
             "id": "http://id.loc.gov/authorities/names/no2024139904",
             "update": "2024-12-18",
             "type": [
                 "madsrdf:PersonalName",
                 "madsrdf:SimpleType",
                 "madsrdf:Name",
                 "madsrdf:Authority"
              ],
             "url": [{
                 "type": "Link",
                 "href": "http://id.loc.gov/authorities/names/no2024139904.rdf",
                 "mediaType": "application/rdf+xml"
               },
            {
                "type": "Link",
                "href": "http://id.loc.gov/authorities/names/no2024139904.json",
                "mediaType": "application/json"

Entity Matching

sameAs properties exist in the data which we already leverage in the mappers.

Technical Requirements

  • Research if using this AS is feasible for our use
  • Create a new harvester for the LC ActivityStreams implementation
  • Refactor LC mappers
  • Refactor LC index loaders

Known Limitations

No response

Example Integration

First, we need to investigate the ActivityStream LC is vending to see if it is feasible for our use. If acceptable, write LC harvesters to grab data via the AS. Refactor the mappers to use the data in this new format (may not be necessary). Refactor the index loaders to use data in the new format.

@kkdavis14 kkdavis14 added the enhancement New feature to add to the code label Dec 18, 2024
@kkdavis14 kkdavis14 self-assigned this Dec 18, 2024
@kkdavis14 kkdavis14 changed the title [Dataset]: LC ActivityStreams [Dataset]: Refactor LC to Use ActivityStreams Jan 6, 2025
@kkdavis14 kkdavis14 added the v0.0.2 Change being developed for v.0.0.2 label Jan 7, 2025
@kkdavis14 kkdavis14 added the Medium Medium priority task label Jan 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature to add to the code Medium Medium priority task v0.0.2 Change being developed for v.0.0.2
Projects
None yet
Development

No branches or pull requests

1 participant