Amy to investigate options to re-map from the most recently fetched vernacular data; start with initial analysis of Nuxeo-based collections where there's only 1 version of fetched data that was published (vs. cases where there are multiple versions)
Homing in on that 25% of records without date data:
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| w/out date data | 533,607 records | 535,707 records |
| w/ version_path | 194,201 records (36%) | 160,437 records (30%) |
| w/out version_path | 339,406 records (64%) | 375,270 records (70%) |
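The percentages in the record-level split can be re-derived from the raw counts. The following is a minimal sketch of that arithmetic only; `split_share` is a hypothetical helper name, not part of the Rikolti codebase, and it assumes the shares are rounded to the nearest whole percent.

```python
def split_share(with_path: int, without_path: int) -> tuple[int, int]:
    """Return whole-number percentages of records with and without a version_path."""
    total = with_path + without_path
    if total == 0:
        raise ValueError("no records to split")
    return round(100 * with_path / total), round(100 * without_path / total)


# Record counts for rikolti-prd and rikolti-stg, taken from the figures above.
prd_with, prd_without = split_share(194_201, 339_406)
stg_with, stg_without = split_share(160_437, 375_270)
```

The same helper can be pointed at the collection-level counts to spot-check those shares as well.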
Described in collections:
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| w/out date data | 1168 collections | 1172 collections |
| w/ version_path | 439 collections (37%) | 412 collections (35%) |
| w/out version_path | 729 collections (62%) | 760 collections (65%) |
So we know which vernacular version was run through the pipeline and published for 37% of published collections missing date data and 35% of staged collections missing date data.
Homing in on that 62-65% of collections without version paths:
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| w/out version_path | 729 collections | 760 collections |
| w/ one vernacular version in s3 | 3 collections | 30 collections |
| w/ many vernacular versions in s3 | 726 collections | 720 collections |
So we can infer which vernacular version was run through the pipeline and published for 3 more published collections and 30 more staged collections because there is only one vernacular version stored in s3.
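The inference rule used here (a collection with exactly one vernacular version in s3 must have been published from that version) can be sketched as below. This is an illustration under stated assumptions, not the actual analysis script: `infer_version_path`, the collection ids, and the s3 prefixes are all hypothetical, and listing the versions from s3 is assumed to have happened already.

```python
from typing import Optional


def infer_version_path(vernacular_versions: list[str]) -> Optional[str]:
    """Return the only stored vernacular version, or None when the choice is ambiguous."""
    if len(vernacular_versions) == 1:
        return vernacular_versions[0]
    # Zero versions or several versions: nothing can be inferred.
    return None


# Hypothetical input: collection id -> vernacular version prefixes found in s3.
versions_by_collection = {
    "101": ["101/vernacular_metadata_v1/"],
    "102": ["102/vernacular_metadata_v1/", "102/vernacular_metadata_v2/"],
}

# Keep only the collections whose version path can be inferred.
inferred = {
    cid: path
    for cid, versions in versions_by_collection.items()
    if (path := infer_version_path(versions)) is not None
}
```

Collections with several stored versions fall through to `None`, which corresponds to the "many vernacular versions in s3" rows, where another signal would be needed to pick a version.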
Nuxeo Analysis
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| total | 2,137,124 records | 2,146,265 records |
| Nuxeo w/out date data | 124,089 records (6%) | 124,213 records (6%) |
Homing in on that 6% of records that come from Nuxeo and have no date data:
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| Nuxeo w/out date data | 124,089 records | 124,213 records |
| Nuxeo w/ version_path | 2,335 records | 1,921 records |
| Nuxeo w/out version_path | 121,754 records | 122,292 records |
Described in collections:
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| Nuxeo w/out date data | 303 collections | 306 collections |
| Nuxeo w/ version_path | 13 collections | 14 collections |
| Nuxeo w/out version_path | 290 collections | 292 collections |
Homing in on those 290 Nuxeo collections without version paths:
| | rikolti-prd | rikolti-stg |
| --- | --- | --- |
| Nuxeo w/out version_path | 290 collections | 292 collections |
| Nuxeo w/ one vernacular version in s3 | 0 collections | |
| Nuxeo w/ many vernacular versions in s3 | 290 collections | |
So we can't infer a version path for any of the 290 collections without version paths.