-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ingest BOR appeal data into lake #553
Conversation
@ccao-jardine this is in athena now if you want to take a look |
Co-authored-by: Dan Snow <[email protected]>
@@ -1,6 +1,3 @@ | |||
**Primary Key**: `reascd` | |||
{% enddocs %} | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nothing to do with this PR, just random extraneous code that needed to be removed.
@@ -0,0 +1,9 @@ | |||
# appeals | |||
|
|||
{% docs table_appeals %} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thought: Realizing that maybe we made a mistake by not prefixing these table names in the docs with their respective database, since some stuff like appeals
could definitely refer to multiple tables in the future. Fine for now, but something to fix in a later PR. Can you make an issue @wrridgeway?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Co-authored-by: Dan Snow <[email protected]>
Co-authored-by: Dan Snow <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks all set to me @wrridgeway!
ETL pipeline to take data from BOR appeals open data asset and load it into athena. Data is more-or-less unaltered. Glue crawler for the
ccbor
database was also added.See #548 for more details.
As of right now using the
RSocrata
package (sincejsonlite
can't handle the amount of data being gathered from the open data portal) which needs credentials. I've added them to.Renviron
, but that's a file we track with git in this repo. So we'll need to figure out a solution for this.