Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update 2025 input data #66

Merged
merged 4 commits into from
Dec 16, 2024

Conversation

dfsnow
Copy link
Member

@dfsnow dfsnow commented Dec 16, 2024

A few updates here to reflect the changes from ccao-data/model-res-avm#283, as well as catch the condo model up on the unload = TRUE option of noctua.

Comment on lines +22 to +24
# Adds arrow support to speed up ingest process
noctua_options(unload = TRUE)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We hadn't yet switched to using the unload = TRUE method for noctua, meaning array columns were still being pulled in as comma-separated strings. Most of the updates in this file are to implement the same array handling used in the res model.

"RemoteUsername": "DyfanJones",
"RemoteRepo": "noctua",
"RemoteRef": "master",
"RemoteSha": "23a4cfbf537407c7a1547fc13ba771ba2eb098e0",
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

noctua didn't actually get bumped (in #62) to the version with the unload fix, so this PR does that as well as bumping/adding a few other dependencies.

@dfsnow dfsnow force-pushed the dfsnow/update-2025-test-input-data branch from 14ce8e2 to 7e48669 Compare December 16, 2024 18:54
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will be our 2024 data setup until we get 2024 sales and data sorted out. I'll dvc push once this is merged.

@dfsnow dfsnow marked this pull request as ready for review December 16, 2024 21:02
Copy link
Contributor

@jeancochrane jeancochrane left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for picking this up! One small question but it's not serious.

Comment on lines +5 to +9
deps:
- path: pipeline/00-ingest.R
hash: md5
md5: 29292ee2bef109914c423c9259aa8879
size: 22847
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Question, non-blocking] I haven't seen us use this pattern before, what's the goal of making the script a dependency?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's used in both model repos now, with the thinking being that the pipeline should be restarted if the script changes. I'm not super tied to this setup though if you don't like it. I agree it's a bit weird.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think that makes sense, thanks!

@dfsnow dfsnow merged commit 1a227fa into 2025-assessment-year Dec 16, 2024
7 of 8 checks passed
@dfsnow dfsnow deleted the dfsnow/update-2025-test-input-data branch December 16, 2024 22:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants