TADA_DataRetrieval updates for sf option, tribal options, big data options #566

mbrousil · 2025-01-28T20:53:16Z

Hi there!

This PR covers a few large changes to the data retrieval process based on conversations I have had with Cristina and Hillary:

Overview

Addition of {sf} methods to allow users to query WQP data using {sf} objects
Addition of options to allow tribal lands to be more directly queried using TADA_DataRetrieval
New function, TADA_TribalOptions, to assist users with identifying and querying tribal lands
Folding the processes in TADA_BigDataRetrieval into TADA_DataRetrieval and removing TADA_BigDataRetrieval to avoid confusion
Addition of a progress bar for large data pulls, a user prompt to confirm the download, silencing of {dataRetrieval} messages, error handling for HTTP errors, and a vignette update

Additional info

{sf} methods use the aoi_sf arg. The query first checks what data are available for the bbox of the {sf} object provided, then uses only the MonitoringLocationIdentifiers inside the {sf} object when running the full query
Tribal land queries use the tribal_area_type and tribe_name_parcel args and are handled alongside {sf} because they rely on EPA spatial data. Both tribal_area_type and tribe_name_parcel are required. {sf} and tribal args can't be used at the same time (this raises an error), and if geographic info like statecode is provided in addition to either {sf} or tribal args, a warning is returned
tribal_area_type refers to one of the EMEF/Tribal MapServer layers. tribe_name_parcel refers to either TRIBE_NAME or PARCEL_NO entries from that layer. The TADA_TribalOptions function is included to help users see TRIBE_NAME/PARCEL_NO options available to them and check punctuation, etc.
TADA_BigDataHelper is now used to handle "big" data requests within TADA_DataRetrieval. By default this is triggered with maxrecs = 250000 & maxsites = 300.
1. Two progress bars are included inside TADA_BigDataHelper
2. The ask_user function is used to confirm that the user wants to download the dataset after the number of records is determined
3. In general the messages from {dataRetrieval} are now silenced because they were returning a lot of information that was hiding (what we considered) more useful information from TADA_DataRetrieval. But we've made sure to include checks for HTTP errors, which will then be communicated back to the user
4. Additional info now in vignette 1 to explain the new {sf}, tribal, and big data functionality
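
To make the big data handling above more concrete, here is a minimal sketch of how sites might be grouped into batches that respect maxrecs and maxsites, with a progress bar over the batches. This is not the TADA_BigDataHelper code: chunk_sites and the per-site record counts are hypothetical, and the download step is only a placeholder.

```r
# Hypothetical helper (not part of the package): assign sites to batches so
# each batch stays within maxrecs records and maxsites sites. The defaults
# mirror the values mentioned in the PR description.
chunk_sites <- function(site_counts, maxrecs = 250000, maxsites = 300) {
  batch <- integer(length(site_counts))
  current <- 1L
  recs <- 0
  sites <- 0L
  for (i in seq_along(site_counts)) {
    n <- site_counts[[i]]
    # Start a new batch if adding this site would exceed either limit.
    # An empty batch always accepts the site, so a single oversized site
    # ends up in a batch of its own.
    if (sites > 0L && (recs + n > maxrecs || sites + 1L > maxsites)) {
      current <- current + 1L
      recs <- 0
      sites <- 0L
    }
    batch[i] <- current
    recs <- recs + n
    sites <- sites + 1L
  }
  split(names(site_counts), batch)
}

# Made-up record counts per monitoring location
site_counts <- c(A = 120000, B = 100000, C = 90000, D = 300000, E = 10)
batches <- chunk_sites(site_counts)

# Progress bar over the batches; the download itself is a placeholder
pb <- utils::txtProgressBar(min = 0, max = length(batches), style = 3)
for (i in seq_along(batches)) {
  # a real implementation would request data for batches[[i]] here
  utils::setTxtProgressBar(pb, i)
}
close(pb)
```

A greedy, order-preserving split like this is simple to reason about, though it does not try to balance batch sizes.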

A few notes:

I left NULL as the default for the aoi_sf argument instead of "null" because the character version didn't work properly
I had hoped to work on issues related to character length limits in queries, as discussed with Cristina, but ran out of time
From my tests it didn't seem like the way that data are indexed by calendar date affected query speed
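
For reviewers, the argument rules described under "Additional info", together with the NULL default for aoi_sf, can be outlined roughly as below. This is an illustrative sketch rather than the code in this PR: check_spatial_args, its return values, and the message wording are all hypothetical.

```r
# Hypothetical sketch of the argument rules; not the package implementation.
check_spatial_args <- function(aoi_sf = NULL,
                               tribal_area_type = NULL,
                               tribe_name_parcel = NULL,
                               statecode = NULL) {
  # With NULL defaults, is.null() is enough to tell what was supplied
  has_sf <- !is.null(aoi_sf)
  has_type <- !is.null(tribal_area_type)
  has_name <- !is.null(tribe_name_parcel)

  # Both tribal args are required together
  if (has_type != has_name) {
    stop("tribal_area_type and tribe_name_parcel must both be provided.")
  }
  has_tribal <- has_type && has_name

  # {sf} and tribal args cannot be combined
  if (has_sf && has_tribal) {
    stop("Use either aoi_sf or the tribal args, not both.")
  }

  # Other geographic filters alongside a spatial query only trigger a warning
  if ((has_sf || has_tribal) && !is.null(statecode)) {
    warning("statecode was provided in addition to a spatial query.")
  }

  if (has_sf) "sf" else if (has_tribal) "tribal" else "standard"
}

# Any non-NULL object stands in for an {sf} object in this sketch
check_spatial_args(aoi_sf = data.frame(x = 1))
check_spatial_args(tribal_area_type = "layer name", tribe_name_parcel = "tribe name")
```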

Please let me know if I can provide any other info on any of this! For example, I left out the results of my speed tests to avoid overwhelming amounts of info here. Thanks for your help.

Closes #361, closes #427, closes #345, closes #159

mbrousil and others added 30 commits August 23, 2024 17:08

Add sf and tribal query options to TADA_DataRetrieval

e089719

Add TADA_TribalOptions to R/GeospatialFunctions.R

b268ce8

Merge remote-tracking branch 'upstream/develop' into develop

a96064f

tribal options edits

49ddc83

TADA_DR rewrite

57e5a03

Helper function for large queries

7d9ac2a

Document bigdatahelper

2b06dbd

update geospatial funs

929dc37

warning -> message & add dplyr::

001c2eb

odds and ends

19d80a2

Add user prompts to data retrieval

a71a495

Add progress bar & change warnings to messages

0d55e74

Include progress bar internals

27038fb

Update DataDiscoveryRetrieval.R

02de221

Apply suggestions from code review

6e16a9b

Co-authored-by: Katie/Ryn Willi (she/her) <[email protected]>

Apply suggestions from code review

0c42ea5

Update DataDiscoveryRetrieval.R

21cf9c5

Merge pull request #7 from mbrousil/develop

87150ea

Add sf, big data to TADA_DataRetrieval

http errors

152e979

TADA_TribalOptions + vignette + logic

fd3019f

Vignette updates

a8dcb05

Update testing, build, etc.

d8fe5f3

Run styler::style_pkg()

a0b1b4d

Example housekeeping

8ab8857

Remove bigdataretrieval

dfd0b3c

Apply suggestions from code review

abe653f

Co-authored-by: B Steele <[email protected]>

Documentation and messaging

acab23e

Clarify tribe_name_parcel reqs

4b3ec7f

Redo tribal_area_type check

c3e40e3

Catch clips with no sites

f0de6b1

mbrousil and others added 10 commits January 27, 2025 13:08

More quietly

cd05dfd

tigris

820f762

Recheck/rebuild

77357be

Style

8e4b98a

Update R/DataDiscoveryRetrieval.R

fb1fc49

Co-authored-by: B Steele <[email protected]>

Apply suggestions from code review

1310736

Co-authored-by: B Steele <[email protected]>

Update DataDiscoveryRetrieval.R

7602fac

Merge pull request #8 from mbrousil/develop

1918de0

Prepare for submission

Merge remote-tracking branch 'upstream/develop' into develop

3493ee7

Merge remote-tracking branch 'upstream/develop' into develop

6297335

mbrousil marked this pull request as draft January 29, 2025 00:47

mbrousil marked this pull request as ready for review January 29, 2025 00:48
