
support length 1 arrays and relative metrics #11

Merged — 5 commits merged into main on Jan 9, 2025
Conversation

@elray1 (Contributor) commented Jan 8, 2025

Two distinct sets of updates here:

  • update the schema to support length 1 arrays. I'm not sure whether this is core YAML behavior or an artifact of the R packages we're using to read and validate the data structures, but to get validations to pass for arrays of strings with length 1, the schema needs to say that valid entries may be either strings or arrays of strings. Added this to the schema along with some test cases.
  • support computation of relative metrics:
    • update schema to accept relative_metrics and baseline as properties of target entries
    • add appropriate validations of those things and tests for them
    • if relative metrics are specified, compute them, and add tests for that
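A rough sketch of what both changes could look like as a JSON schema fragment (property names follow the PR description; the exact shape of the real hub schema may differ):

```json
{
  "type": "object",
  "properties": {
    "metrics": {
      "oneOf": [
        { "type": "string" },
        { "type": "array", "items": { "type": "string" } }
      ]
    },
    "relative_metrics": {
      "oneOf": [
        { "type": "string" },
        { "type": "array", "items": { "type": "string" } }
      ]
    },
    "baseline": { "type": "string" }
  }
}
```

The `oneOf` keyword is what lets a bare string stand in for a length-1 array, so configs that were collapsed to scalars on the R side still validate.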

@elray1 (Contributor, Author) commented:

The bulk of this code was previously in test-generate_eval_data.R. It has been moved to this separate helper-*.R file, with some minor updates around the include_rel parameter.

@zkamvar (Member) commented:
Per my suggestion in hubverse-org/hubEvals#69 (review), I would recommend saving these outputs as test fixtures instead of generating them when the tests are run. From the looks of it, you only need to generate the tables for the relative metrics and subset them to the columns without relative metrics.
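The fixture approach suggested here can be sketched roughly as follows (file and helper names are hypothetical, not this repo's actual code):

```r
# One-time generation script, e.g. tests/testthat/testdata/create_fixtures.R,
# run manually rather than during the test suite:
exp_scores_rel <- generate_exp_scores(config_with_rel_metrics)  # hypothetical helper
saveRDS(exp_scores_rel, testthat::test_path("testdata", "exp_scores_rel.rds"))

# In the tests, load the saved fixture instead of recomputing it, and derive
# the no-relative-metrics table by dropping the relative-metric columns:
exp_scores_rel <- readRDS(testthat::test_path("testdata", "exp_scores_rel.rds"))
exp_scores_abs <- exp_scores_rel[!grepl("^rel_", names(exp_scores_rel))]
```

The payoff is that the expected values are frozen on disk, so a regression in the scoring code shows up as a test failure rather than silently regenerating matching "expected" output.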

@elray1 (Contributor, Author) commented Jan 8, 2025:

Unfortunately the git diff for this file is useless; diff got confused. For this test file, there are three changes:

  1. Deleted the check_exp_scores_for_window function (it moved to a separate R script).
  2. Added ", no relative metrics" to the description of the first test case; no changes to the test itself.
  3. Added a second test case with relative metrics. This is a near-duplicate of the first test case, but it refers to a config file that includes relative metrics and asks check_exp_scores_for_window to include relative metric results.

@zkamvar zkamvar self-requested a review January 8, 2025 15:45
@zkamvar (Member) left a comment:

This looks okay to me with the exception of the test helper, as noted in my comment below.

Regarding this comment:

> update the schema to support length 1 arrays. I'm not sure if this is a core thing about YAML or just some behavior in the R packages we're using to read and validate the data structures -- but it seems like to get validations to pass in the case of arrays of strings with length 1, we need to say that valid entries may be either strings or arrays of strings. Added this to the schema and some test cases.

This is partially a symptom of jsonlite and partially a symptom of the yaml package. Both have their weird idiosyncrasies and something always gets lost in translation. I think your solution is a good workaround.
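The lost-in-translation behavior can be seen in a small round trip (a sketch using the yaml and jsonlite packages):

```r
library(yaml)
library(jsonlite)

# In R, a length-1 YAML sequence and a YAML scalar read back identically:
yaml.load("metrics: [wis]")$metrics  # a length-1 character vector
yaml.load("metrics: wis")$metrics    # also a length-1 character vector

# So on the way back out, a serializer that unboxes length-1 vectors emits
# a string where a strict schema expected an array:
toJSON(list(metrics = "wis"), auto_unbox = TRUE)  # {"metrics":"wis"}
```

Since the R object carries no record of whether the YAML source was a scalar or a one-element sequence, accepting both forms in the schema is the pragmatic fix.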


@elray1 elray1 requested a review from zkamvar January 8, 2025 22:07
@zkamvar previously approved these changes Jan 8, 2025
@zkamvar (Member) left a comment:

Thank you! I have one minor suggestion to remove calls to library(), but otherwise this looks good!

tests/testthat/testdata/create_exp_score_fixtures.R (outdated, resolved)
@elray1 (Contributor, Author) commented Jan 9, 2025:

I tried removing the library() calls, but just using rlang::.data didn't work immediately. My take is that this is good enough at this point and it's likely not worth our time to sort out removing that last library(rlang) call.

@zkamvar (Member) commented Jan 9, 2025:

> I tried removing the library() calls, but just using rlang::.data didn't work immediately. My take is that this is good enough at this point and it's likely not worth our time to sort out removing that last library(rlang) call.

Yeah, that's a leaky abstraction and definitely not worth it. I'm a little surprised that .data is coming from rlang, since it's a dplyr idiom.
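For reference, the usual package-code idiom for the `.data` pronoun looks roughly like this (a sketch with hypothetical function and column names, not this repo's actual fix):

```r
# Inside a package, import the pronoun once via roxygen instead of library(rlang):
#' @importFrom rlang .data
keep_baseline <- function(scores) {
  # .data$model makes the column reference explicit, which silences
  # "no visible binding for global variable" notes under R CMD check
  dplyr::filter(scores, .data$model == "baseline")
}
```

In standalone helper scripts there is no NAMESPACE to import into, which is why a lingering `library(rlang)` call there is a reasonable place to stop.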

@elray1 elray1 merged commit a7cff83 into main Jan 9, 2025
7 of 8 checks passed
@elray1 elray1 deleted the elr/rel_metrics branch January 9, 2025 15:24