transition from `add_tailor(prop)` and `method` to `fit.workflow(calibration)` #262

simonpcouch · 2024-09-27T16:43:19Z

Closes #255, closes #254, closes #252, closes #233, and related to this gist.

This PR removes the add_tailor(prop) and method arguments in favor of an argument fit.workflow(calibration) giving the data to fit the calibrator on.

Benefits:

The workflow now truly has enough information to fit without leaking data (method wasn't actually enough)
add_tailor(method) really wasn't truly independent of the data/resampling scheme
workflows does not need to "know about" rsample

Thanks @hfrick for this suggestion—this feels much better.

This PR should be much easier to review commit-by-commit than altogether.

TODOs from here: update tune and (possibly?) rename .should_inner_split() to something like .workflow_needs_calibration().

workflows will no longer take an `add_tailor(prop)` or `add_tailor(method)` argument, instead taking a `fit.workflow(calibration)` argument that supersedes both of them. first, remove machinery that relates specifically to those arguments.

…bration)` * removes `add_tailor(prop)` and `add_tailor(method)` * adds `fit.workflow(calibration)` * various documentation updates

R/fit.R

tests/testthat/test-post-action-tailor.R

hfrick

This is such a nice progression! I've left a few questions but overall I mainly wanted to say that your very structured whittling down of the discussion around this in the gist is what made my suggestion so easy. 🙇‍♀️ 🙌

R/fit.R

hfrick · 2024-09-30T13:03:10Z

R/fit.R

 #'
 #' @param ... Not used
 #'
+#' @param calibration A data frame of predictors and outcomes to use when


I'm wondering if we should adapt the name slightly to make it more obvious that this the data for calibration, rather than, say, the method. data_calibration, calibration_data, calibration_set?

Sure! I don't have strong preferences between any of these options, but agree that calibration by itself could be ambiguous.

R/fit.R

R/post-action-tailor.R

tests/testthat/_snaps/fit.md

tests/testthat/test-post-action-tailor.R

R/fit.R

hfrick · 2024-09-30T13:59:40Z

R/post-action-tailor.R

@@ -59,7 +59,7 @@
 #' datasets, resulting in the preprocessor and model generating predictions on
 #' rows they've seen before. Similarly problematic situations could arise in the
 #' context of other resampling situations, like time-based splits.
-#' In general, use the [rsample::inner_split()] function to prevent data
+#' In general, use the `rsample::inner_split()` function to prevent data


Same comment about inner_split() being "internal" or not

* edits to `validate_has_calibration()`: * refer to "The workflow" rather than `caller_arg()` * Warn rather than error on unneeded calibration set * All arguments on one line in function signature * comment on expectation re: predictions * doc tweaks

* `.should_inner_split()` -> `.workflow_includes_calibration()` * refactor conditional in `fit.workflow()` into two

github-actions · 2024-10-16T01:40:13Z

This pull request has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.

simonpcouch added 3 commits September 27, 2024 11:03

remove make_inner_split() and its tests

73cf3e4

workflows will no longer take an `add_tailor(prop)` or `add_tailor(method)` argument, instead taking a `fit.workflow(calibration)` argument that supersedes both of them. first, remove machinery that relates specifically to those arguments.

transition from add_tailor(prop) and method to `fit.workflow(cali…

d7a9797

…bration)` * removes `add_tailor(prop)` and `add_tailor(method)` * adds `fit.workflow(calibration)` * various documentation updates

remove rsample Suggests

47eda7b

simonpcouch commented Sep 27, 2024

View reviewed changes

R/fit.R Outdated Show resolved Hide resolved

tests/testthat/test-post-action-tailor.R Show resolved Hide resolved

simonpcouch requested a review from hfrick September 27, 2024 16:47

hfrick approved these changes Sep 30, 2024

View reviewed changes

hfrick mentioned this pull request Sep 30, 2024

Rename inner_split()? tidymodels/rsample#553

Open

simonpcouch added 2 commits September 30, 2024 09:44

apply suggestions from review

0a38a86

* edits to `validate_has_calibration()`: * refer to "The workflow" rather than `caller_arg()` * Warn rather than error on unneeded calibration set * All arguments on one line in function signature * comment on expectation re: predictions * doc tweaks

rephrase "inner split"

b28a6c4

* `.should_inner_split()` -> `.workflow_includes_calibration()` * refactor conditional in `fit.workflow()` into two

This was referenced Sep 30, 2024

rename fit.workflow(calibration) #263

Open

transition from add_tailor(prop) and method tidymodels/tune#945

Merged

simonpcouch merged commit 7929511 into main Oct 1, 2024
11 checks passed

simonpcouch deleted the calibration-set-arg branch October 1, 2024 12:51

github-actions bot locked and limited conversation to collaborators Oct 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transition from `add_tailor(prop)` and `method` to `fit.workflow(calibration)` #262

transition from `add_tailor(prop)` and `method` to `fit.workflow(calibration)` #262

simonpcouch commented Sep 27, 2024

hfrick left a comment

hfrick Sep 30, 2024

simonpcouch Sep 30, 2024

hfrick Sep 30, 2024

github-actions bot commented Oct 16, 2024

transition from add_tailor(prop) and method to fit.workflow(calibration) #262

transition from add_tailor(prop) and method to fit.workflow(calibration) #262

Conversation

simonpcouch commented Sep 27, 2024

hfrick left a comment

Choose a reason for hiding this comment

hfrick Sep 30, 2024

Choose a reason for hiding this comment

simonpcouch Sep 30, 2024

Choose a reason for hiding this comment

hfrick Sep 30, 2024

Choose a reason for hiding this comment

github-actions bot commented Oct 16, 2024

transition from `add_tailor(prop)` and `method` to `fit.workflow(calibration)` #262

transition from `add_tailor(prop)` and `method` to `fit.workflow(calibration)` #262