Validation and hyperparameter tuning setup #16

Open · 4 tasks
cczhu opened this issue Nov 5, 2019 · 2 comments
Labels: countmatch (Pythonization of PRTCS into CountMatch module)

Comments

cczhu commented Nov 5, 2019

We need to determine whether our fit errors are similar to Arman's reported results and to those of Bagheri et al. 2013. We also want to investigate whether we can significantly relax the criteria for data to be included in validation, and whether we can isolate a portion of that data as a holdout set. We don't strictly need to restrict ourselves to permanent stations; any station with sufficient data across multiple years will do. A rough sketch of this selection follows.
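As an illustration only (the `station_counts` layout and every name below are assumptions, not actual CountMatch interfaces), the relaxed criterion plus holdout split could look something like:

```python
import random


def select_validation_stations(station_counts, min_years=3, holdout_frac=0.2,
                               seed=0):
    """Relax the permanent-station criterion and carve out a holdout set.

    `station_counts` is assumed to map station ID -> pandas DataFrame of
    daily counts indexed by a DatetimeIndex; this layout is hypothetical.
    """
    # Keep any station observed in at least `min_years` distinct years.
    eligible = [sid for sid, df in station_counts.items()
                if df.index.year.nunique() >= min_years]
    # Shuffle reproducibly, then peel off a fraction as the holdout set.
    rng = random.Random(seed)
    rng.shuffle(eligible)
    n_holdout = int(round(holdout_frac * len(eligible)))
    return eligible[n_holdout:], eligible[:n_holdout]  # (fit, holdout)
```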

Actual testing may take an extended amount of time (and will involve comparing our results to those of a Gaussian Process Regression), so the goal of this issue is to merge the preliminary CountMatch into master, then create a new sandbox branch ecosystem for validation testing and hyperparameter tuning.

  • Merge countmatch branch with master (don't delete countmatch).
  • Rebase sandbox onto master.
  • Set up a new notebook for validation testing. Create a CountMatch model mini-pipeline that lets us vary hyperparameters such as the number of neighbours considered or the minimum data requirements for a permanent count station (see the sketch after this list).
  • Perform preliminary experiments for validation.
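A minimal sketch of what that mini-pipeline could look like, assuming a hypothetical `run_countmatch` fitter and `mean_absolute_pct_error` metric (neither exists yet; the names, the `data` object, and the default hyperparameter values are all placeholders):

```python
from itertools import product


def run_experiment(data, n_neighbours=5, min_count_days=274):
    """Fit CountMatch under one hyperparameter setting and score it.

    `run_countmatch` and `mean_absolute_pct_error` are hypothetical
    stand-ins for the eventual CountMatch fitter and validation error
    metric, and `data` for whatever the pipeline's input object becomes.
    """
    predictions = run_countmatch(data, n_neighbours=n_neighbours,
                                 min_count_days=min_count_days)
    return mean_absolute_pct_error(predictions, data.holdout_aadt)


# Preliminary grid over the two hyperparameters named above.
grid = list(product([3, 5, 10], [180, 274, 365]))
results = {setting: run_experiment(data, *setting) for setting in grid}
```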
@cczhu cczhu added the countmatch Pythonization of PRTCS into CountMatch module label Nov 5, 2019
@cczhu cczhu added this to the CountMatch MVP milestone Nov 5, 2019
cczhu commented Dec 6, 2019

A decent amount of discussion about this is happening in #14 right now, and this may be implicitly solved once we create an MVP CountMatch fitter.

cczhu commented Dec 22, 2019

Old Charles is correct - this is now partly solved by the latest commits in #14, but even better - it's in function form rather than notebook form.

Preliminary results suggest that the annual growth factor is by far the most sensitive parameter governing the AADT predictions, so we'll have to think more about #26 before embarking on a full hyperparameter estimation journey.

(Hyperparameter estimation also takes many hours to run a single experiment. Since our work is embarrassingly parallel, we should consider multiprocessing solutions to speed it up. The simple thing to do is to run the hyperparameter tuning experiments in parallel, as sketched below; the more involved, and more lucrative, solution is to parallelize the CountMatch estimator itself. We may also finally want to spin up a cluster to do some of this work...

See #7)
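For reference, the "simple thing" could be as little as farming independent experiments out to worker processes, reusing the hypothetical `run_experiment` from the task-list sketch above (all names here remain assumptions):

```python
from functools import partial
from itertools import product
from multiprocessing import Pool


def tune_parallel(data, neighbour_grid=(3, 5, 10),
                  min_day_grid=(180, 274, 365), n_workers=4):
    """Run independent hyperparameter experiments in worker processes.

    Each experiment is independent of the others, so this is
    embarrassingly parallel. `run_experiment` is the hypothetical
    single-setting runner sketched earlier in this issue.
    """
    settings = list(product(neighbour_grid, min_day_grid))
    with Pool(n_workers) as pool:
        # starmap unpacks each (n_neighbours, min_count_days) pair as
        # positional arguments after the fixed `data` argument.
        errors = pool.starmap(partial(run_experiment, data), settings)
    return dict(zip(settings, errors))
```

Note that `run_experiment` and `data` would need to be picklable (defined at module level) for `multiprocessing` to ship them to worker processes.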

@cczhu cczhu removed this from the CountMatch MVP milestone Mar 15, 2020