Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallel Processing Prototyping #7

Open
cczhu opened this issue Sep 20, 2019 · 0 comments
Open

Parallel Processing Prototyping #7

cczhu opened this issue Sep 20, 2019 · 0 comments
Assignees
Labels
enhancement New feature or request experimentation General experimentation and exploration

Comments

@cczhu
Copy link
Contributor

cczhu commented Sep 20, 2019

Much of TEPs-I is embarrassingly parallel, meaning it shouldn't take much effort to parallelize many jobs. Indeed, Arman does it in MATLAB with just a few flags at the start of his processes. This should be implemented in our code as well.

Technologies for parallelization:

  • Multiprocessing (also (here)[https://www.machinelearningplus.com/python/parallel-processing-python/])
  • joblib.parallel
  • Luigi and Airflow can do batch serialization, but I'd rather not use them - way too much overhead for our purposes, and may not play well with a system-level Luigi or Airflow server running all or jobs.

One concern: *do these packages break our ability to do parallelized hyperparameter tuning with a model expermentation package?

File I/O parallelization is discussed in #6 .

@cczhu cczhu self-assigned this Sep 20, 2019
@cczhu cczhu added enhancement New feature or request experimentation General experimentation and exploration labels Sep 20, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request experimentation General experimentation and exploration
Projects
None yet
Development

No branches or pull requests

1 participant