Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Check that Polaris app_run can use binding settings properly #346

Open
cms21 opened this issue Apr 27, 2023 · 2 comments
Open

Check that Polaris app_run can use binding settings properly #346

cms21 opened this issue Apr 27, 2023 · 2 comments
Assignees

Comments

@cms21
Copy link
Contributor

cms21 commented Apr 27, 2023

Check and fix if necessary that the Polaris app_run can implement the binding settings described in the ALCF docs:
https://docs.alcf.anl.gov/polaris/running-jobs/#running-multiple-mpi-applications-on-a-node

@cms21 cms21 self-assigned this Apr 28, 2023
@cms21
Copy link
Contributor Author

cms21 commented Apr 28, 2023

Strategy: use the setting of CUDA_VISIBLE_DEVICES to set the cpu_bind option. This will be the default when GPUs are used.

@cms21
Copy link
Contributor Author

cms21 commented May 24, 2023

A few notes:

  • We should use self._node_spec.cpu_ids[0] to make the assignment in order to be consistent with the node spec
  • The number of cpus for Polaris is set incorrectly to 64 in the compute node for the platform, fix this
  • We should create a tester for _build_cmdline

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant