Feature/mpi #53
Conversation
This reduces the number of objects required to instantiate a model. Instead, a user can simply enable domain decomposition during mesh creation. At the moment, this option is provided only when reading in a mesh.
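As a rough sketch of the intended usage (the mesh type, method name, and argument below are hypothetical and may differ from the actual SELF interface), the decomposition is requested as part of the mesh read rather than through a separate decomposition object:

```fortran
! Hypothetical usage sketch: "Mesh2D", "Read_Mesh", and "enableDomainDecomposition"
! are illustrative names, not necessarily the exact SELF API.
type(Mesh2D) :: mesh

! Domain decomposition is enabled directly at mesh-read time; no separate
! decomposition object needs to be constructed by the user.
call mesh % Read_Mesh('mesh.h5', enableDomainDecomposition=.true.)
```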
There seems to be a problem with the
This also adds GPU-direct message passing for 2-D vectors.
There are a number of cleanup items we can do here. We will want to make note in #51 and in the v0.0.1 release notes that we have not explicitly tested meshes with flip=1 in 2-D and flip>=1 in 3-D, though these bits are coded up in a way we believe is correct.
We need to add documentation on how to build and verify OpenMPI with GPU awareness for ROCm and CUDA platforms in docs/Learning/dependencies.md. Additionally, we will want to add documentation for developers on how the CMake build system checks for GPU awareness, should this ever need to be updated or patched in the future.
Last, add examples for advection-diffusion-2d and advection-diffusion-3d, and include a short writeup for each in the documentation demonstrating how to run them in single-domain and multi-domain mode.
While putting in these tests, I was able to resolve a number of previously undetected errors with MPI read/write.
Some of the adjustments required for MPI have made it quite cumbersome for a user to add their own model. To remedy this, I'm going to finalize this PR with some adjustments to the backend routines that are provided for the
2-D and 3-D are coming next. Also updated the armory superci configuration file.
The null models provide coverage for the built-in template functions that are supposed to provide no flux divergence and no source. The null modules also provide a template for folks who want to build their own models using SELF.
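As a loose sketch of that idea (the type and procedure names below are hypothetical and do not reflect SELF's actual class hierarchy), the "null" defaults simply return zero flux and zero source, so a derived model only overrides the pieces it needs:

```fortran
! Hypothetical sketch: "model2d_base" and its bindings are illustrative names,
! not SELF's actual types. The point is that the default ("null") template
! procedures contribute no flux divergence and no source term.
module null_model_sketch
  implicit none
  private
  public :: model2d_base

  type :: model2d_base
  contains
    procedure :: flux   => null_flux
    procedure :: source => null_source
  end type model2d_base

contains

  pure function null_flux(this, s) result(f)
    class(model2d_base), intent(in) :: this
    real(8), intent(in) :: s   ! solution value at a quadrature point
    real(8) :: f(2)            ! 2-D flux vector
    f = 0.0d0                  ! zero flux => no flux divergence
  end function null_flux

  pure function null_source(this, s) result(q)
    class(model2d_base), intent(in) :: this
    real(8), intent(in) :: s
    real(8) :: q
    q = 0.0d0                  ! no source contribution
  end function null_source

end module null_model_sketch
```

A user-defined model would then extend the base type and override only the flux and/or source procedures, which is the template role the null models play.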
None of the comments affect functionality; approved and ready to merge whenever, if/when you want to address the cleanliness fixes.
Relates to Issue #51
This feature adds fully tested MPI implementations for scalars and vectors in 2-D and 3-D. The primary changes are in the SideExchange routines, where asynchronous (isend/irecv) point-to-point messaging is used to send boundary data and receive into extboundary data. The MPI+GPU implementations specifically assume that GPU-aware MPI is available, and GPU pointers are passed directly to the message-passing calls.

We have also added a parallel reduction for the entropy calculation (global grid integration) and cleaned up the model API to make adding new models straightforward. Specifically, users no longer need to be aware of loop ordering, MPI call requirements, or GPU acceleration requirements in order to implement a model quickly. Further optimization for GPU platforms is possible beyond the first model specification, but out-of-the-box performance is still quite reasonable.
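As a rough sketch of the messaging pattern described above (the buffer layout, neighbor bookkeeping, and tags are simplified and do not mirror SELF's internal SideExchange data structures), the exchange posts non-blocking receives into the external-boundary storage, matches them with non-blocking sends of the local boundary data, and then waits on all requests; with GPU-aware MPI the same calls are made with device-resident buffers:

```fortran
! Illustrative sketch only: array names and message bookkeeping are hypothetical.
subroutine side_exchange_sketch(boundary, extboundary, neighborRank, comm)
  use mpi_f08
  implicit none
  real(8), intent(in)        :: boundary(:,:)    ! packed local boundary data (one column per message)
  real(8), intent(inout)     :: extboundary(:,:) ! neighbor data is received here
  integer, intent(in)        :: neighborRank(:)  ! destination/source rank for each message
  type(MPI_Comm), intent(in) :: comm

  type(MPI_Request) :: requests(2*size(neighborRank))
  integer :: i

  do i = 1, size(neighborRank)
    ! Post the receive first, then the matching send; both are non-blocking,
    ! so every message is in flight before any completion is required.
    ! With GPU-aware MPI, boundary/extboundary may be device-resident buffers.
    call MPI_Irecv(extboundary(:,i), size(extboundary, 1), MPI_DOUBLE_PRECISION, &
                   neighborRank(i), i, comm, requests(2*i-1))
    call MPI_Isend(boundary(:,i), size(boundary, 1), MPI_DOUBLE_PRECISION, &
                   neighborRank(i), i, comm, requests(2*i))
  end do

  ! Communication-independent interior work can be overlapped here.
  call MPI_Waitall(size(requests), requests, MPI_STATUSES_IGNORE)
end subroutine side_exchange_sketch
```

The parallel reduction for the entropy calculation presumably follows the standard pattern of a local grid integral on each rank combined with MPI_Allreduce (MPI_SUM), so every rank ends up with the global value without an extra gather step.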