Track regions that fields are valid over #2324

dschwoerer · 2021-05-18T13:12:43Z

Resolves #2295

I am not sure it is particularly useful for 2D, but at least the patches applied cleanly.

This could be extended to give a speedup in more cases: with region tracking, boundaries can be excluded when they are not needed, at the cost of more lookup maps (which only need to be computed once).

I implemented this only for Field3D because otherwise the 2D-3D look-ups are needed, and I only needed it for Field3D.

ZedThree

Please could you give a high-level overview of what this is intended to do, as well as how it works? I gather it's for making sure operations are done over consistent regions, but it looks like it does the operations over the intersection of regions. Do we want that behaviour over just checking the regions are consistent?

I think you also need to add docstrings on all the new functions, as well as some low-level comments. I found Mesh::getCommonRegion quite difficult to follow, for instance.

I'm also a bit worried about performance. There are now more map lookups for every arithmetic operator, even after the initial creation of the lookups.

Please could you add some before/after timing comparisons, at least for an optimised build?

ZedThree · 2021-05-18T14:11:40Z

include/bout/region.hxx

+    if (this->size() != other.size()) {
+      return false;
+    }
+    for (auto i1 = this->begin(), i2 = other.begin(); i1 != this->end(); ++i1, ++i2) {
+      if (i1 != i2) {
+        return false;
+      }
+    }
+    return true;


Suggested change

-    if (this->size() != other.size()) {
-      return false;
-    }
-    for (auto i1 = this->begin(), i2 = other.begin(); i1 != this->end(); ++i1, ++i2) {
-      if (i1 != i2) {
-        return false;
-      }
-    }
-    return true;
+    return std::equal(begin(), end(), other.begin());

Should probably be implemented as a free function rather than a member

std::equal in the 3 argument version is actually missing the size check:
https://en.cppreference.com/w/cpp/algorithm/equal

Good spot, the four argument version should do the trick
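
To illustrate the point about the size check, here is a minimal sketch that uses a plain `std::vector<std::size_t>` as a hypothetical stand-in for `Region`. The names `Indices` and `sameIndices` are made up for this example and are not part of BOUT++. The four-iterator overload of `std::equal` (available since C++14) returns false when the two ranges differ in length, so no separate `size()` comparison is needed:

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a region: an ordered list of flat indices.
using Indices = std::vector<std::size_t>;

// Free function, as suggested in the review. The four-iterator overload
// compares lengths as well as elements, so ranges of different size are
// never equal and `b` is never read past its end.
inline bool sameIndices(const Indices& a, const Indices& b) {
  return std::equal(a.begin(), a.end(), b.begin(), b.end());
}
```

With the three-iterator overload, comparing a longer range against a shorter one would read past the end of the shorter one, which is why that form needs the explicit size check in front of it.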


dschwoerer · 2021-05-18T18:05:02Z

> Please could you give a high-level overview of what this is intended to do, as well as how it works? I gather it's for making sure operations are done over consistent regions, but it looks like it does the operations over the intersection of regions. Do we want that behaviour over just checking the regions are consistent?

The idea is that each field can store the region over which it has valid data. A field initialised from a constant value is valid everywhere, while a derivative is only valid in the region without boundaries. If you multiply the constant field by a derivative, the result is still only valid in the region without boundaries, which is why the intersection is used.
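
A minimal sketch of this idea, not the actual `Field3D` implementation: `TrackedField`, `IndexSet` and `intersect` are hypothetical names, and a region is modelled as a sorted list of flat cell indices. Multiplying two fields only touches the cells that are valid in both operands, and the result records that intersection as its own valid region:

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <iterator>
#include <vector>

// Hypothetical stand-in for a region: a sorted list of flat cell indices.
using IndexSet = std::vector<std::size_t>;

// Toy field that remembers where its data is valid.
struct TrackedField {
  std::vector<double> data;
  IndexSet valid; // sorted indices of the cells holding valid data
};

// Cells valid in both operands; both inputs must be sorted.
inline IndexSet intersect(const IndexSet& a, const IndexSet& b) {
  IndexSet out;
  std::set_intersection(a.begin(), a.end(), b.begin(), b.end(),
                        std::back_inserter(out));
  return out;
}

// Multiply only where both inputs are valid; the result inherits that region.
inline TrackedField operator*(const TrackedField& lhs, const TrackedField& rhs) {
  assert(lhs.data.size() == rhs.data.size());
  TrackedField result{std::vector<double>(lhs.data.size(), 0.0),
                      intersect(lhs.valid, rhs.valid)};
  for (std::size_t i : result.valid) {
    result.data[i] = lhs.data[i] * rhs.data[i];
  }
  return result;
}
```

For a constant field valid on cells 0 to 4 and a derivative valid on cells 1 to 3, the product is computed on cells 1 to 3 only, and the boundary cells are skipped.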

> I think you also need to add docstrings on all the new functions, as well as some low-level comments. I found Mesh::getCommonRegion quite difficult to follow, for instance.

Sure, will do 👍

> I'm also a bit worried about performance. There are now more map lookups for every arithmetic operator, even after the initial creation of the lookups.

Thinking about it again, the map can be avoided, and a flattened "2D" array could be used instead.

> Please could you add some before/after timing comparisons, at least for an optimised build?

I guess that would depend on the size of the meshes. It should be particularly bad for a small mesh. I tried the example/performance/arithmetic for benchmarking, and am a bit confused about the result (all with --enable-optimize=fast and no guard cells).

with map on a mesh with 2 * 2 * 16 with "RGN_NOZ" and "RGN_ALL" set:

| TIMING | minimum | mean | maximum |
|---|---|---|---|
| Fields | 0.536 us | 0.572 us | 11.163 us |
| C loop | 0.046 us | 0.053 us | 0.132 us |
| Templates | 0.201 us | 0.209 us | 0.681 us |
| Range For | 0.138 us | 0.163 us | 10.706 us |

map-free on a mesh with 2 * 2 * 16:

| TIMING | minimum | mean | maximum |
|---|---|---|---|
| Fields | 0.689 us | 0.721 us | 11.445 us |
| C loop | 0.026 us | 0.037 us | 0.235 us |
| Templates | 0.184 us | 0.200 us | 0.976 us |
| Range For | 0.127 us | 0.146 us | 0.403 us |

map-free on a mesh with 2 * 2 * 16 with "RGN_NOZ" and "RGN_ALL" set:

| TIMING | minimum | mean | maximum |
|---|---|---|---|
| Fields | 0.569 us | 0.587 us | 1.570 us |
| C loop | 0.047 us | 0.053 us | 0.178 us |
| Templates | 0.208 us | 0.218 us | 0.762 us |
| Range For | 0.141 us | 0.159 us | 0.401 us |

current next on a mesh with 2 * 2 * 16:

| TIMING | minimum | mean | maximum |
|---|---|---|---|
| Fields | 0.660 us | 0.767 us | 12.847 us |
| C loop | 0.027 us | 0.040 us | 0.160 us |
| Templates | 0.180 us | 0.198 us | 0.619 us |
| Range For | 0.130 us | 0.144 us | 0.562 us |

ZedThree · 2021-05-19T13:18:08Z

Oh that arithmetic performance example is maybe a little out of date now. Timings for a full model might be more instructive, like blob2d or elm-pb.

dschwoerer · 2021-06-11T17:51:57Z

I tried the blob2d model, but the variation is too large to see any differences.

I am, however, thinking of extending this so that, after taking a derivative, the region of the result is set to the region it is valid for. That would also require some code to extend the region again, e.g. after communication or applying boundary conditions. However, that would be incompatible with some models, as BCs implemented in different ways would not be captured automatically.

That could allow better scaling towards small grids, as the halo would not always be included in arithmetic operations, but it would require more work, so I am not sure it is worth it ...


bendudson · 2023-12-30T19:39:54Z

Thanks @dschwoerer ! I think this looks pretty good, but have a couple of questions:

1. Is this something that should always be done, or is it a correctness check that could be disabled if `CHECK == 0` for example?
2. If a user performs an operation "by hand", e.g. outerloop calculations, what happens if they don't set the valid region?

Happy new year!

dschwoerer · 2024-01-02T10:34:14Z

> Thanks @dschwoerer ! I think this looks pretty good, but have a couple of questions:

I am happy to also add this to docs, but I am not sure where.

> 1. Is this something that should always be done, or is it a correctness check that could be disabled if `CHECK == 0` for example?

I think it should always be done. Right now setRegion() is not called in a lot of places, but where it is called, it can result in performance improvements, as e.g. boundary regions can be skipped in the computation.

> 2. If a user performs an operation "by hand", e.g. outerloop calculations, what happens if they don't set the valid region?

As long as setRegion() is not called, nothing changes.

It is not called in a lot of cases, as I was not sure how to handle users writing data. For example, the derivatives could call setRegion() with the region over which they have computed the field, but then if the user implements some custom boundary function, the region will not be updated, and things might break.

Maybe in the future we can add this as an optional feature: calling setRegion() in the derivatives, and extending that region appropriately in the built-in boundaries. For outerloop it should be fairly straightforward, as the user does everything, and can ensure that if setRegion() is used, it is used correctly.

> Happy new year!

Happy new year and thanks for the questions :-)

ZedThree · 2024-01-03T17:12:01Z

src/mesh/mesh.cxx

+    BOUT_OMP(critical(mesh_intersection_realloc))
+#if BOUT_USE_OPENMP
+    if (region3Dintersect.size() <= pos)
+#endif


Is there another way to write this without the #ifdef guards and repeated conditionals? Maybe BOUT_OMP(single)? If not, then I think this needs a good comment to ensure it doesn't get accidentally removed

I added comments. Please let me know if there is still anything unclear.

bendudson · 2024-01-03T17:20:28Z

include/bout/field2d.hxx

@@ -174,6 +174,9 @@ public:
  /// Return a Region<Ind2D> reference to use to iterate over this field
  const Region<Ind2D>& getRegion(REGION region) const;
  const Region<Ind2D>& getRegion(const std::string& region_name) const;
+  const Region<Ind2D>& getDefaultRegion(const std::string& region_name) const {


Bikeshedding perhaps, but I think the name getDefaultRegion could be confusing: It's getting the field region, using the given region name as the default, rather than getting the default region. Perhaps getRegionWithDefault?

Since getRegion gets the region with the given name or ID, perhaps getValidRegionWithDefault is more explicit that it's the valid region that's being requested. Too long?

I changed it to getValidRegionWithDefault - that is probably easier to understand. The length seems ok to me 👍
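
The naming discussion can be made concrete with a small sketch. This is not the BOUT++ implementation: `ToyField` is a hypothetical class, and regions are represented by their names only, whereas the real method returns a `Region` reference. It assumes the valid region is stored as an optional value that stays empty until `setRegion()` is called, in which case the caller-supplied default is used:

```cpp
#include <cassert>
#include <optional>
#include <string>

// Toy field that may or may not have a valid region recorded.
class ToyField {
public:
  // Record the region this field holds valid data over.
  void setRegion(const std::string& name) { valid_region = name; }

  // Return the recorded valid region, or `fallback` if none was set.
  std::string getValidRegionWithDefault(const std::string& fallback) const {
    return valid_region.value_or(fallback);
  }

private:
  std::optional<std::string> valid_region; // empty until setRegion() is called
};
```

A field that never had `setRegion()` called returns whatever default the caller passes, for example "RGN_ALL", which matches the statement above that nothing changes as long as `setRegion()` is not called.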

ZedThree · 2024-01-03T17:29:45Z

src/mesh/mesh.cxx

+  /* Memory layout of indices
+   * left is lower index, bottom is higher index
+   *    0  1  2  3
+   * 0
+   * 1  0
+   * 2  1  2
+   * 3  3  4  5
+   * 4  6  7  8  9
+   *
+   * As we only need half of the square, the indices do not depend on
+   * the total number of elements.


This explanation could do with expanding. I think I now understand what's going on, but this diagram is definitely confusing by itself

I tried to explain it, but apparently failed to do so.
Would you mind adding a comment, as you understand it now? If not, I can try again, but I am not quite sure how to do it better :-(

This function finds the ID of the region corresponding to the intersection of two regions, and caches the result. The cache is a vector, indexed by some function of the two input IDs. Because the intersection of two regions doesn't depend on the order, and the intersection of a region with itself is the identity operation, we can order the IDs numerically and use a generalised triangle number: $[n (n - 1) / 2] + m$ to construct the cache index. This diagram shows the result for the first few numbers.

These indices might be sparse, but presumably we don't expect to store very many intersections so this shouldn't give much overhead.

After calculating the cache index, we look it up in the cache (possibly reallocating to ensure it's large enough). If the index is in the cache, we can just return it as-is, otherwise we need to do a bit more work.

First, we need to fully compute the intersection of the two regions. We then check if this corresponds to an existing region. If so, we cache the ID of that region and return it. Otherwise, we need to store this new region in region3D -- the index in this vector is the ID we need to cache and return here.
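
The indexing scheme described above can be sketched as follows. This is an illustration under stated assumptions, not the code in `Mesh::getCommonRegion`: `pairIndex` and `IntersectionCache` are hypothetical names, and the cache stores `std::optional<std::size_t>` so that an empty slot is distinguishable from a stored ID:

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <optional>
#include <vector>

// Cache slot for an unordered pair of distinct region IDs: with n the larger
// and m the smaller ID, the slot is n * (n - 1) / 2 + m.
inline std::size_t pairIndex(std::size_t id1, std::size_t id2) {
  assert(id1 != id2); // a region intersected with itself needs no lookup
  const std::size_t n = std::max(id1, id2);
  const std::size_t m = std::min(id1, id2);
  return n * (n - 1) / 2 + m;
}

// Hypothetical cache: one optional intersection ID per pair slot.
class IntersectionCache {
public:
  // Return the cached intersection ID for (id1, id2), if one is stored.
  std::optional<std::size_t> lookup(std::size_t id1, std::size_t id2) const {
    const std::size_t pos = pairIndex(id1, id2);
    if (pos >= slots.size()) {
      return std::nullopt;
    }
    return slots[pos];
  }

  // Store the intersection ID, growing the cache if the slot is out of range.
  void store(std::size_t id1, std::size_t id2, std::size_t result) {
    const std::size_t pos = pairIndex(id1, id2);
    if (pos >= slots.size()) {
      slots.resize(pos + 1); // newly created slots start empty
    }
    slots[pos] = result;
  }

private:
  std::vector<std::optional<std::size_t>> slots;
};
```

Because the slot only depends on the two IDs, adding more regions later never moves an existing entry. The pair (4, 3) maps to slot 9 in either argument order, matching the bottom-right entry of the diagram.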


github-actions

clang-tidy made some suggestions

include/bout/field3d.hxx

include/bout/mesh.hxx

src/mesh/mesh.cxx

dschwoerer · 2024-01-05T14:44:10Z

The failure seems to be related to the fact that the --cflags do not include -std=c++17

I am not sure to what extent that was already an issue in the past while compiling against bout++, but it certainly is now required if you include mesh.hxx or field3d.hxx

dschwoerer · 2024-01-05T15:30:40Z

36f152d might be worth backporting to master?

ZedThree · 2024-01-12T13:58:34Z

CMakeLists.txt

@@ -468,7 +468,7 @@ set_target_properties(bout++ PROPERTIES
 # Set some variables for the bout-config script
 set(CONFIG_LDFLAGS "${CONFIG_LDFLAGS} -L\$BOUT_LIB_PATH -lbout++")
 set(BOUT_INCLUDE_PATH "${CMAKE_CURRENT_SOURCE_DIR}/include")
-set(CONFIG_CFLAGS "${CONFIG_CFLAGS} -I\${BOUT_INCLUDE_PATH} -I${CMAKE_CURRENT_BINARY_DIR}/include ${CMAKE_CXX_FLAGS}")
+set(CONFIG_CFLAGS "${CONFIG_CFLAGS} -I\${BOUT_INCLUDE_PATH} -I${CMAKE_CURRENT_BINARY_DIR}/include ${CMAKE_CXX_FLAGS} -std=c++17")


We might need to revisit this as this is compiler-dependent

👍 I am not sure how widely that is used, so I would wait and see whether anybody ever complains ...


ZedThree · 2024-01-12T14:26:21Z

src/mesh/interpolation/monotonic_hermite_spline_xz.cxx

@@ -24,7 +24,7 @@
 #include "bout/index_derivs_interface.hxx"
 #include "bout/interpolation_xz.hxx"
 #include "bout/mesh.hxx"
-#include "bout/output.hxx"
+//#include "bout/output.hxx"


Suggested change

-//#include "bout/output.hxx"


dschwoerer added 6 commits May 18, 2021 14:28

Better comment

04a7f49

Add basic region tracking

8041190

Ignore test artifacts

07b9606

Add region ypar

413da69

Set region for parallel slices

e392dea

Check within valid region, if set

a571465

ZedThree reviewed May 18, 2021


dschwoerer added 2 commits May 18, 2021 19:57

Avoid maps in fieldops

7b248a7

Direct indexing should be faster

Add nicer formatting to output

68c2d01

Format output as markdown table and print time in microseconds.

ZedThree changed the title ~~Track region~~ Track regions fields are valid over May 19, 2021

ZedThree changed the title ~~Track regions fields are valid over~~ Track regions that fields are valid over May 19, 2021

dschwoerer and others added 12 commits September 27, 2021 09:33

Merge branch 'next' into track-region

983d95d

Switch toward regions

21c6459

Simplify expression

4a1259e

Assume periodicity in z

99fe484

Switch to regions for FCI regions

a61a000

Recommendations from clang-tidy

cd4a1d2

Merge branch 'next' into track-region

beab98a

Merge remote-tracking branch 'origin/track-region' into track-region-…

a50c409

…region

Remove unsed variable

23e1fc5

BugFix: Fix separation of inner and outer boundary

b5e9eca

ensure getCommonRegion is thread safe

a887968

[skip ci] Apply black changes

6b3e4c1

dschwoerer mentioned this pull request Mar 11, 2022

FCI: yup and y+1 #2526

Open

dschwoerer and others added 3 commits September 17, 2022 23:28

Merge pull request #2439 from boutproject/track-region-region

7e1ca9b

Switch (partially) to regions

Merge branch 'next' of github.com:boutproject/BOUT-dev into track-region

b4baab1

Apply clang-format changes

d3150d0

dschwoerer mentioned this pull request Dec 20, 2023

Work around pip changes #2824

Merged

ZedThree reviewed Jan 3, 2024


bendudson reviewed Jan 3, 2024


ZedThree reviewed Jan 3, 2024


dschwoerer and others added 7 commits January 5, 2024 11:13

rename getDefaultRegion to getValidRegionWithDefault

bf3f64f

Remove python2 compatibility

3c50728

Fix escaping

0c5d41c

Add comments on OpenMP + mutex

121a4ab

Add more OpenMP comments

242a6f4

use std::optional<size_t>

b8be3fe

Should make the intent much cleaner, and also raises a nice exception if a value is used, without being present.

Apply clang-format changes

433f92b

github-actions bot reviewed Jan 5, 2024


include/bout/field3d.hxx

include/bout/field3d.hxx

include/bout/mesh.hxx

include/bout/mesh.hxx

src/mesh/mesh.cxx

Merge branch 'next' of https://github.com/boutproject/BOUT-dev into HEAD

09c4a5a

dschwoerer added 2 commits January 5, 2024 15:46

Explicitly set -std=c++17 for --cflags

36f152d

Be more verbose by default

d4dab6c

ZedThree reviewed Jan 12, 2024


include/bout/region.hxx (outdated)

prefer std::equal

a127d2b

Co-authored-by: Peter Hill

ZedThree reviewed Jan 12, 2024


ZedThree and others added 4 commits January 16, 2024 12:44

Improve comment

204b118

Co-authored-by: David Bold

Remove commented out code

1de0d2a

Merge branch 'next' into track-region

4c204fe

Apply clang-format changes

311fd5e

ZedThree approved these changes Jan 18, 2024


ZedThree merged commit bffb973 into next Jan 18, 2024
1 check passed

ZedThree deleted the track-region branch January 18, 2024 16:27
