Code cleanup and performance improvements for communication, allow/block and rename #1800

jkeiren · 2025-01-14T19:11:25Z

This pull request makes the following main changes:

- Extract the calculation of the following operators over summands from linearise.cpp into separate files:
  - communication
  - allow/block
  - rename
- Add tests for the extracted code.
- Minor code cleanup for allow/block and rename.
- Major rework of communication:
  - Added more documentation, relating it to Muck van Weerdenburg's note.
  - Cleaned up the code by using more modern C++.
  - Major performance improvement, using the trick explained below.

Major performance improvement

When calculating the communication operator over a complex multiaction, a new summand is generated for every combination of matching communication expressions. The reason is that open terms can appear as parameters of the multiactions: whether a communication actually succeeds depends on the valuation of the variables. When calculating the operators, we therefore do not yet know which communications will succeed, and we must account for every possibility. If n communication expressions match, this gives rise to 2^n multiactions.
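
To make the exponential growth concrete, here is a minimal sketch that is not taken from the mCRL2 code base. It assumes each of the n matching communication expressions is independently either applied or not applied; the function name `enumerate_choices` and the representation of a choice as a vector of booleans are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical illustration, not mCRL2 code: every subset of the n matching
// communication expressions corresponds to one candidate multiaction.
std::vector<std::vector<bool>> enumerate_choices(std::size_t n)
{
  assert(n < 20); // keep the illustration small; the result has 2^n entries
  std::vector<std::vector<bool>> result;
  const std::uint32_t count = std::uint32_t{1} << n;
  for (std::uint32_t mask = 0; mask < count; ++mask)
  {
    // Bit i of mask records whether communication expression i is applied.
    std::vector<bool> applied(n);
    for (std::size_t i = 0; i < n; ++i)
    {
      applied[i] = ((mask >> i) & 1u) != 0;
    }
    result.push_back(applied);
  }
  return result;
}
```

With three matching expressions this already yields eight candidates, and each additional match doubles the number.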

In practice, however, many of the generated multiactions are later filtered out because they are not allowed, or because they contain a blocked action. The implemented optimization is as follows:
Calculate the set of actions that appear in some multiaction in the allow set, and only add a multiaction to the result if all of its actions appear in this set. This may still generate multiactions that are later removed by the allow operator, but it already avoids a significant amount of work.
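
The filter described above can be sketched as follows. This is a simplified illustration, not the code in this pull request: multiactions are reduced to vectors of action names, and the names `multiaction`, `allowed_action_names` and `prune` are assumptions made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Simplification: a multiaction is represented by its action names only.
using multiaction = std::vector<std::string>;

// Collect every action name that occurs in some multiaction of the allow set.
std::set<std::string> allowed_action_names(const std::vector<multiaction>& allow_set)
{
  std::set<std::string> names;
  for (const multiaction& m : allow_set)
  {
    names.insert(m.begin(), m.end());
  }
  return names;
}

// Keep a candidate only if all of its actions occur in the allow set.
// This over-approximates: a kept candidate may still be removed by allow later.
std::vector<multiaction> prune(const std::vector<multiaction>& candidates,
                               const std::set<std::string>& names)
{
  std::vector<multiaction> result;
  for (const multiaction& m : candidates)
  {
    const bool all_allowed = std::all_of(m.begin(), m.end(),
        [&names](const std::string& a) { return names.count(a) > 0; });
    if (all_allowed)
    {
      result.push_back(m);
    }
  }
  assert(result.size() <= candidates.size());
  return result;
}
```

For an allow set containing `c` and `a|d`, a candidate `a|b` is dropped immediately because `b` occurs nowhere in the allow set, while a candidate `a` is kept even though the allow operator will remove it later.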

For concrete examples of mCRL2 models generated from Cordis models, this reduces the time for mcrl22lps --no-alpha from 1550s to 11s; for some summands, the number of multiactions generated is reduced from 10^6 to just 1.

TODO

Clean up the code in linearise_communication.h, and revert the use of some of the iterators in the interfaces, since they reduce readability.

Due to the use of temporary data, can_communicate and might_communicate are tightly coupled to comm_entry. This changeset makes this explicit by moving the functions into the comm_entry class as methods. This also nicely encapsulates the data contained in comm_entry.

When a single summand had many matches of communication expressions, all possible combinations of matches were first explored, and the corresponding condition and summand were generated for each. So, for a single summand that matches n communication expressions, 2^n possible multiactions were generated. In practice, however, many of these multiactions contain actions that are either blocked or not part of any multiaction in an allow set. I added two test cases that were very slow without this change, but that take hardly any time now. These cases are adapted from the mCRL2 translation of a Cordis model, for which mcrl22lps --no-alpha takes 1550s without the modification and terminates in 11s with it.


jkeiren added 30 commits January 2, 2025 15:38

Document comm_table, extract method.

4bb22ef

Eliminate rest_is_null.

16a30ae

This seems to be a remnant of ancient history, when rest was stored as a pointer. To eliminate it, we use the invariant rest[i].empty() == rest_is_null[i].

Improve readability of code

7efbe5e

while (a != (b = c)) { S } is rather ugly and hard to understand. Replace it with the equivalent: b = c; while (a != b) { S; b = c; }
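
The rewrite of the loop can be illustrated with a small, self-contained sketch. The `source` type and both function names are hypothetical stand-ins for the expressions a, b, c and the body S in the commit message; they do not appear in linearise.cpp.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the expression c: yields the next value of a
// sequence, and the sentinel once the sequence is exhausted.
struct source
{
  std::vector<int> values;
  std::size_t pos = 0;

  int next(int sentinel)
  {
    assert(pos <= values.size());
    return pos < values.size() ? values[pos++] : sentinel;
  }
};

// Original shape: the assignment is hidden inside the loop condition.
int sum_with_embedded_assignment(source s, int sentinel)
{
  int b = 0;
  int total = 0;
  while (sentinel != (b = s.next(sentinel)))
  {
    total += b; // plays the role of S
  }
  return total;
}

// Refactored shape: assign before the loop and again at the end of the body.
int sum_with_explicit_assignment(source s, int sentinel)
{
  int total = 0;
  int b = s.next(sentinel);
  while (sentinel != b)
  {
    total += b; // plays the role of S
    b = s.next(sentinel);
  }
  return total;
}
```

The two shapes behave the same as long as S contains no `continue`, since a `continue` would skip the trailing assignment in the refactored loop.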

Remove now useless while loop.

4d95980

The previous refactoring showed that the body of the loop is executed at most once, so the loop can be replaced by a simple if-then-else.

Remove superfluous assignment

893e167

If match_failed[i] is true, tmp[i] is never read again.

Extract method to reduce code duplication

9efd488

After refactoring, this code can be shared between can_communicate and might_communicate.

Fix regression introduced in 16a30ae

c1c4c34

rest[i].empty() iff rest_is_null[i] is not an invariant (we only know rest_is_null[i] implies rest[i].empty())

Small performance improvements

5f43161

- Make lhs, rhs in comm_entry constant; this was already the case, but the intent is now also clear in the code.
- Use references instead of copies in some places where appropriate.

Get rid of repeated allocation and deallocation of terms

0c9610f

Eliminate parameter r_is_null

72f78d7

r_is_null is true iff r.empty()

Cache can/might_communicate

3e862c1

Perform some obvious code cleanup.

3ff549a

Use iterators as argument

ce31d19

First step in refactoring this code. Using iterators should avoid a lot of copying and (re)allocation.

Pass iterators instead of lists.

d8811b4

Pass iterators into might_communicate

6983c21

Store iterator pairs instead of action_list in might_communicate

9993d64

Extract renaming from linearise.cpp

05b2a3b

Rename an individual summand

d7f2ae6

Move allow/block to separate file

9f5d355

Add missing namespaces

779cb20

Simplify insert and fix regression test

d27b4ce

Extract calculation of the communication operator.

909117b

Remove accidentally committed file

bb03949

Update .gitignore

42711e7

Ignore IDE files for Jetbrains

Swap arguments of occursinterm

91dc910

This now allows removing occursinterm in favour of search_free_variable.

Extract method to improve readability

51e3b0e

Document method

5449f65

Minor refactoring and documentation

22af019

Fix typo

1d2ab45

jkeiren added 18 commits January 6, 2025 13:36

Extend action_compare to take into account arguments

f4652f3

Move sorting of communications to utility

59d6c35

Simplify insert_timed_delta_summand

04e7633

Introduce type for action(name) multisets

c7d13b5

Preparation for using std::multiset.

Preparation for using multisets

50314ca

Use iterators instead of objects.

5461916

Use iterators in psi

1014e38

Small updates to allow caching using names instead of multiactions

45f1e92

Store a single iterator.

ce1960b

Allow move semantics and remove multiset

6df44aa

- One vector was copied in combining tuple lists. By using move semantics, we avoid creation and destruction of aterms.
- Multisets so far do not seem to prove an advantage, so we avoid their use.
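
As a rough illustration of the move-semantics point, the sketch below combines two lists without copying the elements of the second one. The alias `tuple_list` and the function `combine` are hypothetical; the actual code operates on aterm-based lists, which are replaced here by vectors of strings.

```cpp
#include <cassert>
#include <cstddef>
#include <iterator>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a list of tuples; strings replace aterms.
using tuple_list = std::vector<std::vector<std::string>>;

// Append the elements of extra to base. Both parameters are taken by value,
// so a caller can std::move its lists in and no element is copied.
tuple_list combine(tuple_list base, tuple_list extra)
{
  const std::size_t expected = base.size() + extra.size();
  base.reserve(expected);
  base.insert(base.end(),
              std::make_move_iterator(extra.begin()),
              std::make_move_iterator(extra.end()));
  assert(base.size() == expected);
  return base;
}
```

A call such as `combine(std::move(l1), std::move(l2))` transfers ownership of the inner vectors instead of duplicating them, which is the kind of saving the commit message refers to.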

Count operations when calculating communication operator

9b9281c

Document; reduce worst case complexity

eda9aa8

Add tests for renaming.

eb40b99

Add basic test for calculating allow_

064fd54

Also fix bug that was caught by this test.

Fix test for renaming

72f4719

Move test to correct library.

292adea

Use utility function.

dd7669b

jkeiren requested a review from mlaveaux January 14, 2025 19:11

jkeiren marked this pull request as draft January 14, 2025 19:11

jkeiren added 3 commits January 15, 2025 08:49

Revert the use of iterators in the interfaces.

a18964e

Remove type aliases.

be96510

Improve unit test structure

064a184

jkeiren marked this pull request as ready for review January 15, 2025 10:42

jkeiren added the "enhancement" label (Something can be improved) Jan 15, 2025

jkeiren added 3 commits January 16, 2025 09:07

Move communication calculation into a class.

cc702bd

This significantly cleans up the parameter lists of the methods used.

Fix assertions.

3910ff6

Add missing include for Windows

a87f69b
