Implement caching for Estimators and Transformers #845

JulioAPeraza · 2023-11-07T21:54:39Z

Closes #844.

Changes proposed in this pull request:

We are implementing caching at three levels of a meta-analysis:

High-level (or estimator-level): caches _fit()
Mid-level (or transformer-level): caches _transform()
Low-level (or modeled activation (MA) map level): caches _get_ma_map()

The most typical use case will be at lower levels, given that it is very common to recompute the same MA maps when working with the same database (e.g., Neurosynth).

Transformers, at the mid level, will also benefit from caching.

At the estimator level, caching was implemented just for the sake of completeness. We currently recommend saving the MetaResult object to a pickle file and loading it again if it will be reused. This avoids the large overhead that comes from hashing a whole NiMARE database object.
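The pickle recommendation above can be sketched with the standard library alone. This is a minimal illustration, not NiMARE code: the `result` dictionary is a hypothetical stand-in for a fitted MetaResult, and the file name is an assumption.

```python
import pickle
import tempfile
from pathlib import Path

# Hypothetical stand-in for a fitted result; in practice this would be
# the MetaResult object returned by an estimator.
result = {"estimator": "ALE", "maps": {"z": [0.1, 2.3, 4.5]}}

# Assumed output location, created in a temporary directory for the example.
out_file = Path(tempfile.mkdtemp()) / "meta_result.pkl"

# Save once after fitting...
with open(out_file, "wb") as f:
    pickle.dump(result, f)

# ...and reload later instead of refitting (no dataset hashing involved).
with open(out_file, "rb") as f:
    reloaded = pickle.load(f)
```

Reloading skips both the fit and the cache lookup, so nothing needs to hash the input dataset.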

codecov · 2023-11-08T00:14:44Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (720c5c6) 89.12% compared to head (391f52e) 89.13%.
Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #845   +/-   ##
=======================================
  Coverage   89.12%   89.13%           
=======================================
  Files          49       49           
  Lines        6142     6155   +13     
=======================================
+ Hits         5474     5486   +12     
- Misses        668      669    +1



JulioAPeraza · 2023-11-09T23:12:47Z

This shows the performance at memory level 1 (caching only _fit()). With memory level 2 (caching _fit() and _transform()), the performance was the same. At memory level 3 (caching _fit(), _transform(), and _get_ma_map()), however, I saw a slight decrease in performance, although it was still better than no caching.
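To make the memory-level hierarchy concrete, here is a toy sketch of level-gated caching. It is not the NiMARE or nilearn implementation: `LevelCache`, `fit`, and `transform` are hypothetical names, and the cache is a plain in-memory dictionary keyed by a hash of the pickled arguments. The only idea borrowed from the PR is that a function is cached when the configured memory level is at least that function's own level (the commit list mentions func_memory_level for this purpose).

```python
import hashlib
import pickle


class LevelCache:
    """Toy in-memory cache gated by a memory level (hypothetical helper)."""

    def __init__(self, memory_level):
        self.memory_level = memory_level
        self.store = {}
        self.calls = {}  # number of real (uncached) executions per function

    def cache(self, func, func_memory_level):
        def counted(*args):
            # Record every time the wrapped function really runs.
            self.calls[func.__name__] = self.calls.get(func.__name__, 0) + 1
            return func(*args)

        if self.memory_level < func_memory_level:
            return counted  # configured level too low: run uncached

        def cached(*args):
            # Key: function name plus a hash of the pickled arguments.
            digest = hashlib.sha256(pickle.dumps(args)).hexdigest()
            key = (func.__name__, digest)
            if key not in self.store:
                self.store[key] = counted(*args)
            return self.store[key]

        return cached


def fit(data):
    return sum(data)


def transform(data):
    return [2 * x for x in data]


# Memory level 1: only the level-1 function (fit) is cached.
mem = LevelCache(memory_level=1)
cached_fit = mem.cache(fit, func_memory_level=1)
cached_transform = mem.cache(transform, func_memory_level=2)

for _ in range(3):
    cached_fit([1, 2, 3])
    cached_transform([1, 2, 3])
```

After the loop, `mem.calls` records one real execution of `fit` and three of `transform`. Raising `memory_level` to 2 would cache both. Note that hashing the arguments is itself a cost, which is why caching a cheap function can end up slower than recomputing it.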

JulioAPeraza · 2023-11-09T23:19:04Z

I tested the inverted hierarchy:

Memory level 1: caching _get_ma_map().
Memory level 2: caching _get_ma_map(), _transform().
Memory level 3: caching _get_ma_map(), _transform(), and _fit().

However, performance at level 1 with caching was worse than with no caching, probably because the computation in _get_ma_map() is not that expensive.

jdkent

LGTM!

JulioAPeraza added 5 commits November 7, 2023 16:05

Implement high-level caching (estimators)

3fef903

Implement mid-level caching (transformer)

32f4363

Implement low-level caching (MA maps)

e1963ea

test memory caching

07c2650

Leverage new caching parameter for Maskers

d92cf42

JulioAPeraza added the enhancement label Nov 7, 2023

Add memory parameters to report

74434a0

JulioAPeraza added 4 commits November 8, 2023 19:14

Use func_memory_level to establish the caching hierarchy

306e3cb

Update test_meta_kernel.py

4d8350e

Test performance with inverted memory hierarchy

604b6ca

Revert "Test performance with inverted memory hierarchy"

061d4e5

This reverts commit 604b6ca.

JulioAPeraza added 3 commits November 28, 2023 11:47

Remove support for caching MA maps

4834e3c

Add versionchanged to docstrings

f5aa415

Update kernel.py

391f52e

JulioAPeraza marked this pull request as ready for review November 28, 2023 18:58

JulioAPeraza requested a review from jdkent November 28, 2023 18:59

jdkent approved these changes Dec 13, 2023


jdkent merged commit a1c0414 into neurostuff:main Dec 13, 2023
19 checks passed

JulioAPeraza deleted the caching-ma-maps branch December 13, 2023 21:55
