[BUG] Fix CAGRA graph optimization bug #565

enp1s0 · 2025-01-11T06:51:27Z

The merging process of the pruned and revedge graphs is inappropriate, resulting in neighbor index duplication. This PR fixes the issue and adds the validation of graph indices at the end of the graph::optimze function.

The simplest solution to fix this issue is 1151511, but it ignores the memory space optimization by rapidsai/raft@12480cf. The code changes in this PR apply the memory space optimization to the simplest solution to remove the memory space for the pruned graph (pruned_graph).

This PR also fixes a problem that throwing an exception from an OMP loop does not work as expected.

This is the search performance comparison between the graph optimization that has a bug (branch-25.02) and fixed (fix-cagra-graph-optimization-bug). Each point represents the search performance for itopk=32, 64, ... 512. By this PR, the recall becomes slightly higher when searching with a small itopk size. Because the number of duplicated nodes by this bug is not so large (typically less than 100 in a 1M dataset), the performances are almost the same when searching with a large itopk size and traversing the graph sufficiently.

enp1s0 · 2025-01-12T14:40:45Z

The new optimize function gets stuck in cagra_extreme_inputs_oob_test because there are some invalid index (~0u) nodes in the initial knn graph. It would be better to check the invalid nodes first in the function and abort the process if there are any.

It seems too severe to abort the optimization process if there is even one invalid node, so I updated the code not to abort unless the pruned graph can be generated even if there are invalid nodes in the initial kNN graph.

enp1s0 · 2025-01-13T15:42:39Z

@anaruse Can you review the update? It is related to merging the pruned graph and MST optimization edges.

cjnolet · 2025-01-16T21:34:28Z

Approving, but waiting to merge until you confirm it's ready @enp1s0

enp1s0 · 2025-01-17T05:12:23Z

@cjnolet ~~Thank you for approving. I think it's ready to be merged.~~ Akira and I found a problem with this fix. Please wait a moment.

…ptimize`

enp1s0 · 2025-01-17T15:52:31Z

@anaruse Can you review the code again?

anaruse · 2025-01-20T05:51:57Z

@anaruse Can you review the code again?

It looks good to me.

… params (#569) This PR updates the default chunk size of the CAGRA graph extension and also adds a knob to control the batch size of the CAGRA searches run inside for better throughput. The default chunk size was set to 1 in the current implementation because there is a potential problem with low recall when the chunk size is large, because no edges are made within nodes in the same chunk. However, as I have investigated, the low recall problem rarely occurs with large chunk sizes. # Search performance The performance was measured after applying a bugfix #565 ## degree = 32 ![extend-ir0 9-degree32](https://github.com/user-attachments/assets/a5bb2fb6-8c12-49ad-b96a-1b384d79a96b) (I don't know the reason the performance is unstable in NYTimes.) ## degree = 64 ![extend-ir0 9-degree64](https://github.com/user-attachments/assets/8e926e1c-d772-4682-9419-9cc027f09d3f) So I increase the default chunk size to the size of the new dataset vectors for better throughput in this PR. I also make public a knob to control the search batch size in the `extend' function to control the balance between throughput and memory consumption. Authors: - tsuki (https://github.com/enp1s0) - Corey J. Nolet (https://github.com/cjnolet) Approvers: - Corey J. Nolet (https://github.com/cjnolet) - Tamas Bela Feher (https://github.com/tfeher) URL: #569

cjnolet · 2025-01-24T00:27:42Z

/merge

enp1s0 added 3 commits January 11, 2025 14:43

Fix dst ptr of the pruned graph

1151511

Add duplication check in the pruning step

a803614

Add node index validation to detail::graph::optimize

c9c8807

enp1s0 requested a review from a team as a code owner January 11, 2025 06:51

enp1s0 self-assigned this Jan 11, 2025

github-actions bot added the cpp label Jan 11, 2025

enp1s0 added bug Something isn't working non-breaking Introduces a non-breaking change and removed cpp labels Jan 11, 2025

enp1s0 changed the title ~~Fix CAGRA graph optimization bug~~ [BUG] Fix CAGRA graph optimization bug Jan 11, 2025

Fix log level

4acde11

github-actions bot added the cpp label Jan 11, 2025

enp1s0 added 4 commits January 13, 2025 22:53

Update pruning loop

c9d318b

Update pruned gragh validation

422b6ca

Update cagra_extreme_inputs_oob_test

42f0ce6

Fix style

bf22dc8

enp1s0 and others added 3 commits January 14, 2025 00:48

Remove unnecessary atomic add

9ba3014

Fix log level

69625c2

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

1235da6

enp1s0 mentioned this pull request Jan 14, 2025

Improve the performance of CAGRA new vector addition with the default params #569

Merged

enp1s0 added 3 commits January 16, 2025 11:05

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

d0faea9

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

263286b

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

534cbad

cjnolet approved these changes Jan 16, 2025

View reviewed changes

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

8aa0675

Update to use RAFT_EXPECTS instead of assert in `cagra::detail::o…

0151501

…ptimize`

enp1s0 added 5 commits January 17, 2025 00:30

Fix graph merge

150b64b

Add duplication check

24729fd

Fix duplication check

bd7de7f

Fix duplication check

daa9807

Fix var names

bf8df91

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

0a5e56e

enp1s0 added 4 commits January 22, 2025 11:53

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

92c48c3

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

5399a44

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

3a266be

Merge branch 'branch-25.02' into fix-cagra-graph-optimization-bug

cbbdbe1

rapids-bot bot merged commit dc00f80 into rapidsai:branch-25.02 Jan 24, 2025
61 checks passed

enp1s0 deleted the fix-cagra-graph-optimization-bug branch January 24, 2025 01:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Fix CAGRA graph optimization bug #565

[BUG] Fix CAGRA graph optimization bug #565

enp1s0 commented Jan 11, 2025 •

edited

Loading

enp1s0 commented Jan 12, 2025 •

edited

Loading

enp1s0 commented Jan 13, 2025

cjnolet commented Jan 16, 2025

enp1s0 commented Jan 17, 2025 •

edited

Loading

enp1s0 commented Jan 17, 2025

anaruse commented Jan 20, 2025

cjnolet commented Jan 24, 2025

[BUG] Fix CAGRA graph optimization bug #565

[BUG] Fix CAGRA graph optimization bug #565

Conversation

enp1s0 commented Jan 11, 2025 • edited Loading

enp1s0 commented Jan 12, 2025 • edited Loading

enp1s0 commented Jan 13, 2025

cjnolet commented Jan 16, 2025

enp1s0 commented Jan 17, 2025 • edited Loading

enp1s0 commented Jan 17, 2025

anaruse commented Jan 20, 2025

cjnolet commented Jan 24, 2025

enp1s0 commented Jan 11, 2025 •

edited

Loading

enp1s0 commented Jan 12, 2025 •

edited

Loading

enp1s0 commented Jan 17, 2025 •

edited

Loading