[Fix] l2_exp random fail in half-float32 mixed precision on self-neighboring #596

rhdong · 2025-01-21T20:55:19Z

No description provided.


benfred

PR looks good to me - had a couple of minor questions though

benfred · 2025-01-22T17:17:12Z

python/cuvs/cuvs/test/test_distance.py

@@ -21,6 +21,7 @@
 from cuvs.distance import pairwise_distance


+@pytest.mark.parametrize("times", range(20))


What's this times parameter used for? I don't see it used in the test itself.

Are you just trying to run this test multiple times here to stress test it?

Yeah, it just runs the test multiple times so that the failure reproduces in a single test session, because the failure probability is close to 10% empirically.
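
As a rough check on why 20 repetitions are enough, the sketch below computes the chance of seeing at least one failure. It assumes each repetition fails independently with the same probability, which the thread does not state; the function name is hypothetical.

```python
def prob_at_least_one_failure(p_fail: float, n_runs: int) -> float:
    """Chance that at least one of n_runs independent runs fails.

    Assumption: every run fails independently with probability p_fail.
    """
    # Probability that all runs pass is (1 - p_fail) ** n_runs;
    # the complement is the probability of at least one failure.
    return 1.0 - (1.0 - p_fail) ** n_runs


if __name__ == "__main__":
    # 10% per-run failure rate (the empirical figure quoted above), 20 runs
    print(f"{prob_at_least_one_failure(0.10, 20):.3f}")
```

Under that independence assumption, 20 runs at a 10% failure rate surface the bug in roughly 88% of test sessions, so the repetition makes reproduction likely but not certain.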

benfred · 2025-01-22T17:18:23Z

python/cuvs/cuvs/test/test_distance.py

@@ -79,7 +80,5 @@ def test_distance(n_rows, n_cols, inplace, order, metric, dtype):
    actual = output_device.copy_to_host()

    tol = 1e-3
-    if np.issubdtype(dtype, np.float16):
-        tol = 1e-1


I think I added this reduced tolerance because I was seeing failures - is this no longer needed?

Yeah, the tests pass on my local machines without it. I think the tighter tolerance can help us catch real failures in the future, so I made this change.
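
For context on the kind of round-off a tight tolerance guards against, here is a minimal pure-Python sketch of an expanded L2 distance evaluated on a vector against itself. It is only an illustration, not the cuVS kernel: the helper names are hypothetical, and the choice to round the squared norms to half precision while keeping the dot product at higher precision is an assumption made to show how the two terms can disagree.

```python
import struct


def to_half(value: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


def l2_direct(x, y):
    """Squared L2 distance computed as sum((x_i - y_i)^2)."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def l2_expanded_mixed(x, y):
    """Squared L2 distance as ||x||^2 + ||y||^2 - 2<x, y>.

    Assumption for illustration: the norms are rounded to half precision
    while the dot product keeps higher precision.
    """
    norm_x = to_half(sum(a * a for a in x))
    norm_y = to_half(sum(b * b for b in y))
    dot = sum(a * b for a, b in zip(x, y))
    return norm_x + norm_y - 2.0 * dot


if __name__ == "__main__":
    # Inputs stored as half-precision values, compared against themselves
    x = [to_half(v) for v in (0.1, 0.2, 0.3)]
    print("direct  :", l2_direct(x, x))
    print("expanded:", l2_expanded_mixed(x, x))
```

In this sketch the direct form returns exactly zero for a self-comparison, while the expanded form leaves a small nonzero residue because the rounded norms no longer cancel the dot product exactly.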

bdice · 2025-01-22T20:30:18Z

Looks like Python tests are now passing. We are waiting on one more C++ test job. I will go ahead and trigger a merge once CI finishes to unblock CI, since several PRs depend on this.

bdice · 2025-01-22T20:30:23Z

/merge

[Fix] l2_exp random fail in half-float32 mixed precision on self-neig…

19ffdee

…hboring

rhdong requested review from a team as code owners January 21, 2025 20:55

github-actions bot added cpp Python labels Jan 21, 2025

rhdong added bug Something isn't working non-breaking Introduces a non-breaking change C++ and removed cpp Python labels Jan 21, 2025

rhdong mentioned this pull request Jan 21, 2025

[BUG] l2_exp & KL distances got randomly round-off errors in half-float32 mixed precision on self-neighboring #597

Open

rhdong requested review from benfred and cjnolet January 21, 2025 21:08

jameslamb mentioned this pull request Jan 21, 2025

introduce libcuvs wheels #594

Merged

Merge branch 'branch-25.02' into rhdong/fix-mixed

5e59626

github-actions bot added cpp Python labels Jan 22, 2025

cjnolet assigned rhdong Jan 22, 2025

benfred approved these changes Jan 22, 2025


bdice mentioned this pull request Jan 22, 2025

Revert "Temporarily skip CUDA 11 wheel CI" #601

Merged

rapids-bot bot merged commit 1c91e1f into rapidsai:branch-25.02 Jan 22, 2025
55 checks passed
