Fuse reshapes on pointwise inputs for mlir output fusion #3569

pfultz2 · 2024-10-29T14:38:50Z

No description provided.

codecov · 2024-11-30T23:04:42Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 92.20%. Comparing base (241e24e) to head (0ef4e55).

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #3569   +/-   ##
========================================
  Coverage    92.20%   92.20%           
========================================
  Files          513      513           
  Lines        21658    21658           
========================================
  Hits         19970    19970           
  Misses        1688     1688

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

shivadbhavsar

Approving, comment is probably a topic for another PR if its even an issue

shivadbhavsar · 2024-12-06T03:11:07Z

src/targets/gpu/fuse_mlir.cpp

+
        bool gemm_has_multi_outs = gemm_based_op->outputs().size() > 1;


Might be tangential to this PR, but what happens if one of the reshape ops has multi outputs? ie. something like:

dot -> reshape -> pointwise -> .... -> dot |_____________________________|

The loop below checks that all the reshapes between dot and pointwise have only one output, and it will output both if any reshape is used more than once. For the aux inputs to pointwise, it doesnt really matter:

┌───┐┌─┐ │dot││x│ └┬──┘└┬┘ │┌───▽──────┐ ││reshape │ │└┬───────┬─┘ ┌▽─▽──────┐│ │pointwise││ └┬────────┘│ ┌▽─────────▽┐ │convolution│ └───────────┘

After mlir fusion it just becomes:

┌──────────┐ │x │ └┬────────┬┘ ┌▽──────┐┌▽─────────────────────────┐ │reshape││mlir_dot_reshape_pointwise│ └┬──────┘└┬─────────────────────────┘ ┌▽────────▽─┐ │convolution│ └───────────┘

DCE wont remove the reshape since its used, and the duplicated reshape doesnt really matter since it is potentially an aliasing-like operator.

migraphx-bot · 2025-01-29T20:20:20Z

Test	Batch	Rate new b1579e	Rate old 879f30	Diff	Compare
torchvision-resnet50	64	3,235.54	3,237.49	-0.06%	✅
torchvision-resnet50_fp16	64	6,878.27	6,885.53	-0.11%	✅
torchvision-densenet121	32	2,437.85	2,438.04	-0.01%	✅
torchvision-densenet121_fp16	32	4,214.82	4,201.39	0.32%	✅
torchvision-inceptionv3	32	1,615.08	1,614.18	0.06%	✅
torchvision-inceptionv3_fp16	32	2,691.74	2,690.16	0.06%	✅
cadene-inceptionv4	16	750.75	750.10	0.09%	✅
cadene-resnext64x4	16	810.35	809.38	0.12%	✅
slim-mobilenet	64	6,668.09	6,664.81	0.05%	✅
slim-nasnetalarge	64	199.04	199.11	-0.03%	✅
slim-resnet50v2	64	3,430.59	3,426.01	0.13%	✅
bert-mrpc-onnx	8	1,146.10	1,145.27	0.07%	✅
bert-mrpc-tf	1	480.87	479.30	0.33%	✅
pytorch-examples-wlang-gru	1	462.96	470.08	-1.51%	✅
pytorch-examples-wlang-lstm	1	441.82	438.03	0.87%	✅
torchvision-resnet50_1	1	808.15	803.96	0.52%	✅
cadene-dpn92_1	1	431.76	433.48	-0.40%	✅
cadene-resnext101_1	1	391.26	389.71	0.40%	✅
onnx-taau-downsample	1	373.25	373.58	-0.09%	✅
dlrm-criteoterabyte	1	31.80	31.80	-0.01%	✅
dlrm-criteoterabyte_fp16	1	51.04	51.06	-0.03%	✅
agentmodel	1	8,558.05	8,764.93	-2.36%	✅
unet_fp16	2	57.92	58.01	-0.15%	✅
resnet50v1_fp16	1	1,058.83	1,021.79	3.63%	🔆
resnet50v1_int8	1	788.85	780.83	1.03%	✅
bert_base_cased_fp16	64	1,172.43	1,172.44	-0.00%	✅
bert_large_uncased_fp16	32	362.52	362.45	0.02%	✅
bert_large_fp16	1	202.19	201.10	0.54%	✅
distilgpt2_fp16	16	2,218.42	2,219.84	-0.06%	✅
yolov5s	1	525.33	521.90	0.66%	✅
tinyllama	1	43.59	43.62	-0.05%	✅
vicuna-fastchat	1	43.89	43.83	0.13%	✅
whisper-tiny-encoder	1	412.16	412.03	0.03%	✅
whisper-tiny-decoder	1	411.31	410.73	0.14%	✅

Check results before merge 🔆

migraphx-bot · 2025-01-29T20:20:22Z

✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance

✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance

✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance

✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance

✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance

✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance

✅ agentmodel: PASSED: MIGraphX meets tolerance

✅ unet: PASSED: MIGraphX meets tolerance

✅ resnet50v1: PASSED: MIGraphX meets tolerance

✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance

🔴bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

✅ bert_large: PASSED: MIGraphX meets tolerance

✅ yolov5s: PASSED: MIGraphX meets tolerance

✅ tinyllama: PASSED: MIGraphX meets tolerance

✅ vicuna-fastchat: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-encoder: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-decoder: PASSED: MIGraphX meets tolerance

✅ distilgpt2_fp16: PASSED: MIGraphX meets tolerance

pfultz2 added 18 commits October 16, 2024 10:22

Refactor

fa341fc

Format

d945ba8

Reverse the order

a609302

Merge branch 'develop' into mlir-fuse-reshapes-pointwise

009086f

Fix test

504bf89

Format

b590499

Merge branch 'develop' into mlir-fuse-reshapes-pointwise

0fc5c5a

Add skip parameter

3c363f5

Format

34c7048

Fix more tests

d04b188

Format

94bb38d

Fix last test

2734e74

Format

301a68e

Remove skip parameter

f5240a1

Add test for reshape fusion

b5ef8f9

Format

802268e

Update reshaper names

d7a4cd8

Format

5d5f213

pfultz2 requested a review from causten as a code owner October 29, 2024 14:38

pfultz2 requested review from shivadbhavsar and removed request for causten October 29, 2024 14:38

Merge branch 'develop' into mlir-fuse-reshapes-pointwise

0ef4e55

pfultz2 requested a review from richagadgil December 4, 2024 00:20

pfultz2 added 3 commits December 3, 2024 18:33

Fix tidy warning

b2d9a2a

Format

99100d2

Fix cppcheck issues

1fe5c5a

pfultz2 requested a review from TedThemistokleous December 5, 2024 23:11

shivadbhavsar approved these changes Dec 6, 2024

View reviewed changes

TedThemistokleous assigned pfultz2 Dec 6, 2024

TedThemistokleous added Perf Improve rocMLIR labels Dec 6, 2024

TedThemistokleous approved these changes Dec 6, 2024

View reviewed changes

causten added 2 commits December 9, 2024 18:28

Merge branch 'develop' into mlir-fuse-reshapes-pointwise

97a6db8

Merge branch 'develop' into mlir-fuse-reshapes-pointwise

b1579ea

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fuse reshapes on pointwise inputs for mlir output fusion #3569

Fuse reshapes on pointwise inputs for mlir output fusion #3569

pfultz2 commented Oct 29, 2024

codecov bot commented Nov 30, 2024

shivadbhavsar left a comment

shivadbhavsar Dec 6, 2024

pfultz2 Dec 10, 2024 •

edited

Loading

migraphx-bot commented Jan 29, 2025

migraphx-bot commented Jan 29, 2025


		bool gemm_has_multi_outs = gemm_based_op->outputs().size() > 1;

Fuse reshapes on pointwise inputs for mlir output fusion #3569

Are you sure you want to change the base?

Fuse reshapes on pointwise inputs for mlir output fusion #3569

Conversation

pfultz2 commented Oct 29, 2024

codecov bot commented Nov 30, 2024

Codecov Report

shivadbhavsar left a comment

Choose a reason for hiding this comment

shivadbhavsar Dec 6, 2024

Choose a reason for hiding this comment

pfultz2 Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

migraphx-bot commented Jan 29, 2025

migraphx-bot commented Jan 29, 2025

pfultz2 Dec 10, 2024 •

edited

Loading