Can somebody explain why batched inference isn't more efficient in MNN? When I run detection on a single image it takes 7 milliseconds, and when I run on a batch of 32 images it takes 8 milliseconds per image. This is inference time only, measured as the duration of `runSession`, excluding image preparation and postprocessing. What can I do to get better throughput?
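For reference, a minimal sketch of the measurement described above, assuming a hypothetical model file `model.mnn` and a 3x224x224 NCHW input (adjust both to the actual detector):

```cpp
// Compares per-image runSession time for batch sizes 1 and 32.
#include <MNN/Interpreter.hpp>
#include <chrono>
#include <cstdio>
#include <memory>

int main() {
    std::shared_ptr<MNN::Interpreter> net(
        MNN::Interpreter::createFromFile("model.mnn"));  // path assumed
    MNN::ScheduleConfig config;
    config.numThread = 4;  // tune to the device
    auto* session = net->createSession(config);

    for (int batch : {1, 32}) {
        // Re-shape the input to the new batch size and let MNN
        // re-allocate the session's buffers accordingly.
        auto* input = net->getSessionInput(session, nullptr);
        net->resizeTensor(input, {batch, 3, 224, 224});
        net->resizeSession(session);

        net->runSession(session);  // warm-up run, excluded from timing
        auto start = std::chrono::high_resolution_clock::now();
        net->runSession(session);
        auto end = std::chrono::high_resolution_clock::now();
        double ms =
            std::chrono::duration<double, std::milli>(end - start).count();
        printf("batch=%d  total=%.2f ms  per-image=%.2f ms\n",
               batch, ms, ms / batch);
    }
    return 0;
}
```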
The bug behind this issue has been resolved. Batch inference time now depends on the device's compute throughput (FLOPS). If a single image already saturates the device's peak FLOPS, batching images will not be more efficient.
To measure the device's peak compute FLOPS, you can use `./run_test.out speed/MatMulBConst`.
Normally a GPU has more FLOPS than a CPU, so you can use the OpenCL backend instead of the CPU for the forward pass, as in the sketch below.
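A minimal sketch of switching the forward type to OpenCL, assuming MNN was built with OpenCL support enabled and a hypothetical model path:

```cpp
#include <MNN/Interpreter.hpp>
#include <memory>

int main() {
    std::shared_ptr<MNN::Interpreter> net(
        MNN::Interpreter::createFromFile("model.mnn"));  // path assumed
    MNN::ScheduleConfig config;
    config.type       = MNN_FORWARD_OPENCL;  // run the forward pass on the GPU
    config.backupType = MNN_FORWARD_CPU;     // fall back if OpenCL is unavailable
    auto* session = net->createSession(config);
    net->runSession(session);
    return 0;
}
```

Setting `backupType` lets MNN fall back to the CPU backend transparently on devices where the OpenCL runtime is missing or the build lacks GPU support.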
jxt1234 added the **User** label on Jan 29, 2025
Originally posted by @mingyunzzu in #673