[CPU] [ARM] [INT8] FullyConnected #25171

eshoguli · 2024-06-22T21:57:32Z

Details:

[ARM] [INT8] FullyConnected

Tickets:

CVS-149494

github-actions · 2024-07-29T00:21:04Z

This PR will be closed in a week because of 2 weeks of no activity.

alvoron · 2024-12-12T17:02:40Z

General comment: seems like we need to merge it after #26239. Some adjustment process will be needed

Needed changes have been applied.

dmitry-gorokhov · 2024-12-13T11:27:34Z

src/plugins/intel_cpu/src/nodes/executors/acl/acl_lowp_fullyconnected.cpp

+}
+
+arm_compute::Status ACLLowpFullyConnectedExecutor::validateTensorsInfo(const ACLInfos & aclMemoryInfos) {
+    auto &tensor_info = aclMemoryInfos[ACLArgs::ACL_SRC_0];


Just wondering why we are setting dequantization scale on Input port? Logically I would expect it to be applied on dst port

src/plugins/intel_cpu/src/nodes/executors/acl/acl_lowp_fullyconnected.cpp

This reverts commit 9226262.

dmitry-gorokhov · 2024-12-18T10:01:13Z

src/plugins/intel_cpu/src/nodes/executors/fullyconnected_implementations.cpp

+            [](const MemoryArgs& memory) -> bool {
+                const auto dequantizationScales = getDeQuantizedScales(memory);
+                bool isPerChannelQuantization = dequantizationScales.size() > 1;
+                // per-channel quantization is not unsupported by ACL


minor: "not unsupported"?

### Details: - *[ARM] [INT8] FullyConnected* ### Tickets: - *CVS-149494* --------- Co-authored-by: Aleksandr Voron <[email protected]>

eshoguli requested review from a team as code owners June 22, 2024 21:57

github-actions bot added category: IE Tests OpenVINO Test: plugins and common category: CPU OpenVINO CPU plugin category: LP transformations OpenVINO Low Precision transformations labels Jun 22, 2024

eshoguli changed the title ~~[TEST] [ARM] [INT8] FullyConnected~~ [TEST] [CPU] [ARM] [INT8] FullyConnected Jun 23, 2024

eshoguli force-pushed the es/aarch64/int8 branch from 46e41b5 to c2d4099 Compare June 26, 2024 00:21

github-actions bot removed the category: LP transformations OpenVINO Low Precision transformations label Jun 26, 2024

eshoguli requested review from a team as code owners June 26, 2024 10:53

github-actions bot added category: GPU OpenVINO GPU plugin category: build OpenVINO cmake script / infra labels Jun 26, 2024

eshoguli changed the title ~~[TEST] [CPU] [ARM] [INT8] FullyConnected~~ [CPU] [ARM] [INT8] FullyConnected Jun 26, 2024

eshoguli force-pushed the es/aarch64/int8 branch 5 times, most recently from b972f54 to 743281f Compare July 2, 2024 00:07

eshoguli force-pushed the es/aarch64/int8 branch 5 times, most recently from b56d725 to ea6c2b2 Compare July 10, 2024 19:32

github-actions bot added the Stale label Jul 29, 2024

eshoguli force-pushed the es/aarch64/int8 branch 2 times, most recently from af1105f to 0d7c9ec Compare July 31, 2024 00:36

alvoron added 2 commits December 11, 2024 15:04

Merge branch 'master' into es/aarch64/int8

5d8c67d

changes required after openvinotoolkit#26239

7a07337

alvoron requested a review from a team as a code owner December 12, 2024 16:56

alvoron requested review from itikhono and removed request for a team December 12, 2024 16:56

github-actions bot added the category: transformations OpenVINO Runtime library - Transformations label Dec 12, 2024

Merge branch 'master' into es/aarch64/int8

aeca18e

alvoron force-pushed the es/aarch64/int8 branch from 5dc1b5c to 80f626f Compare December 12, 2024 17:17

fix code style and warnings

44e04cf

alvoron force-pushed the es/aarch64/int8 branch from 80f626f to 44e04cf Compare December 12, 2024 17:35

dmitry-gorokhov reviewed Dec 13, 2024

View reviewed changes

alvoron added 8 commits December 13, 2024 13:19

rollback dq scales fusing

fddd3f4

removed DQ check

55e0d0d

stop wrapping FQ with Convert

9226262

Revert "stop wrapping FQ with Convert"

aa64460

This reverts commit 9226262.

mark getDeQuantizedScales as OV_CPU_MAYBE_UNUSED_FUNCTION

0079f5f

added missed code to prepareWeightMemory

5455c1b

fix fuse condition

118cfa8

Merge branch 'master' into es/aarch64/int8

f782508

alvoron force-pushed the es/aarch64/int8 branch from af0a528 to f782508 Compare December 17, 2024 12:14

alvoron added 3 commits December 17, 2024 14:04

clang-format fix

5721c63

Merge branch 'master' into es/aarch64/int8

bf9f2f6

fix x64 expected nodes

83bc0c3

dmitry-gorokhov approved these changes Dec 18, 2024

View reviewed changes

dmitry-gorokhov added this pull request to the merge queue Dec 18, 2024

Merged via the queue into openvinotoolkit:master with commit 9ff5942 Dec 18, 2024
182 checks passed

dmitry-gorokhov deleted the es/aarch64/int8 branch December 18, 2024 12:12

11happy pushed a commit to 11happy/openvino that referenced this pull request Dec 23, 2024

[CPU] [ARM] [INT8] FullyConnected (openvinotoolkit#25171)

2a7c797

### Details: - *[ARM] [INT8] FullyConnected* ### Tickets: - *CVS-149494* --------- Co-authored-by: Aleksandr Voron <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CPU] [ARM] [INT8] FullyConnected #25171

[CPU] [ARM] [INT8] FullyConnected #25171

eshoguli commented Jun 22, 2024 •

edited

Loading

github-actions bot commented Jul 29, 2024

alvoron commented Dec 12, 2024

dmitry-gorokhov Dec 13, 2024

dmitry-gorokhov Dec 18, 2024

[CPU] [ARM] [INT8] FullyConnected #25171

[CPU] [ARM] [INT8] FullyConnected #25171

Conversation

eshoguli commented Jun 22, 2024 • edited Loading

Details:

Tickets:

github-actions bot commented Jul 29, 2024

alvoron commented Dec 12, 2024

dmitry-gorokhov Dec 13, 2024

Choose a reason for hiding this comment

dmitry-gorokhov Dec 18, 2024

Choose a reason for hiding this comment

eshoguli commented Jun 22, 2024 •

edited

Loading