Skip to content

Commit

Permalink
[CPU] Optimize DQ FC with stack allocated aux accum registers
Browse files Browse the repository at this point in the history
  • Loading branch information
dmitry-gorokhov committed Jan 22, 2025
1 parent a7ef4f8 commit 0c6cff2
Show file tree
Hide file tree
Showing 2 changed files with 1 addition and 6 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -169,11 +169,6 @@ static bool useDynamicQuantizationImpl(size_t dqGroupSize,
if (zpPtr && !one_of(zpPtr->getDesc().getPrecision(), ov::element::u8, ov::element::u4, ov::element::undefined))
return false;

// TODO: heuristic: disable avx2 asymmetric
bool is_asymmetric_weights = one_of(weightsDesc->getPrecision(), ov::element::u8, ov::element::u4);
if (is_asymmetric_weights && !dnnl::impl::cpu::x64::mayiuse(dnnl::impl::cpu::x64::avx512_core_vnni))
return false;

const size_t simdWidth = 16;
if (dqGroupSize % simdWidth)
return false;
Expand Down

0 comments on commit 0c6cff2

Please sign in to comment.