[WC] Fix ratio_defining_params #2653
Conversation
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## develop #2653 +/- ##
============================================
- Coverage 62.02% 29.87% -32.16%
============================================
Files 495 495
Lines 45973 45971 -2
============================================
- Hits 28516 13735 -14781
- Misses 17457 32236 +14779
... and 264 files with indirect coverage changes
Flags with carried forward coverage won't be shown.
ratio_defining_params = list(
    filter(
        lambda wp: wp.node_with_weight.metatype in self._backend_entity.matmul_metatypes,
        all_weight_params,
    )
)

# The last MatMul layer is quantized to 4-bits if all_layers=True or if the layer is shared
This comment looks wrong.
"if the layer is shared" means that it has the same precision as the embedding, and it's 4-bit only with all_layers=True.
LGTM, it would be nice to correct the comment, but that's minor.
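For context, here is a minimal sketch of how the ratio-defining parameters could be selected. The `WeightParam` dataclass, `MATMUL_METATYPES` set, and `get_ratio_defining_params` function are illustrative stand-ins, not NNCF's actual types or signatures; the real `_get_ratio_defining_params` method and its backend entities may differ.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class WeightParam:
    """Illustrative stand-in for a weight-compression parameter entry."""

    name: str
    metatype: str  # e.g. "MatMul", "Gather"
    is_shared: bool = False  # weight is shared with the embedding (Gather) node


MATMUL_METATYPES = {"MatMul"}


def get_ratio_defining_params(all_weight_params: List[WeightParam], all_layers: bool) -> List[WeightParam]:
    """Pick the weights whose precision is decided by the mixed-precision ratio."""
    # Only MatMul weights take part in the ratio-based precision assignment,
    # mirroring the filter over matmul_metatypes in the diff above.
    matmul_params = [wp for wp in all_weight_params if wp.metatype in MATMUL_METATYPES]
    if all_layers:
        # With all_layers=True every MatMul, including the last one, is ratio-defining.
        return matmul_params
    # Otherwise the last MatMul is kept in backup precision; per the review above,
    # a shared last layer simply follows the embedding's precision instead.
    return matmul_params[:-1]
```

Under this sketch, a last MatMul that shares its weight with the embedding would end up 4-bit only when all_layers=True, which matches the reviewer's reading above.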
bbc7f71 to 15afc14
Changes
Reason for changes
Fix bug in the _get_ratio_defining_params method.
Tests
test_shared_gather_all_layers
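A hypothetical shape for such a test, exercising the illustrative helper from the sketch above rather than NNCF's real test suite (the layer names and parametrization are made up):

```python
# Assumes WeightParam and get_ratio_defining_params from the sketch above are importable.
import pytest


@pytest.mark.parametrize(
    "all_layers, expected_names",
    [
        (True, ["matmul_1", "matmul_2"]),  # last MatMul included when all_layers=True
        (False, ["matmul_1"]),             # last MatMul excluded otherwise
    ],
)
def test_shared_gather_all_layers_sketch(all_layers, expected_names):
    # A Gather (embedding) weight shared with the last MatMul, as in the discussion above.
    weights = [
        WeightParam("embedding", metatype="Gather", is_shared=True),
        WeightParam("matmul_1", metatype="MatMul"),
        WeightParam("matmul_2", metatype="MatMul", is_shared=True),
    ]
    selected = get_ratio_defining_params(weights, all_layers=all_layers)
    assert [wp.name for wp in selected] == expected_names
```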