Reduced the number of graph rebuilds #2011
Conversation
run pylint pre-commit tests
Force-pushed from c724f9b to 2d1403e
Force-pushed from 6cdfc5f to 83b9a78
def apply(
    self,
    model: TModel,
    graph: NNCFGraph,
How should the user create the graph to pass in? Should they use the NNCFGraphFactory to build it?
Yes, the introduction of a simple API for creating a model graph will come in a follow-up PR.
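For illustration, a minimal sketch of how a caller could build the graph today; it assumes the existing NNCFGraphFactory helper and a generic model object, both of which the simplified API mentioned above would wrap:

from nncf.common.factory import NNCFGraphFactory

# `model` stands for any backend model NNCF supports (assumed to exist here),
# e.g. an openvino.runtime.Model or a torch.nn.Module.
model = ...

# Build the NNCFGraph once, then reuse it across algorithm calls
# instead of rebuilding it from the model each time.
graph = NNCFGraphFactory.create(model)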
@@ -484,10 +486,11 @@ def output_filter_func(point):
            output_fp.extend(tensor_collector.get_statistics().mean_values)
        return np.array(output_fp)

-    def get_statistic_points(self, model: TModel) -> StatisticPointsContainer:
+    def get_statistic_points(self, model: TModel, graph: NNCFGraph) -> StatisticPointsContainer:
The next question is not related to these changes, but I think it is important. Is the model parameter a quantized model or an initial one? The following code snippet suggests that it is a model with quantizers:
model_copy = self._backend_entity.remove_fq_from_inputs(copy_model(model), graph)
In that case, I don't understand how the PostTrainingQuantization.get_statistic_points()
method works, because it takes an initial (not quantized) model. It makes sense to discuss this offline.
Good question! @KodiaqQ, could you answer this comment?
To be honest, the model here for BC is without quantizers, so remove_fq_from_inputs
could be removed. But let's keep it for the next PRs, because the BC algorithm is not the main part of DQ.
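To make the point concrete, here is a hypothetical, simplified sketch of the BC flow under discussion; copy_model and remove_fq_from_inputs come from the snippet quoted above, while the class name, the backend stub, and the final helper are invented for illustration and are not the actual NNCF implementation:

from copy import deepcopy


def copy_model(model):
    # Stand-in for the backend-specific model-copy helper.
    return deepcopy(model)


class BiasCorrectionSketch:
    """Hypothetical stand-in for the BC algorithm class."""

    def __init__(self, backend_entity):
        self._backend_entity = backend_entity

    def get_statistic_points(self, model, graph):
        # BC works on a copy so the caller's model stays untouched.
        # On an initial (non-quantized) model, stripping FakeQuantize nodes
        # from the inputs is effectively a no-op, which is why the call
        # could be removed for this flow.
        model_copy = self._backend_entity.remove_fq_from_inputs(copy_model(model), graph)
        return self._register_statistic_points(model_copy, graph)

    def _register_statistic_points(self, model, graph):
        # Hypothetical helper: walk the graph and register collection points.
        raise NotImplementedError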
@@ -189,13 +191,15 @@ def _create_statistics_aggregator(self, dataset: Dataset, backend: BackendType)
            return PTStatisticsAggregator(dataset)
        return None

-    def _apply(
+    def apply(
Could you please update the PostTrainingQuantization.get_statistic_points()
method as well?
Thanks for the comment. I updated PostTrainingQuantization.get_statistic_points()
taking PR #2013 into account, because the PostTrainingQuantization.get_statistic_points()
function had an incorrect implementation anyway.
Changes
Extended the signatures of the algorithm methods with an NNCFGraph parameter. This change yields a 2.34x quantization speed-up for the "hf-internal-testing/tiny-random-GPTNeoXForCausalLM" model from optimum. See the sketch below.
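As a hedged before/after sketch of the extended signatures (names like algorithm and model are illustrative, and the real methods also take additional arguments such as statistic points and a dataset):

from nncf.common.factory import NNCFGraphFactory

# Before this PR, each algorithm call rebuilt the graph internally:
#     quantized_model = algorithm.apply(model)
# After, the caller builds the graph once and passes it explicitly:
graph = NNCFGraphFactory.create(model)
statistic_points = algorithm.get_statistic_points(model, graph)
quantized_model = algorithm.apply(model, graph)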
Reason for changes
Reduced the number of graph rebuilds
Related tickets
ref: 113245
Tests
N/A