-
Notifications
You must be signed in to change notification settings - Fork 239
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[WC] Align compression subgraphs for both weight input data types (#2537
) ### Changes Precision configuration of input OV model for weight compression can be one of the following three: 1. Weights and activations are in FP32 precision (model is saved with `compress_to_fp16=False`) 2. Weights are in FP16 and activations are in FP32 (model is saved with `compress_to_fp16=True`) 3. Weight and activations are in FP16 (e.g., PT model is first halfed and then converted to OV) This PR make compression subgraphs equal for all these three cases. Compression activations are always executed in FP16. So for the first case an additional f16 -> f32 Convert node is added. ![image](https://github.com/openvinotoolkit/nncf/assets/23343961/f301b7de-0599-4935-bcbb-0d28a67d8280)
- Loading branch information
1 parent
96eef35
commit df81f44
Showing
3 changed files
with
56 additions
and
33 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters