Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release notes v2.11 #2710

Merged
merged 19 commits into from
Jun 13, 2024
Merged
Changes from 15 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
52 changes: 52 additions & 0 deletions ReleaseNotes.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,57 @@
# Release Notes

## New in Release 2.11.0

Post-training Quantization:

- Breaking changes:
- ...
- General:
- ...
- Features:
- (OpenVINO) Added Scale Estimation algorithm for 4-bit data-aware weights compression. This algorithm estimates a more optimal scale in compression scheme based on difference between output of matrix multiplication in the original precision and the compressed one. For that, the optional `scale_estimation` parameter has been added to `nncf.compress_weights()` and can be used to minimize accuracy degradation of compressed models (note that this option increases the compression time).
- (OpenVINO) Added GPTQ algorithm for 8/4-bit data-aware weights compression, supporting INT8, INT4, and NF4 data types. Introduced the optional `gptq` parameter to `nncf.compress_weights()` to enable the GPTQ algorithm.
- (OpenVINO) Added support for bf16 weights compression in `nncf.compress_weights()`
- Fixes:
- (OpenVINO) Fix an incorrect Add node determination in FastBiasCorrection, BiasCorrection, and ChannelAlighnment algorithms.
- Improvements:
- ...
- Deprecations/Removals:
- Remove extra dependencies to install backends from setup.py (like `[torch]` are `[tf]`, `[onnx]` and `[openvino]`).
- Remove `openvino-dev` dependency.
- Examples:
- Update ignored scope for YOLOv8 examples utilizing a subgraphs approach for OpenVINO and ONNX.
- Tutorials:
- [Post-Training Optimization of Stable Video Diffusion Model](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/stable-video-diffusion/stable-video-diffusion.ipynb)
- [Post-Training Optimization of YOLOv10 Model](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/yolov10-optimization/yolov10-optimization.ipynb)
- [Post-Training Optimization of LLaVA Next Model](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/nano-llava-multimodal-chatbot/nano-llava-multimodal-chatbot.ipynb)
- [Post-Training Optimization of S3D MIL-NCE Model](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/s3d-mil-nce-text-to-video-retrieval/s3d-mil-nce-text-to-video-retrieval.ipynb)
- [Post-Training Optimization of Stable Cascade Model](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/stable-cascade-image-generation/stable-cascade-image-generation.ipynb)
- Known issues:
- ...

Compression-aware training:

- Breaking changes:
- ...
- General:
- ...
- Features:
- (PyTorch) `nncf.quantize` method now is the recommended path for the quantization initialization for Quantization-Aware Training.
- (PyTorch) Compression modules placement in the model now can be serialized and restored with new API functions: `compressed_model.nncf.get_config()` and `nncf.torch.load_from_config`. The [documentation](/docs/usage/training_time_compression/quantization_aware_training/Usage.md#saving-and-loading-compressed-models) for the saving/loading of a quantized model is available, and Resnet18 [example](examples/quantization_aware_training/torch/resnet18) was updated to use the new API.
- Fixes:
- ...
- Improvements:
- ...
- Deprecations/Removals:
- ...
- Examples:
- [Quantization-Aware Training of STFPM PyTorch model from Anomalib](examples/quantization_aware_training/torch/anomalib)
- Tutorials:
- [Quantization-Sparsity Aware Training of PyTorch ResNet-50 Model](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/pytorch-quantization-sparsity-aware-training/pytorch-quantization-sparsity-aware-training.ipynb)
- Known issues:
- ...

## New in Release 2.10.0

Post-training Quantization:
Expand Down
Loading