Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GPU] Fix kernels synchronization in PagedAttention operation #28645

Merged

Conversation

sshlyapn
Copy link
Contributor

@sshlyapn sshlyapn commented Jan 23, 2025

Details:

  • Fix synchronization in PagedAttention operation when KV-cache rotation is enabled but skipped for the current iteration.
    Previously, dep_events was always replaced with res_events if has_rotated_blocks=true, leading to empty events vector being passed to the next kernels and causing accuracy deviations in cases of out_of_order queue due to missing synchronization

@sshlyapn sshlyapn added the category: GPU OpenVINO GPU plugin label Jan 23, 2025
@sshlyapn sshlyapn added this to the 2025.1 milestone Jan 23, 2025
@sshlyapn sshlyapn requested review from a team as code owners January 23, 2025 14:09
@sshlyapn sshlyapn changed the title [GPU] Fix synchronization in PagedAttention kernel [GPU] Fix kernels synchronization in PagedAttention operation Jan 23, 2025
…otation is enabled but skipped for the current iteration
@michal-miotk
Copy link
Contributor

LGTM

@p-durandin p-durandin added this pull request to the merge queue Jan 23, 2025
github-merge-queue bot pushed a commit that referenced this pull request Jan 23, 2025
### Details:
- Fix synchronization in PagedAttention operation when KV-cache rotation
is enabled but skipped for the current iteration.
Previously, `dep_events` was always replaced with `res_events` if
`has_rotated_blocks=true`, leading to empty events vector being passed
to the next kernels and causing accuracy deviations in cases of
out_of_order queue due to missing synchronization
 - PR to master: #28645
Merged via the queue into openvinotoolkit:master with commit 485833c Jan 23, 2025
166 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: GPU OpenVINO GPU plugin
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants