Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Low purity but high VAF #26

Closed
yavorska opened this issue Jul 20, 2023 · 10 comments
Closed

Low purity but high VAF #26

yavorska opened this issue Jul 20, 2023 · 10 comments
Assignees
Labels
help wanted Extra attention is needed

Comments

@yavorska
Copy link

Hi there,

We were hoping to use CNAqc to detect errors in some of our purity estimates (particularly for low TMB cases). We've found a few samples where analyze_peaks() passes with a low estimated purity (~0.2).

However, based on the VAF plot this seems unlikely (there's a clear peak close to 0.5 in diploid regions suggesting a much higher purity).

Are there any settings that could be modified to change this behaviour? I've tried using the "rightmost" method but that seems to cause pretty much every estimate to fail.

Thank you!

image
@caravagn
Copy link
Collaborator

caravagn commented Jul 23, 2023

Hi, I see your panel D. I agree that it should be possible to select the 50% peak. Likely it's matter of tuning some peak-detection parameters, the defaults are set to work well with higher TMB and maybe this is poor for your specific case.

We can try to help you find better params, but it would help us to work on your VAFs. Is there any way you can share a synthetic copy of data from panel D? For instance, you can make up nucleotide positions, ref/alt etc -- we just need the same read count values.

@caravagn caravagn self-assigned this Jul 23, 2023
@caravagn caravagn added the help wanted Extra attention is needed label Jul 23, 2023
@yavorska
Copy link
Author

Hi @caravagn,
That would be great! At least one such sample is available in the research environment (https://www.genomicsengland.co.uk/research/research-environment). If you raise a GEL service desk ticket, I can point you to it, feel free to tag me directly, @Olena Yavorska.

@caravagn
Copy link
Collaborator

@AliceAntonello can you do open the ticket following @yavorska?

@AliceAntonello
Copy link
Contributor

Hi @yavorska, I found the sample you referred to in the research environment, could you share with me the code you used to produce the CNAqc object, so that I follow the same steps as you?

@yavorska
Copy link
Author

Hi @AliceAntonello, I emailed the details a while back to the account that was used to generate the SD ticket. Can you confirm if you got them?

@AliceAntonello
Copy link
Contributor

AliceAntonello commented Sep 19, 2023 via email

@caravagn
Copy link
Collaborator

Has this been resolved? If yes, @AliceAntonello you should close the issue.

@AliceAntonello
Copy link
Contributor

AliceAntonello commented Oct 14, 2023 via email

@caravagn
Copy link
Collaborator

I think in general it is hard to believe that the peak close to 0.5 is actually a clonal peak. If these are WGS samples it seems plausible to find hundreds of clonal mutations, and not 3.

What tumours were this @yavorska ?

@yavorska
Copy link
Author

Hi @caravagn, it's an ALL.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

3 participants