Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

short-read sequence assemblies for variation detection #1556

Open
C-YONG opened this issue Dec 3, 2024 · 2 comments
Open

short-read sequence assemblies for variation detection #1556

C-YONG opened this issue Dec 3, 2024 · 2 comments

Comments

@C-YONG
Copy link

C-YONG commented Dec 3, 2024

Hello Developers,

I've successfully built a pan-genome consisting of 7 fugu genomes using MC without any problems. Thank you for your kind descriptions on GitHub. Now, I'm planning to find more SVs by adding short-read sequence data.
Should I perform haplotype sampling or allele frequency filtering?
Should I just use CILP filtering?
Thank you for your support, and I look forward to hearing from you soon!

Best Regards

@glennhickey
Copy link
Collaborator

As shown here, haplotype sampling (--haplo) gives better performance than allele frequency filtering.

So my suggestion is to use the haplotype filtering, but since we only tested on human, I can't guarantee that it will outperform frequency filtering --filter 2 --giraffe on fugu.

@C-YONG
Copy link
Author

C-YONG commented Dec 4, 2024

Hello! Based on the content from the VG Giraffe best practices document :"With a small number of haplotypes (e.g. 10), the default graph is usually a good choice."
Does this mean we don’t need filtering(d2) or haplotype sampling?
I would like to understand whether such filtering or haplotype sampling is necessary for graphs constructed with a small number of genomes. For such simple graphs, is the default already sufficient?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants