Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Circular dna overlap region #2361

Open
hyeong8984 opened this issue Jan 10, 2025 · 1 comment
Open

Circular dna overlap region #2361

hyeong8984 opened this issue Jan 10, 2025 · 1 comment

Comments

@hyeong8984
Copy link

Hi,
I tried to assemble circular DNA with hifi reads, and made it.

I found overlapping region of contig with width about 22 kbp using nucmer. (total contig length = 878,387bp)

My question is that, why this overlapping region is made in specific region?
How canu define the seed reads to start assembly and end..?
I have been studying canu but cannot find it well.

Thank you for your time, and I look forward to your response.

@skoren
Copy link
Member

skoren commented Jan 10, 2025

Circular molecules typically generate self-similar sequences at the start and end, see https://canu.readthedocs.io/en/latest/faq.html#my-circular-element-is-duplicated-has-overlap. This is because the start of the molecule is arbitrarily defined (there's no start/end on a circle) and reads are not split so the end read can repeat the start sequence again. Typically, canu will mark these as circular in the header and provide trim coordinates to remove the redundancy.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants