Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Library Depth Normalization #15

Open
imk1 opened this issue Feb 4, 2024 · 2 comments
Open

Library Depth Normalization #15

imk1 opened this issue Feb 4, 2024 · 2 comments

Comments

@imk1
Copy link

imk1 commented Feb 4, 2024

I noticed that you have multiple options for library depth normalization, including upper quartile, total sum, and harmonic mean, and that upper quartile is the default. Can you explain my upper quartile is the default? (My intuition is that total sum better captures the library depth.) Thanks!

@talashuach
Copy link
Collaborator

Library depth effects can be seen as shifting the entire distribution of reads-per-barcode. So in the general case, an upper quartile and the total sum should return very similar correction factors.
However, if some samples have extreme outliers (which I've seen in some MPRA datasets), the total sum can be pretty sensitive to those, whereas the upper quartile is not. So by default we use an upper quartile, but total sum is supported if you know your data doesn't have this issue and prefer to use the entire sum.

Hope this helps!

@imk1
Copy link
Author

imk1 commented Feb 8, 2024

Thank you for this very helpful explanation! Upper quartile seems like it would handle extreme outliers on the low end of reads but not on the high end of reads. Is there a reason for not also eliminating the top few percent?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants