Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use relative abundance table for diversity statistics #12

Open
Dahn-YoungDong opened this issue Apr 15, 2022 · 2 comments
Open

Use relative abundance table for diversity statistics #12

Dahn-YoungDong opened this issue Apr 15, 2022 · 2 comments

Comments

@Dahn-YoungDong
Copy link

Hello in your tutorial, you said "Alpha diversity measures the diversity within a single sample and is generally based on the number and relative abundance of taxa at some rank (e.g. species or OTUs). Beta diversity also uses the number of relative abundance of taxa at some rank, but measures variation between samples."

I have relative abundance output from Metaphlan 3. But in some other forums, it advises against using relative abundance. Especially, in your tutorial data, it is rarefied OTU count, not relative abundance. Do you think that makes a difference?

@Dahn-YoungDong
Copy link
Author

@zachary-foster

@zachary-foster
Copy link
Contributor

The term "relative abundance" is being used differently in the two contexts. In:

Alpha diversity measures the diversity within a single sample and is generally based on the number and relative abundance of taxa at some rank (e.g. species or OTUs). Beta diversity also uses the number of relative abundance of taxa at some rank, but measures variation between samples.

I am talking about how metrics take into account both the number of species as well as their abundance. The term "relative" here is used to emphases that differences in abundance, even if the same species are present, influence the diversity metric. Whether the abundance measure is read counts or proportions is not important in this context. the word "relative" could be removed and it would have the same meaning.

In the other context, "relative abundance" means converting the read count to proportions. For some metrics that assume unfiltered read count data (e.g. Chao1), proportions cant be used. Most other metrics can use proportions, but simply converting to proportions does not account for the effect of differences in sampling depth. Therefore, the same community sequenced with more reads will have a higher diversity metric using the raw read counts or proportions. The purpose of rarefaction is to avoid this bias. You could also set any proportions to 0 that are less than 1 / (the minimum read depth for a sample)

Sorry if that is confusing. Let me know if it does not make sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants