Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gene omission on capture or study level #12

Open
derrik-gratz opened this issue May 16, 2024 · 3 comments
Open

Gene omission on capture or study level #12

derrik-gratz opened this issue May 16, 2024 · 3 comments

Comments

@derrik-gratz
Copy link
Collaborator

Workflow currently includes omitting genes found in less than 50 cells, but this is done on a per capture level. This causes some genes to be omitted from captures and present in others, which may affect DGE across conditions. Granted, these genes probably don't have much data to work with, but it would still be better to avoid this just in case

@micahpf
Copy link
Contributor

micahpf commented May 16, 2024

I don't think this is a problem for our pseudobulk workflow though, since we're running edgeR::filterByExpr anyway, which flags low/no expression genes in each sample for removal from the model fitting procedure.

@derrik-gratz
Copy link
Collaborator Author

I agree that it shouldn't affect pseudo bulk, but I'm not sure if that is our 'defacto' workflow yet. I could see a niche scenario where it informs clustering in some way.

I just implemented it to the package (not pushed yet), do you think there'd be a reason to NOT do it? E.g. information loss?

@micahpf
Copy link
Contributor

micahpf commented May 16, 2024

I'm not sure. Maybe you could just open a draft pull request while we do some more digging?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants