-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Intelligent gene name scraping #46
Comments
Thinking about the
Also, I remember having an issue with annotables (a lot of histone genes suddenly disappeared between versions). I will file an issue on their github just to be safe this will be working. Otherwise it might be best to save a version of the |
|
This package should have functions that will help with this |
Ok, this is implemented and I've done some basic testing locally (though more probably needed). One note: I needed the grch38 annotables annotation, which I didn't manage to import because it is a dataframe, so I've just saved in and committed it. This might be better anyway because then we will not be reliant on any updates they make to their package, but happy to change if you have a better idea. |
Yes like the idea of saving locally if it can reduce dependencies (also reduces potential for things to break later) |
Potential issue: the |
Currently cytosel assumes that
rownames(sce)
are gene symbols (note: user doesn't necessarily uploadSingleCellExperiment
, could beSeurat
orAnnData
).However, the rownames could be ensembl/entrez IDs, maybe the symbols are in
colData(sce)
but maybe they're not.Can we write a function called
parse_rownames
that:rownames(sce)
are symbolscolData(sce)
that might be symbols it could userownames(sce)
are - if they're ensembl/entrez, convert them to symbol (use annotables). Otherwise throw an error (dialog)Two notes:
scuttle
that does thisThe text was updated successfully, but these errors were encountered: