-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
different TFs are discovered from different cistarget databases #526
Comments
Hi @gilgolan73 This is expected. The precomputed database uses ENCODE SCREEN regions. These regulatory regions are based on data from a variety of cell lines. These cell lines might not be too similar to your data. For this reason I would trust the results from the custom database more. Best, Seppe |
Hello @SeppeDeWinter , Thank you, |
Hi @SeppeDeWinter , An additional question- when I look at the TF_names.txt file, I can see that some TFs are missing (in both analyses). Is it because their motif does not appear in the database? If so, can I add their motif somehow? Thank you, |
Hi @gilgolan73
Yes, this in theory possible. This would involve generating a new database containing the union of SCREEN regions (https://screen.encodeproject.org/) and you consensus peak set.
This file is generated based on your motif enrichment results. If the TF does not appear it means that no motif for your TF of interest is enriched. It is of course possible that our collection does not contain any motif for your TF of interest. For this, please check wether your TF of interest occurs on the motif-to-TF annotation table: https://resources.aertslab.org/cistarget/motif2tf/ I hope I could help with this? Best, S |
Hi,
Thanks, |
Hi <Which regions did you use from the SCREEN database? all cCREs?> <My TF of interest do appear on the motif-to-TF annotation table (motifs-v10nr_clust-nr.mgi-m0.001-o0.0.tbl). However they do not appear in my results. Do you do some filtering of these motifs when running create_cistarget_databases? Is there a way to look if their motifs appear in the cistarget databases? We don't do any filtering, but you can check using this code if you want. from ctxcore.rnkdb import FeatherRankingDatabase
db = FeatherRankingDatabase(<PATH_TO_REGIONS_V_MOTIF_DB>, name="test")
# are any motifs annotated to your TF of interest in
db.genes I think the region sets you are using for motif enrichment might not represent the state where the TF is expressed very well and that this is the reason why you don't find any of its motifs enriched. Best, Seppe |
Hello @SeppeDeWinter , Additionally, from my understanding, the region sets that I am using for motif enrichment are derived from the pycistopic analysis. Is there any parameters or steps that you recommend to change in this analysis? Thank you, |
Hi @gilgolan73 Yes, my mistake. df = db.load_full() All the best, Seppe |
Describe the bug
Hi, I am using scenicplus to find eRegulons in my sc-multiomic data (mouse).
I ran scenicplus twice, each time using a different cistarget databases:
When I looked at the results, I noticed that a lot of the TFs that were found using the custom databases, are missing in the precomputed databases-based analysis (and vice-versa). For example, for the custom-based databases , I found 100 TFs with regulons (among these, 51 were not found in the precomputed-based analysis). For the precomputed-based analysis, I found 94 TFs with regulons (among these, 46 were not found in the custom-based analysis).
These different sets of found TFs results in very different results (in terms of the eRegulons found, and the ability to perform in-silico perturbation assays for the TFs). Why do I get such different results? Is there a way to include more TFs in the custom-based analysis (which I guess is more accurate?) ?
Thank you,
Gil Golan
The text was updated successfully, but these errors were encountered: