- Exon-Level Quantification: It represents genes as graphs, where nodes are exons and edges are junction reads, capturing detailed transcriptomic information at the exon level.
- Better Cell Embedding: DOLPHIN leverages exon and junction read data to significantly improve the accuracy of cell embeddings, providing better resolution and resulting in more precise, biologically meaningful cell clusters compared to conventional gene-count based approaches.
- Enhanced Alternative Splicing Detection: By aggregating exon and junction reads from neighboring cells, DOLPHIN significantly enhances the detection of alternative splicing events, providing deeper insights into cell-specific splicing patterns.
- Superior Performance in Downstream Analysis: DOLPHIN consistently outperforms conventional gene-count methods in multiple downstream tasks, including the identification of differential exon markers and alternative splicing events. This high-resolution approach allows DOLPHIN to uncover biologically significant exon markers that are often missed by traditional methods.
Installing DOLPHIN directly from GitHub ensures you have the latest version. (Please install directly from GitHub to use the provided Jupyter notebooks for tutorials)
git clone https://github.com/mcgilldinglab/DOLPHIN.git
cd DOLPHIN
Creating and Activating the Conda Environment
conda env create -f environment.yaml
conda activate DOLPHIN
Installing the DOLPHIN Package
- Standard Installation
pip install .
- Developer Mode Installation
pip install -e .
Validate That DOLPHIN Is Successfully Installed
import DOLPHIN
-
First, generate the exon-level reference GTF file by following the instructions in the exon_gtf_generation tutorial.
-
Then, use the following tutorials to align the raw RNA-seq data and generate exon read counts and junction read counts:
-
For Full-length scRNA-seq, refer to the Full-length scRNA-seq tutorial.
-
For 10X RNA-seq, refer to the 10X tutorial.
-
-
After aligning the RNA-seq data, generate the feature matrix and adjacency matrix using the provided methods in the tutorial.
DOLPHIN Training and Cell Embedding
You can download the processed dataset from here and follow the example to run the model.
For a detailed tutorial on cell aggregation, please refer to the Cell Aggregation Tutorial.
-
Detecting Alternative Splicing using Outrigger: To detect alternative splicing events, please follow the Alternative Splicing Detection Tutorial.
-
Alternative Splicing Analysis: This section explains the alternative splicing analysis performed as described in the manuscript. For a detailed tutorial, please refer to the Alternative Splicing Analysis.
For a detailed walkthrough of the exon-level differential gene analysis, please follow this tutorial.
If you find the tool is useful to your study, please consider citing the DOLPHIN manuscript.