Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use with outputs of scATAC-pro #20

Open
jonhsussman opened this issue Aug 21, 2022 · 2 comments
Open

Use with outputs of scATAC-pro #20

jonhsussman opened this issue Aug 21, 2022 · 2 comments

Comments

@jonhsussman
Copy link

Hello,

I am wondering whether there is a straightforward way to use AMULET with the output of scATAC-pro. The vignettes seem to require the outputs from cellranger which conveniently produce a csv file. scATAC-produces cell_barcodes.bam file, is there a way to convert this to csv file to be in a form to work with AMULET.

Thanks,
Jonathan

@ajt986
Copy link
Member

ajt986 commented Aug 22, 2022

Hi Jonathan,

For AMULET, you just need to provide the fragments (either the .bam file or .txt/tsv.gz file in the same format as the fragments.tsv.gz file from CellRanger and the CSV file with the barcodes. The CSV file can be whatever you want as long as it has a header and the following columns: 1) barcode 2) is__cell_barcode . The 'is__cell_barcode' column is essentially just a column where it's 1 if the barcode corresponds to a barcode used in the analysis and anything else (e.g., 0) if it's not to be included. The python reader just looks for these column names to identify them. For the bam file reader, you just need to provide the column indices using '--cellidx' and '--iscellidx'. For example, if you have a csv file where the first column is your 'barcode' and the second is the 'is__cell_barcode', you would add --cellidx 0 --iscellidx 1.

You'll need to check what's available from scATAC-pro if you need to convert the fragment/bam files. It is more involved but essentially the bam file is just a typic paired-end ATAC-seq bam file with an additional attribute that stores the barcode. The default is "CB" from cellranger, and this is what AMULET looks for. If there's a different attribute name, you can specify with the --bambc option. The fragment file just needs to be the same format as specified here: https://support.10xgenomics.com/single-cell-atac/software/pipelines/latest/output/fragments

As long as the barcodes match between the fragment/bam file and the csv file, AMULET should be good to go.

Best,
Asa

@jonhsussman
Copy link
Author

Hi Asa,

Thank you for your suggestions here, I will look into these comments carefully.

--Jonathan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants