Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Auto-Save Index Files Alongside BAMs #446

Open
wants to merge 298 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
298 commits
Select commit Hold shift + click to select a range
17b9f6f
Added VQSR to `SrWholeGenome` and `SRJointCallGVCFsWithGenomicsDB`
jonn-smith Jan 10, 2023
bb908b2
Updated to GATK 4.3.0.0
jonn-smith Jan 11, 2023
3873227
Added region annotation.
jonn-smith Jan 11, 2023
8656ced
Fixing vcf.gz typo
jonn-smith Jan 11, 2023
9827f26
Fixed index extension error.
jonn-smith Jan 11, 2023
59995ae
Updated more vcf.gz index names.
jonn-smith Jan 11, 2023
cd57bc7
Fixed variant annotation task.
jonn-smith Jan 12, 2023
cf9d2f8
Terra changed where submissions go (now at gs://<bucket>/submissions/)
SHuang-Broad Dec 13, 2022
383015c
Autobump version 3.0.57 --> 3.0.58
kvg Dec 16, 2022
c2b08c9
Workflow to perform joint calling on gVCFs with GLNexus
kvg Dec 4, 2022
f0bd376
Workflow for joint-calling gVCFs with GLNexus
kvg Dec 4, 2022
72d29d0
Increased size of disk to store intermediate files
kvg Dec 4, 2022
19cd1cb
Change location of temp directory
kvg Dec 4, 2022
18d7a8f
Separate joint calling from bgzipping and indexing for better restart…
kvg Dec 5, 2022
37e7ad0
Performance improvements
kvg Dec 8, 2022
5c57c47
Revert those changes because the amount of disk space we need is actu…
kvg Dec 9, 2022
d83058f
Now optimized to use multiple high-performance disks
kvg Dec 9, 2022
384c7d6
Create a Hail MatrixTable of the joint-called VCF
kvg Dec 12, 2022
a85213d
Some reorganization that will hopefully improve call-caching.
kvg Dec 14, 2022
912ed67
Some simplifications to Hail MT delocalization
kvg Dec 15, 2022
374768e
Changed some variable names for consistency
kvg Dec 15, 2022
7cae1ec
Temporary workflow to convert an existing GVCF to a Hail MatrixTable
kvg Dec 17, 2022
1b4e068
Allocate more disk space
kvg Dec 17, 2022
55f30ba
Allow optional specification of a reference sequence for Hail
kvg Dec 18, 2022
798481c
Debugging
kvg Dec 18, 2022
20e5d35
More debugging
kvg Dec 18, 2022
e7fde6e
Forgot to specify a disk size properly
kvg Dec 18, 2022
c1b34ce
Custom reference sequence usage needs to be specified in the hl.impor…
kvg Dec 18, 2022
f805015
Try to resolve file corruption issue by separating the joint calling …
kvg Dec 23, 2022
dba9b5c
Fixed bad temp dir specification
kvg Dec 23, 2022
c6fd504
A contig-parallelized joint-calling workflow
kvg Dec 25, 2022
311886a
Forgot to surround optional num_cpus in select_first()
kvg Dec 25, 2022
8b652d1
Allow max cpus to be configurable
kvg Dec 25, 2022
6456f1d
Increase disk size request
kvg Dec 25, 2022
277ab6e
Temporary workflow to handle some issues arising from the LRJointCall…
kvg Dec 31, 2022
57b5c94
Fixed a typo
kvg Dec 31, 2022
d70f12a
Added documentation
kvg Jan 14, 2023
79836be
Autobump version 3.0.58 --> 3.0.59
kvg Jan 14, 2023
22b249d
Added workflow to convert a BAM file to fastq files
kvg Dec 14, 2022
f884211
Many updates for short read Malaria processing.
jonn-smith Dec 15, 2022
7d8715e
Fixed genome name in Hail MT
jonn-smith Jan 14, 2023
cd97819
Added workflow to create a Zarr store
kvg Jan 15, 2023
bb09228
Merge branch 'jts_kvg_sp_malaria' of github.com:broadinstitute/long-r…
kvg Jan 15, 2023
0a78e3c
Register Zarr store workflow in Dockstore
kvg Jan 15, 2023
ec52f5b
Reordering dockstore entries
kvg Jan 15, 2023
6083586
Adding in temporary WDL to work around dockstore.
jonn-smith Jan 17, 2023
991105d
Fixing typos.
jonn-smith Jan 17, 2023
a4195d6
Updating disk space required for `ImportGVCFs`
jonn-smith Jan 17, 2023
b7bdab2
Fixing disk space issue with genomicsDB.
jonn-smith Jan 17, 2023
3716920
Fixing disk space reqs.
jonn-smith Jan 17, 2023
89ecbd3
Fixed woeful byte -> GB conversion error.
jonn-smith Jan 17, 2023
17be0ab
Fixing dupe entries in `.dockstore.yml`
jonn-smith Jan 17, 2023
dd39fdf
Adjusted disk size due to `tar` in the command block of `ImportGVCFs`
jonn-smith Jan 17, 2023
e09bf4a
Removed preemption from `ImportGVCFs`
jonn-smith Jan 17, 2023
1e15392
Updated calls to have `0` preemptable attempts (for now).
jonn-smith Jan 17, 2023
34a594c
Attempt to fix `read_int` issue.
jonn-smith Jan 17, 2023
a5818a3
Fixed issue with disk size in Joint Calling.
jonn-smith Jan 23, 2023
8f3d56c
Minor updates to variant / joint calling.
jonn-smith Feb 8, 2023
f0b9177
Adding in diagram of lrma-sp-malaria pipeline.
jonn-smith Feb 8, 2023
556abc6
Minor change to legacy monitoring script.
jonn-smith Feb 9, 2023
740cda3
Added `BenchmarkVCFs` and instrumentation to other tasks.
jonn-smith Feb 14, 2023
aa5e26f
Fixing typos.
jonn-smith Feb 14, 2023
b02e46f
Adding BenchmarkVCFs to dockstore.
jonn-smith Feb 15, 2023
530af79
Fixing output name in `SRWholeGenome.wdl`
jonn-smith Feb 15, 2023
18de47b
Minor changes to `BenchmarkVCFs`
jonn-smith Feb 15, 2023
481c007
Adding support for viewing reads in the auto-generated IGV sessions.
jonn-smith Feb 15, 2023
ae5be97
fixing typo.
jonn-smith Feb 15, 2023
08a83f1
Fixed formatting and problem with IGV xml generation.
jonn-smith Feb 16, 2023
ba5ce9f
Added `CompareVcfBenchmarks.wdl`.
jonn-smith Feb 16, 2023
0313616
Fixed file name.
jonn-smith Feb 16, 2023
08420fe
Changing default for gc plots.
jonn-smith Feb 16, 2023
4f7b975
Adding `resources.log` to outputs for monitored tasks.
jonn-smith Feb 27, 2023
f7afa73
Added instrumentation for the rest of the WDLs to check resources.
jonn-smith Feb 28, 2023
2c8eff4
Adding in docker image for malaria work.
jonn-smith Feb 28, 2023
23ea87e
Fixing requirements for barcode script.
jonn-smith Feb 28, 2023
0c64ff2
Cleaned up the barcode script a little.
jonn-smith Feb 28, 2023
4baac16
Updated barcode script to handle csv/tsv input.
jonn-smith Mar 1, 2023
8583c7f
Now it actually works for CSVs/TSVs.
jonn-smith Mar 1, 2023
e3d2a40
Minor updates.
jonn-smith Mar 1, 2023
07fe5f2
Fixed ingest of new barcode def TSV.
jonn-smith Mar 1, 2023
a4ddaa0
Adding in WDL for P. falciparum barcode panel processing.
jonn-smith Mar 2, 2023
4dc4404
Adding PanelProcessMalariaBarcodesForRh.wdl to dockstore.
jonn-smith Mar 2, 2023
d35ec00
Adding some tests for python versioning and conda env.
jonn-smith Mar 2, 2023
ca33680
Fixed conda env sourcing.
jonn-smith Mar 2, 2023
d4a4b21
Fixed orientation of input tsv.
jonn-smith Mar 2, 2023
1dc9be2
Changing Multi/Poly column name to Terra convention.
jonn-smith Mar 2, 2023
e2ede54
Fixing parsing issue with `ISO3` field.
jonn-smith Mar 2, 2023
bf6a27d
Fixing `:` -> `.` conversion.
jonn-smith Mar 2, 2023
075618b
Adding outputs to the WDL.
jonn-smith Mar 2, 2023
9d37761
Fixing outputs in the workflow.
jonn-smith Mar 2, 2023
4edd017
Fixed bug in hail.wdl, updated docs for Panel wdl.
jonn-smith Mar 7, 2023
e943ae4
Adding in images for better documentation in the barcode workspace
jonn-smith Mar 8, 2023
1e5fc54
Added in renaming the sample to the new sample name in SRWholeGenome.
jonn-smith Mar 12, 2023
fad15eb
Fixing typo
jonn-smith Mar 12, 2023
1ba674e
Adding indexing to Rename task.
jonn-smith Mar 12, 2023
5cf9bc7
Updated finalization info.
jonn-smith Mar 12, 2023
79936e0
Fixing issue.
jonn-smith Mar 12, 2023
01072f3
Fixing sample name in other haplotypecaller files.
jonn-smith Mar 12, 2023
5547bae
Fixing Barcode demo WDL.
jonn-smith Mar 23, 2023
58e9d83
Adding ProcessMalariaBarcodesDemo to dockstore.
jonn-smith Mar 23, 2023
bd34fe6
Removed finalization inputs.
jonn-smith Mar 23, 2023
310a11d
Added ability to use sites-only VCFs as truth.=
jonn-smith May 11, 2023
dbed29d
Fixing typo.
jonn-smith May 11, 2023
bc5aa50
Fixing args with `ALT` sample mode.
jonn-smith May 11, 2023
ece4d0d
Fixing a bug in final stats counting
jonn-smith May 11, 2023
4c46d07
Added step to actually remove filtered variants.
jonn-smith Jun 7, 2023
faf6f87
Fixed VQSR filtered output
jonn-smith Jun 7, 2023
51a38ce
Fixed typo in finalization.
jonn-smith Jun 7, 2023
2eebea9
Fixed a name conflict.
jonn-smith Jun 7, 2023
5ed14d8
Removing allele-specific annotations.
jonn-smith Jul 1, 2023
5f5f117
Exposed more recalibration inputs.
jonn-smith Jul 2, 2023
2687c96
Fixing type issue with VQSR priors.
jonn-smith Jul 2, 2023
3722d4b
Fixing long read joint calling wdl.
jonn-smith Jul 3, 2023
e9b393b
Fixed modified joint calling to be faster.
jonn-smith Jul 4, 2023
8194cda
Adding `SRJointCallGVCFsWithGenomicsDB` to dockstore.
jonn-smith Jul 4, 2023
cf3313d
Fixing bug.
jonn-smith Jul 4, 2023
68464a9
Parallelized the joint call. Added functional annotation.
jonn-smith Jul 5, 2023
0ae8a79
Fixing issue with reblock GVCFs
jonn-smith Jul 5, 2023
d691809
Adding Zarr conversion into SR Joint Calling WDL.
jonn-smith Jul 6, 2023
e0a077e
Removing automatic Zarr conversion
jonn-smith Jul 6, 2023
a0705be
Fixed issue with VQSR in `SRJointCallGVCFsWithGenomicsDB.wdl`
jonn-smith Jul 10, 2023
3f155dd
Removing unused parameter.
jonn-smith Jul 10, 2023
8d67991
Fixed a bug in `GatherVCFsCloud`
jonn-smith Jul 10, 2023
7ef0b11
Fixed typo in `ApplyVQSR` task.
jonn-smith Jul 10, 2023
b032152
Fixing vcf extension for some outputs.
jonn-smith Jul 10, 2023
2453418
Fixing gzip issue.
jonn-smith Jul 10, 2023
f127073
Fixed bgzip of SNPEff output.
jonn-smith Jul 11, 2023
a6f857b
Adding functional annotation docker.
jonn-smith Jul 11, 2023
d19b895
Moving snpEff to a new docker image.
jonn-smith Jul 11, 2023
a5d0513
Another fix for the bgzip issue in functional annotation.
jonn-smith Jul 12, 2023
3245b2e
Updated GatherVCFs task to emit vcf.gz files.
jonn-smith Jul 12, 2023
da809dc
Removed explicit tabix index from GatherVCFsCloud.
jonn-smith Jul 12, 2023
b5636c0
Adding debugging code.
jonn-smith Jul 12, 2023
f9efd97
Added back in the `tabix` indexing.
jonn-smith Jul 12, 2023
b9ba892
Fixing elusive typo in outputs for `GatherVCFs`
jonn-smith Jul 12, 2023
5450c71
Added output for annotated VCF.
jonn-smith Jul 13, 2023
c15514f
Fixed outputs for SRJointCallGVCFsWithGenomicsDB.wdl
jonn-smith Jul 13, 2023
a75c714
Fixing issue in SRWholeGenome caused by previous commit.
jonn-smith Jul 18, 2023
2f9994d
Fixing critical bug in `SRUtils::Bam2Fq`
jonn-smith Jul 20, 2023
b79a1fb
Fixed corner case in `Finalize::FinalizeToDir`
jonn-smith Jul 20, 2023
1f5f75c
Whitespace change to make dockstore register changes.
jonn-smith Jul 20, 2023
d62f604
Fixing issue with bash string comparisons in `Finalize::FinalizeToDir`
jonn-smith Jul 20, 2023
fd1ed84
Minor bugfix.
jonn-smith Jul 20, 2023
03232d1
Added wdl to train CNNScoreVariants.
jonn-smith Aug 1, 2023
b6c61e3
Added wdl to train CNN filters.
jonn-smith Aug 1, 2023
6a7cafb
Added in more memory for MarkDuplicates.
jonn-smith Aug 2, 2023
ddfe2a0
Fixing memory for `MarkDuplicates`.
jonn-smith Aug 2, 2023
0fa9d7f
Giving more headroom for OS processes.
jonn-smith Aug 2, 2023
2719e8d
Fixed disk space in Create2DReadTensors
jonn-smith Aug 9, 2023
743fa04
Fixing CNN training task.
jonn-smith Aug 10, 2023
87de80b
Fixing another typo.
jonn-smith Aug 10, 2023
b6aeec2
Removing preemptible from CNN Training WDL.
jonn-smith Aug 10, 2023
009c0fc
Provisioned GPUs for `TrainCNN`.
jonn-smith Aug 10, 2023
27e188b
Added log messages.
jonn-smith Aug 11, 2023
ad089bc
Updated CNN model training and haplotypecaller parameters.
jonn-smith Aug 18, 2023
4574d2e
Multiple updates.
jonn-smith Sep 5, 2023
a76a9ec
Added task to compute fingerprint of a bam file.
jonn-smith Sep 6, 2023
97356f6
Added task for `ExtractVariantAnnotations`
jonn-smith Sep 6, 2023
fc35d5a
Added task for TrainVariantAnnotationsModel
jonn-smith Sep 6, 2023
8aef65f
Intermediate commit.
jonn-smith Sep 6, 2023
d6d6219
Updates for VETS integration and file outputs.
jonn-smith Sep 8, 2023
1c51f6e
Updates for barcoding.
jonn-smith Sep 8, 2023
6892a3a
Minor efficiency cleanup.
jonn-smith Sep 8, 2023
61f31b3
Updates to functional annotation.
jonn-smith Sep 8, 2023
d41e073
Added more snpEff outputs to `SRJointCallGVCFsWithGenomicsDB`
jonn-smith Sep 11, 2023
06d4b95
Fixing the outputs for merge vcfs.
jonn-smith Sep 12, 2023
36b45cc
Fixing bug in WDL for `ExtractVariantAnnotations` and `ScoreVariantAn…
jonn-smith Sep 12, 2023
ce7d921
Fxiing another typo/.
jonn-smith Sep 13, 2023
acd1bb0
Fixed a copy/paste error for memory configuration in a few tasks.
jonn-smith Sep 13, 2023
198478f
Fixing typo in argument for `ScoreVariantAnnotations`
jonn-smith Sep 13, 2023
d921c37
Fixed typo in ScoreVariantAnnotations and ExtractVariantAnnotations.
jonn-smith Sep 13, 2023
0d837ef
Adding some debugging for outputs.
jonn-smith Sep 14, 2023
84b904e
Fixing outputs for `TrainVariantAnnotationsModel`
jonn-smith Sep 14, 2023
81d0f2e
Fixing the same output file issue.
jonn-smith Sep 14, 2023
34e74d8
Updated `ScoreVariantAnnotations` with some debugging.
jonn-smith Sep 14, 2023
3483e10
Additional debugging.
jonn-smith Sep 14, 2023
09594a1
Fixing typo in `SRWholeGenome` for VETS.
jonn-smith Sep 14, 2023
0ad718b
Fixing yet another typo in `SRWholeGenome`
jonn-smith Sep 14, 2023
3b897d6
Fixed another issue.
jonn-smith Sep 14, 2023
ef1b225
Fixing barcode extraction wdl.
jonn-smith Sep 15, 2023
5d350ea
Updated output type of barcode to be `String` instead of `File`.
jonn-smith Sep 15, 2023
c17f87d
Removing separate log file for gatk in VETS tasks.
jonn-smith Sep 15, 2023
9b7b279
Fixed output file issue.
jonn-smith Sep 18, 2023
eab15e3
Fixing issue in joint calling workflow.
jonn-smith Sep 18, 2023
bdaa117
Added flag to `SRWholeGenome` to indicate data has been processed.
jonn-smith Sep 19, 2023
fe50a5f
Changed snpEff results folder name
jonn-smith Sep 19, 2023
41e5389
Added Workflow to extract single-organism contaminants.
jonn-smith Sep 19, 2023
2c8a8fd
Fixed outputs for `RemoveSingleOrganismContamination`to be finalized.
jonn-smith Sep 19, 2023
b103772
Connected `RemoveSingleOrganismContamination` in `SRFlowcell`
jonn-smith Sep 19, 2023
fc8697f
Increasing the memory for contamination alignment
jonn-smith Sep 21, 2023
e25e272
Adding missing sort step to read decontamination workflow.
jonn-smith Sep 21, 2023
7accfa4
Fixed RemoveSingleOrganismContamination. Minor change to ReblockGVCFs
jonn-smith Sep 21, 2023
3560ca4
Fixing issue in `SRFlowcell` caused by other chantges.
jonn-smith Sep 21, 2023
633d744
Removed decontamination task. Added AlignedMetrics.
jonn-smith Sep 25, 2023
10d26d7
Added WDL to extract reads from amplicon regions.
jonn-smith Oct 8, 2023
6ebfbe5
Added new diagram of high-level view of pipeline.
jonn-smith Oct 9, 2023
809628c
Adding in workspace markdown documentation.
jonn-smith Oct 9, 2023
4f9db3b
Updated workspace markdown doc.
jonn-smith Oct 9, 2023
83f77f4
Updated dockstore; Added `PfalciparumDrugResistanceSummary.wdl`
jonn-smith Oct 9, 2023
52c0322
Fixing bug in PfalciparumDrugResistanceSummary.wdl
jonn-smith Oct 9, 2023
27e55ba
Fixing typo.
jonn-smith Oct 9, 2023
2fc6de2
Adding docker file and make file for Karamoko Niare's pipeline.
jonn-smith Nov 13, 2023
1719798
Fixed bugs in `sr-malaria-niare-pipeline` dockerfile.
jonn-smith Nov 13, 2023
bb42948
Exposed het rate parameters.
jonn-smith Nov 17, 2023
d9a44b9
Adding debug info to BenchmarkVCFs.wdl
jonn-smith Nov 27, 2023
e32dad7
Commenting out debugging code.
jonn-smith Nov 27, 2023
7c359bf
Removed debugging code.
jonn-smith Nov 27, 2023
c39eb09
Added in WDLs to reproduce Niare et al. GATK4 pipeline.
jonn-smith Nov 30, 2023
0cc4abd
Renaming a wdl.
jonn-smith Nov 30, 2023
b48b669
Adding Niare et al. WDLs to dockstore.
jonn-smith Nov 30, 2023
061204d
Removed manual monitoring instrumentation.
jonn-smith Nov 30, 2023
0cfad4a
Fixing typo.
jonn-smith Nov 30, 2023
fa0a79d
Fixing bugs in `sr-malaria-niare-pipeline` dockerfile.
jonn-smith Nov 13, 2023
3ff4efa
Cleaned up docker image for niare paper.
jonn-smith Dec 1, 2023
d1bb297
increasing resources for niare hc
jonn-smith Dec 2, 2023
46c8aa3
Fixing missing parameters in niare wdls
jonn-smith Dec 2, 2023
dc9b902
Removed R script from niare haplotypecaller.
jonn-smith Dec 3, 2023
bff5cdf
Fixing outputs in Niare VETS workflow.
jonn-smith Dec 3, 2023
28cde27
Removed R script output for niare vqsr
jonn-smith Dec 3, 2023
f946425
Fixing syntax error.
jonn-smith Dec 4, 2023
54c6fef
Fixing logic error in Niare VQSR pipeline.
jonn-smith Dec 4, 2023
932c3bc
Fixing issue where variants in Niare pipeline would be duplicated.
jonn-smith Dec 4, 2023
3429644
Fixing bad input in Niare workflows.
jonn-smith Dec 4, 2023
00f01c1
Fixing file / string mixup in Niare HC.
jonn-smith Dec 5, 2023
94a7bbe
Fixing mito contig name in Niare VQSR workflow.
jonn-smith Dec 6, 2023
39fa72a
Removing contig sharding from VQSR in niare to fix variance issue.
jonn-smith Dec 7, 2023
4d617f5
Fixing yet another issue in the vqsr stuff.
jonn-smith Dec 7, 2023
4ae002b
MAde BQSR optional.
jonn-smith Dec 7, 2023
ae5a5e6
Adding new WDL for expanded drug resistance markers.
jonn-smith Dec 12, 2023
22a45e0
Adding ExpandedDrugResistanceMarkerExtraction to dockstore.
jonn-smith Dec 12, 2023
af056ba
Fixed issue in ExpandedDrugResistanceMarkerExtraction.wdl
jonn-smith Dec 12, 2023
5417d9c
Adding genotype info to drug resistance extraction.
jonn-smith Dec 12, 2023
62ccda9
Adding ExpandedDrugResistanceMarkerAggregation.wdl
jonn-smith Dec 12, 2023
7a7b1c5
Fixing het value in aggregation table.
jonn-smith Dec 12, 2023
32b6ae8
Changing output format for ExpandedDrugResistanceMarkerExtraction.wdl
jonn-smith Dec 12, 2023
a9899b8
Updating format for ExpandedDrugResistanceMarkerExtraction.wdl again
jonn-smith Dec 12, 2023
3488be2
Fixed ExpandedDrugResistanceMarkerAggregation.wdl for new formats.
jonn-smith Dec 12, 2023
b049ce4
Fixing bug in ExpandedDrugResistanceMarkerAggregation.wdl
jonn-smith Dec 12, 2023
31d4921
Fixed a bug.
jonn-smith Dec 12, 2023
903276b
Fixing another bug in ExpandedDrugResistanceMarkerAggregation.wdl
jonn-smith Dec 13, 2023
64fed4f
Now expanded drug reports contain synonymous mutations as well.
jonn-smith Dec 14, 2023
25f8e5d
Changed how file localization happens in ExpandedDrugResistanceMarker…
jonn-smith Dec 14, 2023
583d6d8
Fixing a typo in ExpandedDrugResistanceMarkerAggregation.wdl
jonn-smith Dec 14, 2023
39d70bd
Fixing trailing tab character.
jonn-smith Dec 14, 2023
22ba3ca
Final cleanup commit.
jonn-smith Jan 10, 2024
b4adfaa
Deprecating this branch.
jonn-smith Jan 10, 2024
fc35a15
Fixed an incorrect filename specification
kvg Feb 28, 2024
04db875
Streamlined BAM Index File Storage
shadizaheri Apr 11, 2024
64571bb
Streamlined BAM Index File Storage (added the modified SRIndexBam wor…
shadizaheri Apr 11, 2024
84e00ce
Update Utils.wdl
shadizaheri Apr 11, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
88 changes: 88 additions & 0 deletions .dockstore.yml
Original file line number Diff line number Diff line change
Expand Up @@ -108,3 +108,91 @@ workflows:
subclass: wdl
primaryDescriptorPath: /wdl/PBMASIsoSeqDemultiplex.wdl
testParameterFiles:
- name: SRBamToFq
subclass: wdl
primaryDescriptorPath: /wdl/SRBamToFq.wdl
testParameterFiles:
- name: SRIndexBam
subclass: wdl
primaryDescriptorPath: /wdl/SRIndexBam.wdl
testParameterFiles:
- name: SRWholeGenome
subclass: wdl
primaryDescriptorPath: /wdl/SRWholeGenome.wdl
testParameterFiles:
- name: SRFlowcell
subclass: wdl
primaryDescriptorPath: /wdl/SRFlowcell.wdl
testParameterFiles:
- name: SRJointCallGVCFsWithGenomicsDB
subclass: wdl
primaryDescriptorPath: /wdl/SRJointCallGVCFsWithGenomicsDB.wdl
testParameterFiles:
- name: LRJointCallGVCFsWithGenomicsDB
subclass: wdl
primaryDescriptorPath: /wdl/LRJointCallGVCFsWithGenomicsDB.wdl
testParameterFiles:
- name: LRJointCallGVCFs
subclass: wdl
primaryDescriptorPath: /wdl/LRJointCallGVCFs.wdl
testParameterFiles:
- name: ConvertToHailMT
subclass: wdl
primaryDescriptorPath: /wdl/ConvertToHailMT.wdl
testParameterFiles:
- name: ConvertToZarrStore
subclass: wdl
primaryDescriptorPath: /wdl/ConvertToZarrStore.wdl
testParameterFiles:
- name: LRConvertBCF
subclass: wdl
primaryDescriptorPath: /wdl/LRConvertBCF.wdl
testParameterFiles:
- name: BenchmarkVCFs
subclass: wdl
primaryDescriptorPath: /wdl/BenchmarkVCFs.wdl
testParameterFiles:
- name: CompareVcfBenchmarks
subclass: wdl
primaryDescriptorPath: /wdl/CompareVcfBenchmarks.wdl
testParameterFiles:
- name: PanelProcessMalariaBarcodesForRh
subclass: wdl
primaryDescriptorPath: /wdl/PanelProcessMalariaBarcodesForRh.wdl
testParameterFiles:
- name: ProcessMalariaBarcodesDemo
subclass: wdl
primaryDescriptorPath: /wdl/ProcessMalariaBarcodesDemo.wdl
testParameterFiles:
- name: ExtractRegionsFromBam
subclass: wdl
primaryDescriptorPath: /wdl/ExtractRegionsFromBam.wdl
testParameterFiles:
- name: PfalciparumDrugResistanceSummary
subclass: wdl
primaryDescriptorPath: /wdl/PfalciparumDrugResistanceSummary.wdl
testParameterFiles:
- name: SRWholeGenome_Pf_Niare_VQSR
subclass: wdl
primaryDescriptorPath: /wdl/SRWholeGenome_Pf_Niare_VQSR.wdl
testParameterFiles:
- name: SRWholeGenome_Pf_Niare_VETS
subclass: wdl
primaryDescriptorPath: /wdl/SRWholeGenome_Pf_Niare_VETS.wdl
testParameterFiles:
- name: SRJointCallGVCFsWithGenomicsDB_Pf_Niare_VQSR
subclass: wdl
primaryDescriptorPath: /wdl/SRJointCallGVCFsWithGenomicsDB_Pf_Niare_VQSR.wdl
testParameterFiles:
- name: SRJointCallGVCFsWithGenomicsDB_Pf_Niare_VETS
subclass: wdl
primaryDescriptorPath: /wdl/SRJointCallGVCFsWithGenomicsDB_Pf_Niare_VETS.wdl
testParameterFiles:
- name: ExpandedDrugResistanceMarkerExtraction
subclass: wdl
primaryDescriptorPath: /wdl/ExpandedDrugResistanceMarkerExtraction.wdl
testParameterFiles:
- name: ExpandedDrugResistanceMarkerAggregation
subclass: wdl
primaryDescriptorPath: /wdl/ExpandedDrugResistanceMarkerAggregation.wdl
testParameterFiles:
7 changes: 7 additions & 0 deletions BRANCH_IS_DEPRECATED.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
THIS BRANCH IS DEPRECATED.

AS OF 2024 Jan 09, ALL CODE IN THIS BRANCH HAS BEEN MERGED TO MAIN.

THIS BRANCH SHALL CONTINUE TO EXIST AS A REFERENCE UNTIL
JONN IS BACK FROM PATERNITY LEAVE.

2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
[![Generic badge](https://img.shields.io/badge/version-3.0.57-blue.svg)](https://shields.io/)
[![Generic badge](https://img.shields.io/badge/version-3.0.59-blue.svg)](https://shields.io/)
![CI/CD](https://github.com/broadinstitute/long-read-pipelines/workflows/CI/CD/badge.svg)
![Nightly](https://github.com/broadinstitute/long-read-pipelines/workflows/Nightly/badge.svg)

Expand Down
2 changes: 1 addition & 1 deletion VERSION
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@

version=3.0.57
version=3.0.59
70 changes: 70 additions & 0 deletions docker/lr-functional-annotation/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,70 @@
# Start with a good base python3 image:
FROM ubuntu:20.04
MAINTAINER Jonn Smith <[email protected]>

# Make sure we don't need to interact with any package installations:
ARG DEBIAN_FRONTEND=noninteractive

# Set the working directory to /
WORKDIR /

################################################################################
# Install system packages:

# Make sure we can actually talk to package repos:
RUN apt-get update
RUN apt-get install -y apt-utils
RUN apt-get -y upgrade

RUN apt-get update && apt-get -y upgrade

# Development / prereqs for building software:
# Utilities / tools:
# Get libcurses:
# install gsutil requirements:
RUN apt-get -y install make gcc g++ autoconf sudo && \
apt-get -y install git bash vim time bc sed perl wget curl bzip2 man unzip && \
apt-get -y install liblzma-dev libbz2-dev libncurses5-dev libncursesw5-dev && \
apt-get --allow-releaseinfo-change update && \
apt install -y curl git-lfs time datamash

# Install python:
RUN apt-get install -y python python3 python3-pip

# Setup crcmodc for gsutil:
RUN apt-get install -y gcc python3-dev python3-setuptools && \
pip3 uninstall -y crcmod && \
pip3 install --no-cache-dir -U crcmod

# install gsutil:
RUN curl https://sdk.cloud.google.com | bash

# Get Java:
# Install OpenJDK-8
RUN apt-get update && \
apt-get install -y openjdk-11-jdk && \
apt-get install -y ant && \
apt-get clean

# Fix certificate issues
RUN apt-get update && \
apt-get install ca-certificates-java && \
apt-get clean && \
update-ca-certificates -f

# Setup JAVA_HOME -- useful for docker commandline
ENV JAVA_HOME /usr/lib/jvm/java-11-openjdk-amd64/
RUN export JAVA_HOME

# Get snPEff
RUN wget https://snpeff.blob.core.windows.net/versions/snpEff_latest_core.zip && \
unzip snpEff_latest_core.zip

# Get tabix (for bgzip):
RUN apt-get install -y tabix

################################################################################
# Final runtime configuration:
# Let's start at the root:
WORKDIR /

17 changes: 17 additions & 0 deletions docker/lr-functional-annotation/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
IMAGE_NAME = lr-functional-annotation
VERSION = 0.0.1

TAG1 = us.gcr.io/broad-dsp-lrma/$(IMAGE_NAME):$(VERSION)
TAG2 = us.gcr.io/broad-dsp-lrma/$(IMAGE_NAME):latest

all: | build push

build:
docker build -t $(TAG1) -t $(TAG2) .

build_no_cache:
docker build --no-cache -t $(TAG1) -t $(TAG2) .

push:
docker push $(TAG1)
docker push $(TAG2)
27 changes: 27 additions & 0 deletions docker/lr-malaria/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
FROM continuumio/miniconda3:22.11.1

MAINTAINER Jonn Smith

# copy other resources
COPY ./environment.yml /

# install conda packages
RUN conda env create -f /environment.yml && conda clean -a
ENV PATH=/root/google-cloud-sdk/bin/:${PATH}

# install gsutil
RUN apt-get --allow-releaseinfo-change update
RUN apt install -y curl git-lfs time datamash
RUN curl https://sdk.cloud.google.com | bash

# Setup crcmodc for gsutil:
RUN apt-get install -y gcc python3-dev python3-setuptools && \
pip3 uninstall -y crcmod && \
pip3 install --no-cache-dir -U crcmod

# copy python scripts
COPY python/* /python_scripts/
RUN chmod +x /python_scripts/*

# activate conda environment
RUN echo "source activate lr-malaria" > ~/.bashrc
17 changes: 17 additions & 0 deletions docker/lr-malaria/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
IMAGE_NAME = lr-malaria
VERSION = 0.0.1

TAG1 = us.gcr.io/broad-dsp-lrma/$(IMAGE_NAME):$(VERSION)
TAG2 = us.gcr.io/broad-dsp-lrma/$(IMAGE_NAME):latest

all: | build push

build:
docker build -t $(TAG1) -t $(TAG2) .

build_no_cache:
docker build --no-cache -t $(TAG1) -t $(TAG2) .

push:
docker push $(TAG1)
docker push $(TAG2)
22 changes: 22 additions & 0 deletions docker/lr-malaria/environment.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
name: lr-malaria
channels:
- default
- bioconda
- anaconda
dependencies:
- python=3.6
- cython
- pip
- pip:
- numpy
- scipy
- pandas
- seaborn
- statsmodels
- networkx
- matplotlib
- patsy
- openpyxl
- scikit-learn
- numpyencoder
- tqdm
Loading
Loading