-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Rebuild a fits manifest from an HSC data directory and speedup HSC da…
…ta loading (#115) # Rebuild a fits manifest from an HSC data directory. * Added a new verb rebuild_manifest * When run with the HSC dataset class this verb will: 0) Scan the data directory and ingest HSC cutout files 1) Read in the original catalog file configured for download for metadata 2) Write out rebuilt_manifest.fits in the data directory * Fixed up config resolution so that fibad_config.toml in the cwd works again for CLI invocations. * Adding progressive logging for long steps. * Rebuild command will never open or use the manifest file in the data directory because the assumption is that file is corrupt. # Speeding up HSC Data loading * Parallelizing _scan_file_dimensions() Using Schwimmbad and multiprocessing to parallelize extracting the dimensions of files in HSCDataSet to effect speedup of 124x on 10M+ file datasets. * Added progressive log entries for HSCDataSet file scan * Use manifest by default when no filter_catalog provided. This skips the file scan on large datasets * Choose number of processes in a way that doesn't run afoul of system limits Co-authored-by: Drew Oldag <[email protected]>
- Loading branch information
Showing
7 changed files
with
345 additions
and
67 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.