
reduce RAM update parquet by beam. #33

Open · h-a-graham opened this issue Sep 23, 2024 · 0 comments
Labels: enhancement (New feature or request)

Comments

@h-a-graham (Contributor) commented:
At the moment, memory usage is quite high because we load the entire granule (i.e. every beam in the granule) into memory. Instead, we could save each beam to the parquet file with an update; that way, memory usage should be roughly 1/8th of what it is now. Not essential immediately, but I can see this being useful when working with direct access. Imagine being able to use very cheap compute with minimal memory to transfer the HDF files to parquet in another S3 bucket - that could be a very low-cost way to enhance GEDI access...

@h-a-graham h-a-graham added the enhancement New feature or request label Sep 23, 2024