You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I wonder if it's possible to support sparse array to some degree in Labels. If it's a mask then the sparse format could save quite a lot of disk space.
For an alternative approach, available already today, I suggest to convert the labels to collections of multipolygons (="shapes" object, represented as a geopandas.GeoDataFrame). You can use the functions to_polygons() and rasterize() from spatialdata for the conversions).
For a long term approach, I kindly ask you to open the discussion also in the ngff repo (same link as above).
We believe that standardization of file formats will make it easier in the long term to enabled cross-interoperable workflows. Also, vendors would have an incentive in producing directly data in a standard format.
Currently, we are still not 100% NGFF compliant because we needed some additional features, but we are working on either contributing some of our ideas to NGFF, either clearly defining in a specification document how we complement missing features from NGFF using external technologies (e.g. GeoParquet).
Back to your question. I think that having sparse arrays in labels would benefit not only the spatial omics community but also the bioimaging community, so I would definitely consider starting the discussion within NGFF, where it can have more visibility.
I wonder if it's possible to support sparse array to some degree in Labels. If it's a mask then the sparse format could save quite a lot of disk space.
Sparse array status in xarray: pydata/xarray#3213
Documentation: https://docs.xarray.dev/en/latest/user-guide/duckarrays.html
The text was updated successfully, but these errors were encountered: