Allow to save recordingless analyzer as #3443

alejoe91 · 2024-09-25T17:23:28Z

This PR allows to save an analyzer as binary or zarr even if recordingless

chrishalcrow · 2024-09-26T06:54:24Z

Hello, if I've already saved a sorting_analyzer in my_sa and then run the following code:

import spikeinterface.full as si
sa = si.load_sorting_analyzer("my_sa")
sa._recording = None
sa_zarr = sa.save_as(format="zarr", folder="my_zarr_sa")

Then the saved zarr folder has the correct folder structure but all the files are things like called things like 0 and 0.0 and are a few bytes big. If you save sa_zarr as a binary folder, it works. So something is wrong with the zarr saving.

alejoe91 · 2024-09-26T07:08:58Z

Those are the name of zarr chunks :) The recording will be saved as an empty dict, so it makes sense that it's few bytes. everything else should be ok (but I'll double check)

chrishalcrow · 2024-09-26T07:19:55Z

Those are the name of zarr chunks :) The recording will be saved as an empty dict, so it makes sense that it's few bytes. everything else should be ok (but I'll double check)

Oh right! ~~Everything is named like this, even e.g. sorting_provenance?~~
EDIT: ok I've read about zarr, cool!

alejoe91 · 2024-09-26T07:45:36Z

https://probeinterface.readthedocs.io/en/main/releases/0.2.18.html

Those are the name of zarr chunks :) The recording will be saved as an empty dict, so it makes sense that it's few bytes. everything else should be ok (but I'll double check)

Oh right! ~~Everything is named like this, even e.g. sorting_provenance?~~ EDIT: ok I've read about zarr, cool!

If you're interested, you can check the hidden files. You should have a .zarray describing the properties of the datasets and a .zattrs which is a dictionary with additional metadata ;)

chrishalcrow · 2024-09-26T07:55:44Z

src/spikeinterface/core/sortinganalyzer.py

@@ -2015,7 +2023,10 @@ def copy(self, new_sorting_analyzer, unit_ids=None):
            new_extension.data = self.data
        else:
            new_extension.data = self._select_extension_data(unit_ids)
-        new_extension.run_info = self.run_info.copy()
+        if self.run_info is not None:


If you import copy from copy you can use copy(self.run_info) which returns None if self.run_info is None, which would save some lines of code and indentations here and at line 2047.

samuelgarcia · 2024-09-26T14:16:01Z

I am not sure to like this.
Lets discuss before any merger please.
How do we get rec_attributes if not recording ?

alejoe91 · 2024-09-26T14:18:05Z

From the analyzer rec_attributes, which is always set! This is essential especially when loading an old waveform_extractor

zm711 · 2024-09-26T19:29:23Z

src/spikeinterface/core/sortinganalyzer.py

+            if self.format == "binary_folder":
+                extension_folder = self._get_binary_extension_folder()
+                run_info_file = extension_folder / "run_info.json"
+                run_info_file.write_text(json.dumps(run_info, indent=4), encoding="utf8")


One question I have here in general not related to this PR is if we encode with 'utf-8' will that then mess with other encoding systems? Or do we think that the encoding was really just limited to shell script issues? Is there any place where bouncing between encodings may bite us?

Since we are building the dictionaries we are sure that they can be UTF-8 encoded and decoded, so I think it's safe!

What about the case for paths where the paths include Chinese or Japanese characters, which can't be encoded with uft-8? For run_info that doesn't matter, but I'm wondering about other json files we make. It might be that we just let people know that this works for utf-8 and may work for other encodings. But since most people on the team all use utf-8 I think it will be hard for us to know for sure for this. Doesn't really hold anything up I guess.

samuelgarcia · 2024-09-30T14:29:44Z

src/spikeinterface/core/sortinganalyzer.py

@@ -352,8 +353,6 @@ def create_memory(cls, sorting, recording, sparsity, return_scaled, rec_attribut
    def create_binary_folder(cls, folder, sorting, recording, sparsity, return_scaled, rec_attributes):
        # used by create and save_as

-        assert recording is not None, "To create a SortingAnalyzer you need to specify the recording"


Could we move this to the function def create_sorting_analyzer ?

samuelgarcia · 2024-09-30T14:30:38Z

src/spikeinterface/core/sortinganalyzer.py

+            with open(folder / "recording.json", mode="w") as f:
+                json.dump({}, f, indent=4)


I would write nothing I think.

samuelgarcia · 2024-09-30T14:31:50Z

src/spikeinterface/core/sortinganalyzer.py

-            recording.dump(folder / "recording.json", relative_to=folder)
-        elif recording.check_serializability("pickle"):
-            recording.dump(folder / "recording.pickle", relative_to=folder)
+        if recording is not None:


we should have a mutualy exclusive between recording and rec_attributes in this function at some point

samuelgarcia · 2024-09-30T14:32:39Z

src/spikeinterface/core/sortinganalyzer.py

+                warnings.warn(
+                    "SortingAnalyzer with zarr : the Recording is not json serializable, the recording link will be lost for future load"
+                )


This warning should be also in the other format no ?

Is it not there? I'll check

…ertions

Allow to save recordingless analyzer as

c86cd5f

alejoe91 added the core Changes to core module label Sep 25, 2024

alejoe91 added 3 commits September 25, 2024 19:27

Fix missing run_info

7359f16

Fix missing run_info 2

b037b29

Fix missing run_info 3

b5b553f

chrishalcrow reviewed Sep 26, 2024

View reviewed changes

alejoe91 added 2 commits September 26, 2024 10:21

Merge branch 'main' into fix-save-as-recordingless

19cc6ea

relax test causal to fix failure

a5372b0

Chris' suggestion

211d68e

zm711 reviewed Sep 26, 2024

View reviewed changes

alejoe91 added this to the 0.101.2 milestone Sep 27, 2024

samuelgarcia reviewed Sep 30, 2024

View reviewed changes

alejoe91 added 3 commits September 30, 2024 17:56

Skip saving empty recording files/fields and improve warnings and ass…

e044e19

…ertions

Remove redundant assertions

2838c48

further relaxation of causal_filter equality tests....

c338658

samuelgarcia approved these changes Oct 1, 2024

View reviewed changes

samuelgarcia merged commit b0c2bae into SpikeInterface:main Oct 1, 2024
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow to save recordingless analyzer as #3443

Allow to save recordingless analyzer as #3443

alejoe91 commented Sep 25, 2024 •

edited

Loading

chrishalcrow commented Sep 26, 2024 •

edited

Loading

alejoe91 commented Sep 26, 2024

chrishalcrow commented Sep 26, 2024 •

edited

Loading

alejoe91 commented Sep 26, 2024

chrishalcrow Sep 26, 2024 •

edited

Loading

samuelgarcia commented Sep 26, 2024

alejoe91 commented Sep 26, 2024

zm711 Sep 26, 2024

alejoe91 Sep 26, 2024

zm711 Sep 27, 2024

samuelgarcia Sep 30, 2024

samuelgarcia Sep 30, 2024

alejoe91 Sep 30, 2024

samuelgarcia Sep 30, 2024

samuelgarcia Sep 30, 2024

alejoe91 Sep 30, 2024

alejoe91 Sep 30, 2024

		with open(folder / "recording.json", mode="w") as f:
		json.dump({}, f, indent=4)

Allow to save recordingless analyzer as #3443

Allow to save recordingless analyzer as #3443

Conversation

alejoe91 commented Sep 25, 2024 • edited Loading

chrishalcrow commented Sep 26, 2024 • edited Loading

alejoe91 commented Sep 26, 2024

chrishalcrow commented Sep 26, 2024 • edited Loading

alejoe91 commented Sep 26, 2024

chrishalcrow Sep 26, 2024 • edited Loading

Choose a reason for hiding this comment

samuelgarcia commented Sep 26, 2024

alejoe91 commented Sep 26, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alejoe91 commented Sep 25, 2024 •

edited

Loading

chrishalcrow commented Sep 26, 2024 •

edited

Loading

chrishalcrow commented Sep 26, 2024 •

edited

Loading

chrishalcrow Sep 26, 2024 •

edited

Loading