New system to partially replace xforms (maybe) #14

jacobpennington · 2022-06-29T22:06:25Z

The xforms system is useful in principle (i.e. being able to ensure consistent re-use of preprocessing and fitting procedures), but the implementation did not scale well and many of the xforms functions had hard-coded associations with lab-specific usage.

Idea for a new system (inspired by scikit-learn pipelines):

a Pipeline class that performs a series of data transformations. Idea being that a user can:

Add operations to a Pipeline instance
Define a subclass of Pipeline with the operations already specified (similar to pre-built models idea)
Call Pipeline.transform(data) to perform the steps.

Example 1:

# Preprocessing only
data = {'stimulus': ...<waveforms>... , 'response': ...<spikes>... , 'state': ...}
pipe = Pipeline()
# Add function objects that expect some kind of data as the first argument
pipe.add_step(sound_to_spectrogram, input='stimulus', kwargs={'n_channels': 18})
pipe.add_step(spikes_to_rates, input='response')
pipe.add_step(split_by_fraction, kwargs={'fraction': 0.9, 'axis': 0})
# Transform the data
new_data = pipe.transform(data)

Example 2:

class MyStandardPipeline(Pipeline):
    def __init__(self):
        self.add_steps(
            (sound_to_spectrogram, input='stimulus', kwargs={'n_channels': 18}),
            (spikes_to_rates, input='response'),
            (split_by_fraction, kwargs={'fraction': 0.9, 'axis': 0})
        )

data = {'stimulus': ...<waveforms>... , 'response': ...<spikes>... , 'state': ...}
MyStandardPipeline.transform(data)

I think it would be best to limit this to preprocessing for simplicity, but model fitting could also be included.

Example:

data = {'stimulus': ...<waveforms>... , 'response': ...<spikes>... , 'state': ...}
model = Model().add_layers(...)

pipe = Pipeline()
pipe.add_steps(...<preprocessing>...)
pipe.add_step(model.fit, input=None, kwargs={'target': 'response'}  # None: get full data dict instead of one value
pipe.add_steps(...<more processing>...)
pipe.add_step(model.fit, ...)

The text was updated successfully, but these errors were encountered:

jacobpennington added the enhancement New feature or request label Jun 29, 2022

jacobpennington self-assigned this Jun 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New system to partially replace xforms (maybe) #14

New system to partially replace xforms (maybe) #14

jacobpennington commented Jun 29, 2022 •

edited

Loading

New system to partially replace xforms (maybe) #14

New system to partially replace xforms (maybe) #14

Comments

jacobpennington commented Jun 29, 2022 • edited Loading

jacobpennington commented Jun 29, 2022 •

edited

Loading