Convert data to arrays if possible when creating datasets #48

simetenn · 2018-05-11T08:29:20Z

Converts data to a NumPy array unless it is already an ndarray or None. Among other things, this lets lists be passed directly when creating a dataset, as in h5py.

This also means that other objects are converted to NumPy object arrays. That is consistent with how object arrays are currently supported and handled, but see issue #47.
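
As a rough illustration of the two cases described above, the following sketch uses plain NumPy only and is not exdir's code. The `Opaque` class is a hypothetical stand-in for an arbitrary object that NumPy has no special handling for.

```python
import numpy as np


class Opaque:
    """Hypothetical stand-in for an arbitrary, non-sequence object."""


# A list becomes a regular numeric array with one entry per element.
list_array = np.array([1, 2, 3])

# An arbitrary object is wrapped in a zero-dimensional object array.
object_array = np.array(Opaque())

print(list_array.shape, list_array.dtype.kind)
print(object_array.shape, object_array.dtype)
```

The second case is the one that may deserve caution, since the conversion succeeds silently even though nothing array-like was passed in.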

codecov · 2018-05-11T08:33:17Z

Codecov Report

Merging #48 into dev will increase coverage by 0.01%.
The diff coverage is 100%.

@@            Coverage Diff            @@
##              dev     #48      +/-   ##
=========================================
+ Coverage   97.58%   97.6%   +0.01%     
=========================================
  Files          11      11              
  Lines        1369    1376       +7     
=========================================
+ Hits         1336    1343       +7     
  Misses         33      33

| Impacted Files | Coverage Δ | |
|---|---|---|
| tests/test_dataset.py | `100% <100%> (ø)` | ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update edbdea9...eb479a5.

dragly

I think we should definitely use np.shape, but I'm not sure if np.array(data) is a good idea (see the inline comment for details).

dragly · 2018-05-14T20:44:05Z

exdir/core/group.py

@@ -113,12 +113,15 @@ def create_dataset(self, name, shape=None, dtype=None,

        prepared_data, attrs, meta = ds._prepare_write(data, self.plugin_manager.dataset_plugins.write_order)

+        if not isinstance(prepared_data, np.ndarray) and prepared_data is not None:
+            prepared_data = np.array(prepared_data)


Which types will be affected by this? I remember something about quantities automatically getting converted the wrong way (i.e. not by the plugin if there was no plugin) because there was some support for conversion that we didn't expect. If this only affects lists (and other types that you'd really expect to get converted to an np.array), I'm all for it. But if this may trigger some unexpected conversion that a plugin should have done when the plugin is not enabled, I think we should only add it for types we already know (such as lists).
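
A minimal sketch of the more conservative option mentioned above, where only types that are known to convert cleanly are turned into arrays. The helper name `convert_known_types` and the `KNOWN_SEQUENCE_TYPES` tuple are hypothetical and not part of exdir; which types belong in the tuple is an assumption.

```python
import numpy as np

# Assumption: only these built-in sequence types are converted automatically.
KNOWN_SEQUENCE_TYPES = (list, tuple)


def convert_known_types(data):
    """Convert lists and tuples to arrays; return everything else unchanged."""
    if isinstance(data, KNOWN_SEQUENCE_TYPES):
        return np.array(data)
    # ndarrays, None and any type a plugin should handle pass through untouched.
    return data
```

With this approach, a value that a disabled plugin would normally have handled is left as-is rather than being converted implicitly.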

dragly · 2018-05-14T20:44:26Z

exdir/core/group.py

            raise ValueError(
                "Provided shape and data.shape do not match: {} vs {}".format(
-                    shape, data.shape
+                    shape, np.shape(shape)


Looks like a typo. Should it be np.shape(data)?
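
To illustrate why `np.shape(data)` is the useful form here: `np.shape` accepts plain lists as well as arrays, whereas a list has no `.shape` attribute. The `check_shape` helper below is a hypothetical sketch of such a check, not the code in `group.py`, and it assumes `shape` is either None or a tuple.

```python
import numpy as np


def check_shape(shape, data):
    """Hypothetical helper: raise if `shape` disagrees with the shape of `data`."""
    data_shape = np.shape(data)  # works for lists, tuples and ndarrays alike
    if shape is not None and tuple(shape) != data_shape:
        raise ValueError(
            "Provided shape and data.shape do not match: {} vs {}".format(
                shape, data_shape
            )
        )
    return data_shape


print(check_shape((3,), [1, 2, 3]))
print(hasattr([1, 2, 3], "shape"))
```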

dragly · 2018-05-14T20:44:40Z

tests/test_dataset.py

+    data = [1, 2, 3]
+    dset = grp.create_dataset('foo', data=data)
+    assert dset.shape == (3,)
+    assert np.array_equal(dset.data, np.array(data))


Thanks for adding a test!

simetenn added 2 commits May 11, 2018 10:04

Convert data to arrays if possible when creating datasets

acbfa33

Merge branch 'dev' into lists

eb479a5

simetenn requested review from dragly and miladh May 11, 2018 08:29

dragly requested changes May 14, 2018
