Dump Error Keys #1139

deckar01 · 2019-02-15T14:35:31Z

Store errors by key when dumping collections to prevent item errors from being reported as their parent.

Example:

from marshmallow import Schema, ValidationError, fields


class Test(Schema):
    foo = fields.List(fields.Int)

Test().dump({'foo': ['five']})
# Before: {'foo': ['Not a valid integer.']}
# After: {'foo': {0: ['Not a valid integer.']}}
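
The change in error shape can be sketched without marshmallow itself. The names below (`ItemError`, `serialize_int`, `dump_list`) are hypothetical stand-ins, not marshmallow APIs; the sketch only illustrates recording each item's error under its index instead of attaching it to the parent field.

```python
class ItemError(Exception):
    """Stand-in for a per-item validation error (not marshmallow's class)."""

    def __init__(self, messages):
        super().__init__(messages)
        self.messages = messages


def serialize_int(value):
    """Hypothetical item serializer: accept only values int() can convert."""
    try:
        return int(value)
    except (TypeError, ValueError):
        raise ItemError(["Not a valid integer."]) from None


def dump_list(values, serialize_item):
    """Serialize each item, storing any failure under the item's index."""
    result, errors = [], {}
    for index, value in enumerate(values):
        try:
            result.append(serialize_item(value))
        except ItemError as error:
            errors[index] = error.messages
    return result, errors


result, errors = dump_list(["five", 6], serialize_int)
# errors is keyed by the failing item's index: {0: ['Not a valid integer.']}
```

Keying by index keeps the error attached to the item that caused it, so a caller can tell which element of the collection failed.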

deckar01 · 2019-02-15T14:39:23Z

marshmallow/fields.py

+                        result[keys[key]] = error.valid_data
+                else:
+                    if key in keys:
+                        result[keys[key]] = deser_val


I'm not really happy with copying and pasting this much code. It is effectively 4 copies (earlier counts: 6, then 8) of the same thing, with slight variations. I suspect there is a utility interface that can abstract this functionality while also improving the code's readability.

I took a first pass at deduplicating the logic. I didn't see a clean way to abstract the operations across field classes, but I avoided introducing any duplicate logic in this PR and deduplicated a few operations that were already present.

I need to test what the performance implications are for this abstraction. It should be minimal, because the primary difference is getattr being used in place of static attribute access once in each method. An alternative would be to use a lambda instead, but I don't have an intuition for which is preferable or faster.
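
The two options mentioned here can be compared in isolation. This is a rough sketch with made-up names (`IntField`, `map_items_getattr`, `map_items_callable`); it does not reproduce the PR's actual helpers, and the timings it prints depend on the machine and Python version.

```python
import timeit


class IntField:
    """Toy field with separate dump/load methods (not a marshmallow class)."""

    def _serialize(self, value):
        return int(value)

    def _deserialize(self, value):
        return int(value)


def map_items_getattr(field, method_name, values):
    """Look the method up by name once, then apply it to every item."""
    method = getattr(field, method_name)
    return [method(value) for value in values]


def map_items_callable(call, values):
    """Accept any callable, for example a bound method or a lambda."""
    return [call(value) for value in values]


field = IntField()
values = ["1", "2", "3"]

via_getattr = map_items_getattr(field, "_serialize", values)
via_lambda = map_items_callable(lambda v: field._serialize(v), values)

# Rough timing comparison; only the relative difference is of interest.
t_getattr = timeit.timeit(
    lambda: map_items_getattr(field, "_serialize", values), number=10_000
)
t_lambda = timeit.timeit(
    lambda: map_items_callable(lambda v: field._serialize(v), values),
    number=10_000,
)
print(f"getattr: {t_getattr:.4f}s  lambda: {t_lambda:.4f}s")
```

In this sketch the `getattr` lookup happens once per call, while the lambda adds one extra Python-level call per item, so measuring is safer than guessing which is cheaper for a given collection size.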

A few methods were passing None into the attr and obj/data arguments instead of passing them through like the rest. Normalizing this did not affect the tests, but I suspect the original behavior was an obscure bug. I need to look at the implications of this change a little closer.

> I suspect the original behavior was an obscure bug.

Confirmed. See #1176.
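
As an illustration of why dropping these arguments can matter, here is a toy sketch. It is not a reproduction of #1176 and uses hypothetical classes (`FunctionField`, `ListField`) rather than marshmallow's; it only assumes an inner field whose output depends on the parent object.

```python
class FunctionField:
    """Toy field whose output depends on the parent object (hypothetical)."""

    def __init__(self, func):
        self.func = func

    def _serialize(self, value, attr, obj):
        # Relies on `obj` being the real parent object.
        return self.func(obj)


class ListField:
    """Toy container that delegates each item to an inner field."""

    def __init__(self, inner, pass_through=True):
        self.inner = inner
        self.pass_through = pass_through

    def _serialize(self, values, attr, obj):
        if self.pass_through:
            return [self.inner._serialize(v, attr, obj) for v in values]
        # Context is dropped: the inner field sees None for attr and obj.
        return [self.inner._serialize(v, None, None) for v in values]


owner_field = FunctionField(lambda obj: obj["owner"])
parent = {"owner": "ada", "tags": ["x", "y"]}

# Passing attr/obj through lets the inner field read the parent object.
kept = ListField(owner_field)._serialize(parent["tags"], "tags", parent)

# Passing None makes the same inner field fail on the missing parent.
dropped_error = None
try:
    ListField(owner_field, pass_through=False)._serialize(
        parent["tags"], "tags", parent
    )
except TypeError as error:
    dropped_error = error
```

Inner fields that ignore `attr` and `obj` behave the same either way, which may be why normalizing the arguments did not affect the tests.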

sloria

Implementation looks correct; just a minor spelling change suggested.

I'm also not sure about how to deduplicate the logic across field classes without introducing a confusing amount of indirection. This could happen after @lafrech's suggested ContainerField becomes a reality, but no need to block merging of this PR nor #1066 for it.

Provided there are no performance regressions with this, I think this is on the right track.

lafrech · 2019-02-15T21:32:50Z

I pushed a ContainerMixin proposal in #1066.

lafrech · 2019-06-04T07:55:52Z

I ended up dropping the ContainerMixin I proposed in #1066 and submitted a simpler PR (#1229), so don't count on it.

deckar01 · 2019-06-16T23:41:42Z

I will get this up to date now that #1229 has landed.

sloria · 2019-07-14T19:41:14Z

@deckar01 Are you still working on this? Let us know if you need help with it.

deckar01 · 2019-07-15T16:05:57Z

Benchmark:

Setup: benchmark.py

# dev
load 18.1108 µs ± 0.65%
dump 10.8205 µs ± 1.14%
load no err 18.0016 µs ± 0.69%
dump no err 10.7058 µs ± 2.38%

# 1132-nested-dump-errors
load 18.7329 µs ± 0.85%
dump 10.8929 µs ± 0.42%
load no err 18.6097 µs ± 0.85%
dump no err 10.9445 µs ± 0.97%

# change
load +3.43%
dump +0.66%
load no err +3.37%
dump no err +2.23%
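
The change figures follow from the timings above as (branch - dev) / dev. A small sketch of that arithmetic, using only the rounded numbers posted in this comment; because the inputs are already rounded, a recomputed value can differ from the posted one in the last digit.

```python
# Mean timings in microseconds, copied from the benchmark output above.
dev = {
    "load": 18.1108,
    "dump": 10.8205,
    "load no err": 18.0016,
    "dump no err": 10.7058,
}
branch = {
    "load": 18.7329,
    "dump": 10.8929,
    "load no err": 18.6097,
    "dump no err": 10.9445,
}


def percent_change(before, after):
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100


changes = {name: percent_change(dev[name], branch[name]) for name in dev}

for name, change in changes.items():
    print(f"{name:<12} {change:+.2f}%")
```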

sloria · 2019-07-16T00:14:55Z

It's a minor performance regression, but I'm wondering if we should incur it at all. We've since decided that validation should only happen on deserialization and that dumped data should be considered valid. Do we want to continue supporting validation-on-serialization?

deckar01 · 2019-07-16T00:41:40Z

The performance regression is a side effect of refactoring to avoid copying and pasting validation code. It could be omitted. I will continue the discussion about dump validation on the issue.

deckar01 · 2019-07-17T16:14:14Z

Closing in favor of #1304.

sloria added this to the 3.0 milestone Feb 15, 2019

sloria approved these changes Feb 15, 2019

deckar01 mentioned this pull request Mar 24, 2019

fields.Function doesn't pass the value #1176

Closed

deckar01 force-pushed the 1132-nested-dump-errors branch from 2fdcee7 to 76cf369 Compare March 26, 2019 18:49

deckar01 mentioned this pull request Jun 14, 2019

Wrong error messages raised when using fields.Dict #1240

Closed

Jared Deckard added 5 commits July 15, 2019 09:17

Store Dict errors by key when dumping

ec3ef5d

Store List errors by key when dumping

d0e290d

Deduplicate list marshalling logic

8568a11

Deduplicate mapping marshalling logic

081158a

Store Tuple errors by key when dumping

07f37e2

deckar01 force-pushed the 1132-nested-dump-errors branch from 76cf369 to 07f37e2 Compare July 15, 2019 14:41

sloria mentioned this pull request Jul 16, 2019

Remove validation on serialization #1132

Closed

deckar01 closed this Jul 17, 2019
