You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
But I'm not sure if this workaround would potentially break the gradient synchronization issue mentioned in #994. @Isotr0py I'd be very grateful if you could run a sanity check on this.
The text was updated successfully, but these errors were encountered:
Alright, so after merging #1000 and fixing issue #1002, I believe the gradient asynchrony issue has been solved. Here's what I found for my run, compared with the previous run with the same settings:
Lower loss for every step
The output sample images got way better. The sample images in the previous run look broken now to me. The details were cranky and the texture was crude.
Similar to issue #994, but happened while attempting to generate sample images in
sdxl_train.py
.The error message:
AttributeError: 'DistributedDataParallel' object has no attribute 'text_projection'
The issue can be fixed by unwrapping the model, i.e., change
to
But I'm not sure if this workaround would potentially break the gradient synchronization issue mentioned in #994.
@Isotr0py I'd be very grateful if you could run a sanity check on this.
The text was updated successfully, but these errors were encountered: