Improve evaluation results for Batch norm #2108

astonzhang · 2022-04-21T01:06:55Z

http://preview.d2l.ai.s3-website-us-west-2.amazonaws.com/d2l-en/master/chapter_convolutional-modern/batch-norm.html

The evaluation results of batch norm for all frameworks need improvements. For example, as the text states, the performance should be better than that of LeNet without BN. However, the training looks unstable as shown in the plots, such as in the plot of val_acc.

astonzhang · 2022-04-21T01:11:02Z

@AnirudhDagar feel free to share your insight here while you're working on #2107 and #2099

cheungdaven · 2022-04-21T03:16:13Z

@astonzhang I found that removing the batchnorm on the fully-connected layers can make the curve stable.

astonzhang · 2022-04-21T18:08:25Z

@cheungdaven In fact, it seems that ResNet and DenseNet have similar issues:

http://preview.d2l.ai.s3-website-us-west-2.amazonaws.com/d2l-en/master/chapter_convolutional-modern/resnet.html
http://preview.d2l.ai.s3-website-us-west-2.amazonaws.com/d2l-en/master/chapter_convolutional-modern/densenet.html

although RegNet in PyTorch has smoother plots:
http://preview.d2l.ai.s3-website-us-west-2.amazonaws.com/d2l-en/master/chapter_convolutional-modern/cnn-design.html

Since ResNet and DenseNet do not apply BN on the FC layers in network heads, perhaps the plot issue is with somewhere else? Are you able to find any literature that supports removal of BN after FC layers? If not, could you try something else?

AnirudhDagar · 2023-08-28T09:11:17Z

Closing this since it was fixed with the latest version of all the frameworks.

astonzhang assigned AnirudhDagar Apr 21, 2022

astonzhang added the needed for next release label Apr 21, 2022

astonzhang assigned cheungdaven and unassigned AnirudhDagar Apr 21, 2022

AnirudhDagar closed this as completed Aug 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve evaluation results for Batch norm #2108

Improve evaluation results for Batch norm #2108

astonzhang commented Apr 21, 2022

astonzhang commented Apr 21, 2022

cheungdaven commented Apr 21, 2022

astonzhang commented Apr 21, 2022

AnirudhDagar commented Aug 28, 2023

Improve evaluation results for Batch norm #2108

Improve evaluation results for Batch norm #2108

Comments

astonzhang commented Apr 21, 2022

astonzhang commented Apr 21, 2022

cheungdaven commented Apr 21, 2022

astonzhang commented Apr 21, 2022

AnirudhDagar commented Aug 28, 2023