Batch Norm And Batch Size 1 Recommendations

mikeobr · February 13, 2020, 2:23am

Hi,

I’ve been trying to train a model using ResnetV2 from the model zoo. Because I am generating training data on the fly, I have been training with a batch size = 1.

I noticed a weird behavior when running my network for inference: I would get more accurate results using with ag.record() rather than with ag.predict_mode().

According to this:

Warning: the estimates for the batch mean and variance can themselves have high variance when the batch size is small (or when the spatial dimensions of samples are small). This can lead to instability during training, and unreliable estimates for the global statistics.

How should I approach BatchNorm usage? Is there any danger in using ag.record() for inference?Should I make a custom ResnetV2 model with no BatchNorm layers?

Topic		Replies	Views
Training with one batch gives different training/validation accuracies when shuffled	2	580	October 18, 2017
Asking for the training hyper-parameters for ImageNet-1k Discussion	3	3447	April 20, 2018
Finetuneing a pretrained ResNet50_v1d in gluoncv Gluon	1	452	December 31, 2018
Default YOLOv3 does not improve	3	953	July 26, 2019
Nan in loss after several epochs in SemSeg problem Gluon	4	3289	May 7, 2018

Batch Norm And Batch Size 1 Recommendations

Related Topics