Custom Normalization Layers very slow

I recently implemented custom normalization layers for image classification and object detection tasks in MXNet, namely GroupNorm and Filter Response Normalization. (I know a GN implementation already exists in GluonCV.) These normalization blocks consist of only a few simple operations, but they slow down my training by a factor of 2 compared to standard BN.
Why do these custom blocks take so much longer to compute? Is there any way to accelerate them apart from implementing them in C++?
Here is my code for the FRN layer:

    def hybrid_forward(self, F, x, gamma, beta, tau, eps):

        # mean squared norm of x
        # mean squared norm of x over the spatial dimensions
        nu2 = F.mean(F.square(x), axis=[2, 3], keepdims=True)
        # filter response normalization: x / sqrt(nu2 + |eps|)
        x = F.broadcast_mul(x, F.rsqrt(F.broadcast_add(nu2, F.abs(eps))))

        # affine transformation and thresholded linear unit (TLU)
        x = F.broadcast_maximum( F.broadcast_add( F.broadcast_mul( gamma, x ), beta ), tau )

        return x