Customizing L2 Regularization using Module API

safrooze · November 18, 2017, 4:08am

The only way to customize the L2 regularization of an optimizer (e.g. if one wanted to amplify regularization of the embedding layer only) is by calling set_wd_mult on the optimizer. However set_wd_mult is not exposed through the Module API and wd_mult cannot be set through optimizer parameters passed into Module.fit() call. As a result, the only way to customize L2 regularization appears to require creating and initializing the optimizer outside of the module and passing it into fit() call. A simple modification to the optimizer’s constructor can allow wd_mult to be passed into the call. Any thoughts on this?

safrooze · September 19, 2018, 5:45pm

There really is no good way of achieving this. However Gluon API makes this much simpler, easily allowing multiple trainers to be created with different weight decay parameters.

Topic		Replies	Views
Multiple weight decay rates Gluon	4	773	January 4, 2019
Custom Loss + L2 Regularization Discussion	3	1396	July 6, 2018
L1 regularization implementation in Gluon	0	401	March 24, 2020
Gradient nan when using 2-norm in lstm network Gluon	0	392	August 16, 2019
Multiple losses Gluon	7	3650	June 5, 2018

Customizing L2 Regularization using Module API

Related Topics