I recently read some papers, and one of their idea is eliminated the weight decay on the bias and batch nomalization. And I use mxnet.symbol api to train the model, but I can’t find any documents or informations to implement that in mxnet. So, I want to ask your guys this forum, thank you so much.