Softmax block in Gluon

avolozin · October 20, 2017, 9:10pm

The http://gluon.mxnet.io/chapter02_supervised-learning/softmax-regression-gluon.html# doesn’t explain how to get a “pure” Softmax block in a Gluon model.

“Note, we didn’t have to include the softmax layer because MXNet’s has an efficient function that simultaneously computes the softmax activation and cross-entropy loss. However, if ever need to get the output probabilities,”

it seems like the last sentence is not complete. Is the only way to build a custom block and use F.softmax(x) in the hybrid_forward method?

Thanks!
-Andrey

astonzhang · October 21, 2017, 5:55am

>>> import mxnet as mx
>>> from mxnet import ndarray as nd
>>> x = nd.array([1, 2, 3, 4])
>>> y = nd.softmax(x)
>>> y

[ 0.0320586   0.08714432  0.23688284  0.64391428]
<NDArray 4 @cpu(0)>
>>> sum(y)

[ 1.]
<NDArray 1 @cpu(0)>

avolozin · October 21, 2017, 6:23pm

Thanks astonzhang!

I was actually looking for an existing softmax block to use in Gluon models. In the meantime I am using a little custom one:

class Softmax(HybridBlock):

def __init__(self, **kwargs):
     super(Softmax, self).__init__(**kwargs)

def hybrid_forward(self, F, x):
    return F.softmax(x)

madjam · October 23, 2017, 5:28pm

@avolozin this seems useful enough to be directly added to gluon.
Would be be ok with creating a PR request?
https://mxnet.incubator.apache.org/community/contribute.html

avolozin · October 23, 2017, 8:29pm

Thanks @madjam! I will need to checkout the code and setup a dev env - might have time toward the end of this week, quite swamped now

Another option would be to add ‘softmax’ as a new type in mxnet.gluon.nn.Activation. What do you think?

madjam · October 23, 2017, 8:38pm

I like that idea. Keras does this in a similar fashion. https://keras.io/activations/

avolozin · October 23, 2017, 8:58pm

Thanks! I’ll ping you when I get some time to implement this

YCAyca · January 9, 2020, 12:53pm

I was searching the same attribute and It could be better if we had softmax as a new type in mxnet.gluon.nn.Activation !

Topic		Replies	Views
Custom HybridBlock Problem when not hybridizing with random_uniform Gluon	4	1253	December 16, 2018
SVRG Optimization on gluon (mx.module on gl.trainer) Gluon	2	536	November 29, 2018
Hybridizing if elif and else statements in Gluon	3	754	March 25, 2019
How to return subset of a gluon hybrid block? Gluon	3	418	April 21, 2019
Updating the parameters of HybridBlocks Discussion	6	1993	December 1, 2017

Softmax block in Gluon

Related Topics