Expected labels of dependent variable in binary classification

FraPochetti · October 25, 2017, 8:25pm

I am a bit confused about which labels MXNet is expecting in a binary classification context.
In my problem, I have a dependent variable which looks like an array of 1s and 0s i.e. [1,0,0,0,1,1….0,0,0,1].
In numpy terms, its shape is (n_data_points,).

Given that, my last 2 layers in the model are defined as follows:

fc2 = mx.symbol.FullyConnected(data = fc1bn, name=‘fc2’, num_hidden=1)
mlp = mx.symbol.LogisticRegressionOutput(data = fc2, name = ‘softmax’)

This works perfectly.
Thing is, this works as well

fc2 = mx.symbol.FullyConnected(data = fc1bn, name=‘fc2’, num_hidden=2)
mlp = mx.symbol.SoftmaxOutput(data = fc2, name = ‘softmax’)

whilst I would have expected the above to work only if the dependent variable was one-hot-encoded, i.e. [[1,0],[0,1],[0,1],[0,1],[1,0],…[0,1],[1,0]], or again, in numpy terms, shaped as (n_data_points,2).

Apparently SoftmaxOutput is smart enough to spit out a probability and return argmax at the same time.
Now, the question is, is there a recommended way of structuring a binary classification problem?
Shall one use a one-hot-encoded variable or not?
Knowing that LogisticRegressionOutput and SoftmaxOutput do exactly the same thing in a binary context, which one is recommended?

piiswrong · October 26, 2017, 10:36pm

As the examples here show: http://mxnet.incubator.apache.org/api/python/symbol.html?highlight=softmaxoutput#mxnet.symbol.SoftmaxOutput
SoftmaxOutput by default takes integer labels.

FraPochetti · October 27, 2017, 7:36am

Thanks a lot! This is helpful

Topic		Replies	Views
Mxnet C++ : multi-label classfier Discussion	2	540	July 5, 2018
Predicting softmax with variable number of labels	4	573	February 26, 2019
Pretrained network for multi-class classification Gluon	5	1206	March 13, 2018
Multi label classification Discussion	0	333	May 10, 2020
RuntimeError: mae_label is not presented, while testing? MXNet Model Server	1	595	August 2, 2019

Expected labels of dependent variable in binary classification

Related Topics