Batch_dot one hot vectors with embeddings results in nan

The bug only occurs on CPU:

src/operator/nn/./fully_connected-inl.h:200: float16 fully connected layer is currentlyonly supported by CuDNN version.

Well, it works fine on GPU. So, not a big deal.