I have a few questions regarding the RNN (LSTM/GRU) implementation in Gluon and/or the Symbol API.
- Is there a difference between how LSTM and GRU are implemented in Gluon versus the Symbol API?
- Does Gluon follow the cuDNN RNN implementation for LSTM/GRU?
- If so, how does it handle packing the input elements into contiguous memory? PyTorch has a method called `pack_padded_sequence`; does Gluon do something similar internally?
- Does the Gluon RNN implementation support variable-length input? If it does, how do I pass variable-length sequences to it, given that an MXNet NDArray requires all elements to have the same dimensions?
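To make the packing question concrete, here is my pure-Python understanding of what `pack_padded_sequence` does (this is my own sketch of the layout, not taken from any framework's source): sequences sorted by length, longest first, are flattened time-major into one contiguous buffer, together with a `batch_sizes` list saying how many sequences are still active at each timestep.

```python
def pack_sequences(seqs):
    """Pack variable-length sequences into one contiguous buffer.

    seqs: list of lists, pre-sorted by length, longest first.
    Returns (data, batch_sizes) where data holds the elements
    time-major and batch_sizes[t] is the number of sequences
    that still have an element at timestep t.
    """
    max_len = len(seqs[0])
    data, batch_sizes = [], []
    for t in range(max_len):
        # Sequences that still have an element at this timestep.
        alive = [s for s in seqs if len(s) > t]
        batch_sizes.append(len(alive))
        data.extend(s[t] for s in alive)
    return data, batch_sizes

# Example: three sequences of lengths 3, 2, 1.
data, sizes = pack_sequences([[1, 2, 3], [4, 5], [6]])
# data  -> [1, 4, 6, 2, 5, 3]
# sizes -> [3, 2, 1]
```

Is this roughly what the cuDNN-backed path would need internally, and if so, does Gluon build this layout itself?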
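For context on the variable-length question, this is the workaround I am currently using (a pure-Python sketch of my own approach, not an MXNet API): pad every sequence to the batch maximum so everything fits in one rectangular NDArray, and keep the true lengths in a separate array in the hope that the padded steps can later be masked out.

```python
def pad_batch(seqs, pad_value=0):
    """Pad variable-length sequences to a rectangular batch.

    Returns (padded, lengths): padded is a list of equal-length
    lists suitable for a single rectangular array, and lengths
    records each sequence's true (unpadded) length.
    """
    max_len = max(len(s) for s in seqs)
    lengths = [len(s) for s in seqs]
    padded = [s + [pad_value] * (max_len - len(s)) for s in seqs]
    return padded, lengths

# Example: three sequences of lengths 3, 2, 1.
padded, lengths = pad_batch([[1, 2, 3], [4, 5], [6]])
# padded  -> [[1, 2, 3], [4, 5, 0], [6, 0, 0]]
# lengths -> [3, 2, 1]
```

Is padding plus a separate lengths array the intended pattern in Gluon, or is there built-in support that makes this unnecessary?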