thanks, may I ask another question? As far as I can see, in_grad always has the same shapes as in_data. But till now I have’t seen any example how to use out_grad. Is it also the same size of out_data? How is this used in backward propagation? Many thanks in advance!