HW 9. How to prevent overflow to NaN

Our model worked fine on CPU but when ran on GPU we started getting NaNs. Any idea? We tried forcing the data type to be of type float32 but that didn’t really help.

Same problem here. Keep geeting NaN when trained on GPU with full dataset. Is there any possible explanation for this?

anyone solved this? its really frustrating…