[Mixed-Precision] In mixed-precision training/inference, does Gluon do float16 addition or float32 addition?

I was participating in the MicroNet Challenge recently, and the host (Google) proposed a way to count FLOPs for float16 models. Their view is that, for a float16 model, the multiply operations in a matrix-matrix multiplication should be counted as float16 operations, while the add operations should be counted as float32 operations.
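If I understand their rule correctly, counting it for a single matrix multiplication would look like the sketch below. This is just my own illustration of the rule, not official challenge code; I assume k - 1 additions per output element (it would be k if a bias/accumulator term is also counted):

```python
def matmul_flops(m, k, n):
    # FLOPs for an (m x k) @ (k x n) matrix multiplication under that rule:
    # multiplies counted at half precision, additions at single precision.
    fp16_mults = m * n * k        # one multiply per partial product
    fp32_adds = m * n * (k - 1)   # k - 1 additions to reduce each output element
    return fp16_mults, fp32_adds

print(matmul_flops(1024, 1024, 1024))
# -> (1073741824, 1072693248)
```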

From what the Nvidia and MXNet tutorials claim:

Nvidia: The Volta generation of GPUs introduces Tensor Cores, which provide 8x more throughput than single-precision math pipelines. Each Tensor Core performs D = A x B + C, where A, B, C, and D are matrices. A and B are half-precision 4x4 matrices, whereas D and C can be either half or single precision 4x4 matrices.

MXNet: Nvidia Tensor Cores essentially perform the computation D = A * B + C, where A and B are half-precision matrices, while C and D could be either half-precision or full precision.
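To see why the accumulation precision matters in the first place, here is a quick NumPy sketch (plain NumPy, nothing Gluon-specific) that simulates a half-precision accumulator versus a single-precision one:

```python
import numpy as np

# Sum 100,000 copies of float16(0.01); the exact answer is about 999.45,
# because float16(0.01) is really 0.0099945...
x = np.float16(0.01)

acc16 = np.float16(0.0)   # accumulate in half precision
acc32 = np.float32(0.0)   # accumulate in single precision
for _ in range(100000):
    acc16 = np.float16(acc16 + x)  # result rounded back to float16 each step
    acc32 = acc32 + np.float32(x)

print(acc16)  # 32.0 -- the sum stalls once x is below half a float16 ULP
print(acc32)  # ~999.5 -- close to the true value
```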

I am wondering: in the actual implementation, how does Gluon mixed precision do the addition for a float16 model? Half or full precision? Thanks!

Anyone to help plz…

Hi @OldeElk, if you cast your model using .cast('float16') then all your weights are cast to float16.
If you use the Automatic Mixed Precision (AMP) mode, only the necessary operators will be cast to float16; the rest stay in float32.
You can read more about it in this blog post: https://medium.com/apache-mxnet/simplify-mixed-precision-training-with-mxnet-amp-dc2564b1c7b0
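For reference, a minimal sketch of both options, assuming MXNet >= 1.5 (where mxnet.contrib.amp was introduced) and a CUDA GPU; the AMP calls follow the pattern from that blog post:

```python
import mxnet as mx
from mxnet import autograd, gluon
from mxnet.contrib import amp

ctx = mx.gpu()  # AMP targets Volta+ GPUs with Tensor Cores

# Option 1: cast everything -- weights and inputs are all float16.
net = gluon.nn.Dense(10)
net.initialize(ctx=ctx)
net.cast('float16')
out = net(mx.nd.ones((4, 8), ctx=ctx, dtype='float16'))

# Option 2: AMP -- call amp.init() before building the network;
# AMP decides per operator whether to run it in float16 or float32.
amp.init()
net2 = gluon.nn.Dense(10)
net2.initialize(ctx=ctx)
trainer = gluon.Trainer(net2.collect_params(), 'sgd')
amp.init_trainer(trainer)  # enables dynamic loss scaling
with autograd.record():
    loss = net2(mx.nd.ones((4, 8), ctx=ctx)).sum()
    with amp.scale_loss(loss, trainer) as scaled_loss:
        autograd.backward(scaled_loss)
trainer.step(4)
```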