Concise Implementation for Multi-GPU Computation

https://d2l.ai/chapter_computational-performance/multiple-gpus-concise.html