I’m using the following code to create an array on gpu and the program never finishes. I got no error, the program just keeps running.
import mxnet as mx
mx.ndarray.ones(shape=(1,1),ctx=mx.gpu(0))
The code runs as expected both on my other GPU and on CPU. I’ve checked the GPU usage using nvidia-smi.exe and the memory usage is continuously increasing from a starting value of approximately 90MiB. nvidia-smi gives the following output
+-----------------------------------------------------------------------------+
Wed Jul 7 09:57:30 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 471.11 Driver Version: 471.11 CUDA Version: 11.4 |
|-------------------------------+----------------------+----------------------+
| GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Quadro RTX 4000 TCC | 00000000:65:00.0 Off | N/A |
| 30% 51C P8 19W / 125W | 306MiB / 8063MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 Quadro P400 WDDM | 00000000:B3:00.0 On | N/A |
| 34% 38C P8 N/A / N/A | 216MiB / 2048MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 2960 C ...iles\Python39\pythonw.exe 305MiB |
etc.
I’m using the following :
-
Windows server 2019 Essentials
-
Quadro RTX 4000
-
NVIDIA Graphics driver 471.11
-
Cuda 10.2.86
-
Python 3.9.5
-
pip 21.1.1
-
mxnet-cu102 2.0.0b20201108
-
cuDNN 6.14.11.10020
I’ve tested both TCC and WDDM modes on the GPU. I’ve also used driver version 441.66 where the cuda version displayed by nvidia-smi is 10.2 but the issue persists. I’ve also tested mxnet-cu102 1.7.0. Both mxnet versions were downloaded from https://dist.mxnet.io/python/cu102 and installed using pip. I’ve tried reinstalling all the programs. I run the code using IDLE.