When using MXNet (Scala) to predict on CPU using a CNN, the CPU utilization never goes above 15%. The machine doesn’t appear to be doing much IO either.
Any idea what could be going wrong?
When using MXNet (Scala) to predict on CPU using a CNN, the CPU utilization never goes above 15%. The machine doesn’t appear to be doing much IO either.
Any idea what could be going wrong?
Could you share more information such as version of MXNet (build from source or pip installed, if build from source, what are the compile flags). If possible, could you also share the script you ran for predication?
try
export MXNET_CPU_WORKER_NTHREADS=(a larger int)
Using MXNet 0.11.0:
Per the suggestions on the MXNet CPU performance page, I tried setting the thread affinity and number of available threads to OpenMP, but there’s no difference.
I can’t share all of my prediction code, but it’s pretty standard:
@zhreshold:
I’ll give that a shot and report back. Thanks.