GPU memory leak with Sanic


I am trying to integrate mxnet model in Sanic asynchronous application and am encountering a memory leak when running the model in run_in_executor. Have tried ctx.empty_cache() and mx.nd.waitall() and still, the leak persists when parallelizing the requests sent to the application.
I am relatively new to mxnet and any help would be appreciated.