Fastest way to compute cosine similarities of ndarrays

ThomasDelteil · October 23, 2019, 2:07am

Here is the solution in Python.

You can reproduce it in Scala using the Scala API:
Here are some useful tutorials:

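import mxnet as mx
import time

# Note: the timer starts before the random inputs are created,
# so the measurement below includes generating them on the GPU.
tic = time.time()
first_term = mx.nd.random.uniform(shape=(10000,2048), ctx=mx.gpu())
second_term = mx.nd.random.uniform(shape=(10000,2048), ctx=mx.gpu())

# L2-normalize each row so that the dot product of two rows is their cosine similarity
first_term_normalized = first_term / mx.nd.norm(first_term, axis=1, keepdims=True)
second_term_normalized = second_term / mx.nd.norm(second_term, axis=1, keepdims=True)

# Row-wise dot products: (10000, 1, 2048) x (10000, 2048, 1) -> (10000, 1, 1), squeezed to (10000,)
cosine_similarity = mx.nd.batch_dot(first_term_normalized.expand_dims(axis=1), second_term_normalized.expand_dims(axis=2)).squeeze()
# MXNet executes asynchronously; wait for all pending work before reading the timer
mx.nd.waitall()
print(time.time()-tic)
print(cosine_similarity)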

(It takes about 10 ms on GPU.)
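
To sanity-check the result on a machine without a GPU, the same row-wise computation can be sketched in NumPy. This swaps NumPy in for MXNet and is only a reference check, not a replacement for the GPU version above. The function name `rowwise_cosine_similarity` and the `eps` guard against all-zero rows are assumptions added for illustration; neither appears in the snippet above.

```python
import numpy as np

def rowwise_cosine_similarity(a, b, eps=1e-12):
    """Cosine similarity between a[i] and b[i] for every row i.

    Hypothetical helper for illustration; `eps` is an added assumption
    that avoids division by zero when a row is all zeros.
    """
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    # L2-normalize each row, clamping the norm from below by eps
    a_normalized = a / np.maximum(np.linalg.norm(a, axis=1, keepdims=True), eps)
    b_normalized = b / np.maximum(np.linalg.norm(b, axis=1, keepdims=True), eps)
    # Row-wise dot product, the same quantity the batched (1 x d) @ (d x 1) product yields
    return np.einsum("ij,ij->i", a_normalized, b_normalized)

# Small hand-checkable inputs: orthogonal, parallel, and opposite rows
a = np.array([[1.0, 0.0], [1.0, 1.0], [2.0, 0.0]])
b = np.array([[0.0, 1.0], [2.0, 2.0], [-1.0, 0.0]])
similarities = rowwise_cosine_similarity(a, b)
print(similarities)
```

On small inputs like these the values can be checked by hand, and the output of the MXNet version can be compared against it with `cosine_similarity.asnumpy()`, allowing for small differences from float32 arithmetic.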
