Similarity formula needs explanation

Question

xxxpsyduck opened this issue 4 years ago · comments

Please explain the following formula:

sim = (1. + np.dot(emb1, emb2)) / 2

why using this instead of cosine similarity?

SthPhoenix · Answer 1 · Fri Oct 23 2020 20:01:16 GMT+0800 (China Standard Time)

Since we are computing similarity of normed embeddings dot product is equal to cosine distance in this case.

(1 + sim )/2 formula is to normalize similarity in range [0…1] instead of [-1…1], it's just for convenience of representation