Why we don’t say KL divergence is a distance

Jimmy (xiaoke) Shen
1 min readSep 12, 2020

--

Because usually, the distance has the symmetry property, which means D(A, B) = D(B, A).

However, this is not the case for the KL divergence.

You can check the asymmetry from this gif.

--

--

No responses yet