Why we don’t say KL divergence is a distance
Sep 12, 2020
Because a distance (a metric, in the mathematical sense) is usually required to be symmetric, which means D(A, B) = D(B, A).
However, this is not the case for the KL divergence: in general, KL(P || Q) is not equal to KL(Q || P).
You can see the asymmetry in this gif.
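The asymmetry can also be checked numerically. The snippet below is a minimal sketch, not part of the original post: the helper `kl_divergence` and the two example distributions `p` and `q` are made up for illustration, and it assumes discrete distributions over the same outcomes with no zero entries in the second argument.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions given as lists.

    Hypothetical helper for illustration; assumes q[i] > 0 wherever p[i] > 0.
    """
    # Terms with p[i] == 0 contribute nothing, so they are skipped.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Two example distributions over the same two outcomes (arbitrary values).
p = [0.9, 0.1]
q = [0.5, 0.5]

kl_pq = kl_divergence(p, q)  # KL(P || Q)
kl_qp = kl_divergence(q, p)  # KL(Q || P)

print(f"KL(P || Q) = {kl_pq:.4f}")
print(f"KL(Q || P) = {kl_qp:.4f}")
```

With these values the two directions come out to roughly 0.368 and 0.511 nats, so swapping the arguments changes the result.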