TIL (somewhat embarrassingly) that en.wikipedia.org/wiki/Jensen%E provides a connection of sorts between mutual information and KL divergence

Follow

To be more precise: the interesting thing is that one can interpret D_JS(A||B) as a mutual information between something for any pair of distributions. (One can do the inverse with D_KL: I(A;B) for any two variables can be interpreted as D_KL between some two distributions.)

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.