Wow this is really great. I just realised last weak that MLE can be motivated with the KL divergence between true distribution and approximation. My mind was blown in how obvious that connection was.
Holy over the top almighty. Is this comment even real ? "Mind blown" and all. Tomorrow, the sun rose, "blown is my mind".
Apologies for the snark but I can't fathom how someone who is aware of the definition of KL not see the likelihood in it.
Holy over the top almighty. Is this comment even real ? "Mind blown" and all. Tomorrow, the sun rose, "blown is my mind".
Apologies for the snark but I can't fathom how someone who is aware of the definition of KL not see the likelihood in it.