A mixture model uses multiple distributions to fit a complex distribution. Weโll focus on a mixture of two Normal distributions, which looks like this:
This is akin to two ๐๏ธ Normal Models where we randomly draw from each one according to probability
The likelihood for
This likelihood is very complicated, and this is because each datapoint
Incorporating
However, we only have
- Start with initial values for
. - Expectation: calculate expected values for each
,
- Plug
into the likelihood and find new values that maximize the likelihood; note that it can be easier to instead maximize the log likelihood. - Go back to step 2, repeat until convergence.
The EM algorithm converges to a fixed set of values, providing only point estimates