A mixture model uses multiple distributions to fit a complex distribution. Weโll focus on a mixture of two Normal distributions, which looks like this:
This is akin to two ๐๏ธ Normal Models where we randomly draw from each one according to probability 
The likelihood for 
This likelihood is very complicated, and this is because each datapoint 
Incorporating 
However, we only have 
- Start with initial values for - Expectation: calculate expected values for each 
- Plug - Go back to step 2, repeat until convergence.
The EM algorithm converges to a fixed set of values, providing only point estimates