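The cross entropy method (CEM) is a stochastic optimization algorithm that addresses the general optimization problem of maximizing an objective via random selection. In planning, the objective is the value of a candidate action sequence, evaluated for some world model.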
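
One common way to write this planning problem is sketched below. The notation is an assumption introduced here for illustration, not taken from the text: $a_{1:T}$ is an action sequence, $r$ a reward function, and $f$ a world model that predicts the next state.

```latex
a_{1:T}^{*} = \arg\max_{a_{1:T}} J(a_{1:T}),
\qquad
J(a_{1:T}) = \sum_{t=1}^{T} r(s_t, a_t),
\qquad
s_{t+1} = f(s_t, a_t)
```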
First, we're motivated by the naive, completely stochastic approximation algorithm (sometimes called "random shooting"):
- Pick a set of candidate action sequences from some distribution.
- Choose the candidate with the highest value.
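
The two steps above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not a reference implementation: the function name `random_shooting`, the standard normal sampling distribution, and the `objective` callable are all hypothetical choices made for the example.

```python
import numpy as np

def random_shooting(objective, dim, num_samples=1000, seed=0):
    """Sample candidates once and return the best one (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Step 1: pick candidates from some distribution (assumed: standard normal).
    candidates = rng.standard_normal((num_samples, dim))
    # Step 2: evaluate every candidate and keep the one with the highest value.
    values = np.array([objective(c) for c in candidates])
    best = int(np.argmax(values))
    return candidates[best], values[best]
```

Here `objective` stands in for whatever scores a candidate, for example the return of an action sequence rolled out through a world model.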
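The cross entropy method notes that the sampling distribution itself can be improved: rather than sampling once, it repeatedly refits the distribution to the best samples, so that later samples concentrate in high-value regions.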
Formally, the CEM algorithm is as follows.
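- Sample a batch of candidates from the sampling distribution, which is typically a Gaussian.
- Evaluate the value of each candidate.
- Pick the elites: the candidates with the highest value (a strict subset of the batch).
- Refit the sampling distribution to the elites and repeat.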
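
The loop above can be sketched as follows. This is a minimal version under stated assumptions: the sampling distribution is a diagonal Gaussian initialized to zero mean and unit standard deviation, and the name `cem`, its parameters, and the small variance floor are hypothetical choices for illustration.

```python
import numpy as np

def cem(objective, dim, num_samples=200, num_elites=20, num_iters=20, seed=0):
    """Cross entropy method with a diagonal Gaussian (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    mean = np.zeros(dim)  # assumed initial mean
    std = np.ones(dim)    # assumed initial standard deviation
    for _ in range(num_iters):
        # Sample a batch of candidates from the current Gaussian.
        samples = mean + std * rng.standard_normal((num_samples, dim))
        # Evaluate the value of each candidate.
        values = np.array([objective(s) for s in samples])
        # Pick the elites: the num_elites candidates with the highest value.
        elites = samples[np.argsort(values)[-num_elites:]]
        # Refit the Gaussian to the elites; the floor keeps std strictly positive.
        mean = elites.mean(axis=0)
        std = elites.std(axis=0) + 1e-6
    return mean, std
```

The elite count controls the trade-off between exploration and speed of convergence: fewer elites shrink the distribution faster but raise the risk of collapsing onto a poor region.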
Note that though this method is efficient, it only works in low dimensions. Moreover, it only supports open-loop planning and doesn't incorporate any environment feedback to replan.