Response surface methods is a mix between โ๏ธ Reinforcement Learning and โ Active Learning that aims some that minimizes (for unknown function ). To do so, we repeatedly query and improve our guess for .
Unlike active learning, our goal is to minimize instead of fitting to the entire data. This makes the problem much more like reinforcement learning, specifically ๐ Contextual Bandit, where corresponds with an action and is the reward (or loss, in our minimization case).