When we need function approximation in the case of large complex state spaces, we want to find a parameterization that balances complexity and expressiveness. On the least complex end of the spectrum is linear approximation, which simply applies an inner product between the input and parameters; for a value function estimate, we have
where
While this is often impractical in practice, this method does offer some properties for theoretical analysis. The gradient of our function is
so our gradient update is
where
Feature Construction
Since linear function approximation only uses the inner product, the expressiveness of our state feature vector is crucial to generalization.
- Polynomials across state dimensions allows us to relate multiple states together.
- Fourier basis decomposes signals into periodic functions.
- Coarse coding produces a binary in-or-out vector with respect to some basis; the feature for some component is
if the state is in the designated region or otherwise. - Tile coding is a specific type of coarse coding that breaks the state space into multiple tile grids.
- Radial basis functions, commonly used in 🥢 Generalized Linear Models, are a soft version of coarse coding that measures similarity with Gaussian kernel functions.