Autoencoders are deep learning structures that learn a compressed latent space of the input data. In a sense, autoencoders generalize Principal Component Analysis with a Multilayer Perceptron or Convolutional Neural Network; the hidden layer of the neural network is analogous to the low-dimensional representation that PCA produces.
Using the neural network, we still optimize weights to minimize the reconstruction error $\lVert x - \hat{x} \rVert^2$. To keep the network from learning a trivial identity map, we can constrain it in several ways:
- Bottleneck, where the hidden layer has dimension $k < d$ smaller than the input.
- Input noise, making a de-noising autoencoder.
- Regularization penalties on the weights or activations.
- Forcing orthogonality or independence.
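As a minimal sketch of the bottleneck idea (not from these notes), here is a purely linear autoencoder trained with gradient descent on mean-squared reconstruction error; with no nonlinearity, this recovers the same subspace that PCA would. All variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 200 samples in 8 dimensions that actually lie in a 2-D subspace.
X = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 8))

d, k = 8, 2                                  # input dim d, bottleneck k < d
W_enc = rng.normal(scale=0.1, size=(d, k))   # encoder weights
W_dec = rng.normal(scale=0.1, size=(k, d))   # decoder weights
lr = 0.2

def mse(A, B):
    return float(np.mean((A - B) ** 2))

initial_loss = mse(X, X @ W_enc @ W_dec)

for _ in range(2000):
    Z = X @ W_enc                    # latent code: the "hidden layer"
    X_hat = Z @ W_dec                # reconstruction of the input
    G = 2 * (X_hat - X) / X.size     # d(loss)/d(X_hat)
    g_dec = Z.T @ G                  # gradient for the decoder weights
    g_enc = X.T @ (G @ W_dec.T)      # gradient for the encoder weights
    W_dec -= lr * g_dec
    W_enc -= lr * g_enc

final_loss = mse(X, X @ W_enc @ W_dec)
```

Because the data is exactly rank 2 and the bottleneck is $k = 2$, the reconstruction error can be driven near zero; with $k$ smaller than the data's intrinsic dimension, the network would be forced to keep only the directions of largest variance, as PCA does.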
Stacked Training
One potentially more efficient method of training an autoencoder is the stacking method, which trains layers one at a time.
- For each hidden layer $h_i$, take the activations from the previous layer $h_{i-1}$, process them through $h_i$, and feed the result to a temporary decoder output.
- Optimize $h_i$'s weights to reconstruct $h_{i-1}$ at that output; in other words, we're training each layer to learn to reconstruct the intermediate features of the neural network.
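The greedy layer-wise procedure above can be sketched in NumPy (a hedged illustration, not these notes' own code): each `train_layer_ae` call trains one tanh encoder layer, paired with a throwaway linear decoder, to reconstruct that layer's own input; the learned features then become the next layer's input. All function and variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)

def train_layer_ae(H_in, k, steps=3000, lr=0.05):
    """Train one tanh autoencoder layer to reconstruct its input H_in."""
    n, d = H_in.shape
    W_enc = rng.normal(scale=0.1, size=(d, k))   # this layer's weights
    W_dec = rng.normal(scale=0.1, size=(k, d))   # temporary decoder, discarded after
    for _ in range(steps):
        Z = np.tanh(H_in @ W_enc)        # this layer's features
        H_hat = Z @ W_dec                # reconstruction of the layer's input
        G = 2 * (H_hat - H_in) / H_in.size
        g_dec = Z.T @ G
        g_enc = H_in.T @ ((G @ W_dec.T) * (1 - Z ** 2))  # backprop through tanh
        W_dec -= lr * g_dec
        W_enc -= lr * g_enc
    return W_enc, W_dec, np.tanh(H_in @ W_enc)

# Toy data, then greedily stack two layers: 8 -> 4 -> 2.
X = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 8))
W1_enc, W1_dec, H1 = train_layer_ae(X, k=4)    # layer 1 reconstructs the raw input
W2_enc, W2_dec, H2 = train_layer_ae(H1, k=2)   # layer 2 reconstructs layer 1's features
```

After both greedy passes, the stacked encoder `X -> H1 -> H2` can be fine-tuned end-to-end on the full reconstruction objective if desired.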