Theory
Principal Component Regression uses ๐๏ธ Principle Component Analysis to provide a regularization effect for regression. After PCA, we can apply OLS linear regression on the embeddings of
If weโre given partially-labeled data, we can use PCR to train with the labels. This is an example of semi-supervised learning, where our dataset is partially unlabeled.
- Calculate PCA on all
, then project labeled to get and train OLS regression only on the labeled data. - Unlabeled data gives some information about the structure of input space, allowing us to train a stronger regression model.
Model
PCR contains the PCA parameters, scores
Training
Given training data
Weโll train a regression model with
Prediction
Given input