Supervised learning learns an accurate model
Note
We can use any function, simple or complex, to fit the data. However, the ๐ฅช No Free Lunch Theorem states that thereโs no single algorithm that performs best on all data. All models require built-in biases, and itโs our job to choose a class
that works for our data.
Moreover, we want the model to generalize well. Mathematically, our goal is to minimize risk
where
Non-Parametric Models
Non-parametric models donโt optimize any weights.
- ๐ K-Nearest Neighbors compares an input with the known data.
- ๐ญ Decision Trees split up the data on features that give the most information about the label.
- ๐ฏ Kernel Regression is a soft form of KNN that considers example similarity.
An ๐ป Ensemble combines non-parametric models to generate more accurate predictions.
Parametric Models
Parametric models, on the other hand, have a set of weights to optimize. Many optimize the their probability likelihood using ๐ช Maximum Likelihood Estimate or ๐ฒ Maximum A Posteriori, while others use more model-specific methods.
While thereโs a huge diversity of solutions, parametric models can be categorized into the type of problem they solve: regression or classification.
Regression
Regression problems deal with predicting continuous
- ๐ฆ Linear Regression fits a line to linear data.
- ๐ฅข Generalized Linear Model adapts this idea to work for non-linear data.
- ๐จ Principle Component Regression combines dimensionality reduction with linear regression.
Classification
Classification problems, on the other hand, work with discrete
- ๐ฆ Logistic Regression is an extension of linear regression that outputs class probabilities.
- ๐ฉ๏ธ Support Vector Machines divide the data into two classes with a hyperplane.
- ๐ถ Naive Bayes classifies extremely large features sets by assuming that features are conditionally independent given the label.
Online Learning
There are some online variants of these algorithms that train via streaming, going through each datapoint one-by-one and updating the weights.
- ๐ผ Least Mean Squares is an online version of linear regression.
- ๐ Perceptrons perform online updates for SVMs.
If thereโs not enough labeled data, โ Active Learning strategies identify the most useful datapoints to label next.