Summary statistics describe how a Random Variable behaves, essentially summarizing its distribution.
Info
The following assumes that we're operating over a continuous space. However, the equations take the same form for discrete random variables if we replace the integration with a summation.
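As an illustration of that substitution, take the mean. The notation here is an assumption: $p(x)$ stands for the density in the continuous case and for the probability mass function in the discrete case.

$$
\mathbb{E}[X] = \int x \, p(x) \, dx \qquad \text{(continuous)}, \qquad \mathbb{E}[X] = \sum_{x} x \, p(x) \qquad \text{(discrete)}
$$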
If we have only a small empirical sample of the random variable instead of access to the whole population, we can only use the observed values $x_1, \ldots, x_n$ to estimate the true summary statistics. Then, our sample mean, sample variance, and sample covariance matrix equations are as follows:
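In standard notation, these three estimators can be written as below. The symbols are assumptions made for this sketch: $n$ is the sample size, $x_i$ is the $i$-th observation (a vector in the covariance case), $\bar{x}$ is the sample mean, $s^2$ is the sample variance, and $S$ is the sample covariance matrix.

$$
\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i
$$

$$
s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2
$$

$$
S = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})(x_i - \bar{x})^\top
$$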
Note
Note that for our sample variance, we divide by $n - 1$ instead of $n$ to account for the variance of the sample mean $\bar{x}$. Since we're basing the sample variance on an empirical estimate of the mean instead of the true mean, we have to adjust for its inaccuracy accordingly. This formula can be derived mathematically, and we can show that the resulting sample variance $s^2$ satisfies $\mathbb{E}[s^2] = \sigma^2$, where $\sigma^2$ is the true variance.
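The effect of the divisor can also be checked numerically. The following is a minimal sketch, not a derivation: it repeatedly draws small Gaussian samples and averages the two variance estimates. The function names, the `ddof` parameter, and the simulation settings (true standard deviation 2, samples of size 5, 50,000 trials) are all assumptions chosen for illustration.

```python
import random


def sample_mean(xs):
    """Arithmetic mean of a list of numbers."""
    return sum(xs) / len(xs)


def sample_variance(xs, ddof=1):
    """Sum of squared deviations from the sample mean, divided by (n - ddof).

    ddof=1 divides by n - 1; ddof=0 divides by n.
    """
    m = sample_mean(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - ddof)


def average_variance_estimates(true_std=2.0, n=5, trials=50_000, seed=0):
    """Average both variance estimates over many independent small samples."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    biased_total = 0.0
    unbiased_total = 0.0
    for _ in range(trials):
        xs = [rng.gauss(1.0, true_std) for _ in range(n)]
        biased_total += sample_variance(xs, ddof=0)
        unbiased_total += sample_variance(xs, ddof=1)
    return biased_total / trials, unbiased_total / trials


biased_avg, unbiased_avg = average_variance_estimates()
print(f"divide by n:     {biased_avg:.3f}")
print(f"divide by n - 1: {unbiased_avg:.3f}")
```

With a true variance of $2^2 = 4$ and $n = 5$, the average of the divide-by-$n$ estimate should land near $\frac{n-1}{n} \cdot 4 = 3.2$, while the divide-by-$(n-1)$ estimate should land near $4$.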