Like Law of Large Numbers, it is also a tool for asymptotic analysis.
The core idea of this theorem is that When you add together a large number of random variables, regardless of their individual distributions, their sum (or mean) will tend towards a normal distribution.
Formal Statement
Suppose we have an i.i.d random variables:
The sample mean is defined as:
The expected value of each is , with expected Variance .
From Law of Large Numbers, we know that as , .
We the normalize it:
And we would know that:
Why ?
We already know that since
The Variance of is:
By Weak Law of Large Numbers, we know that , the distribution would be more and more concentrated around as increases, eventually a degenerate distribution at .
To avoid this, we multiply by , thus the variance would be: