Central Limit Theorem and Normal Distribution

Swapna Tamhankar, November 15, 2016, 0 Comments

In probability theory, the central limit theorem (CLT) states that, given certain conditions (large sample size), the arithmetic mean of a sufficiently large number of iterates of independent random variables, each with a well-defined expected value (mean) and finite variance, will be approximately normally distributed, regardless of the underlying distribution.
[latexpage]

So if samples of size n are drawn randomly from a population that has a mean µ and a standard deviation of $ \sigma$, the sample means, are approximately normally distributed for sufficiently large sample sizes (n > 30) regardless of the shape of the population distribution.

The mean of the sample means is same as population µ and its standard deviations is as $ \sigma/\sqrt n$. The altered mean and standard deviations are then used in calculating normal probabilities. The above image is the visual representation of the concept in discussion. As the sample number of observations “n” increases the distribution of the data starts fitting as a bell shaped curve.

About Normal Distribution: The normal distribution is described or characterized by two parameters: the mean µ and the standard deviation σ . The values of µ and σ produce a normal distribution. The density function of the normal distribution is$f(x) = \frac{1}{\sqrt {2\pi\sigma}}
\star e^\frac{-1}{2}(\frac{x-\mu}{ \sigma ^ 2})$

Using Integral Calculus to determine areas under the normal curve from this function is difficult and time- consuming, therefore, virtually all researchers use table values to analyze normal distribution problems rather than this formula. The mechanism was developed by which all normal distributions can be converted into a single distribution: the z distribution. This process yields the standardized normal distribution scores z, also known as Gaussian scores.

The conversion formula for these Gaussian scores is $ Z = (\frac{x-\mu}{ \sigma})$

The normal distribution is also popularly known as the bell shaped distribution. The distribution is symmetric around its mean. The distribution is very robust in nature. It finds applications across various situations in research, social sciences, Biostatistics, business etc.

Significance of CLT:

As a result of central limit theorem, we can use the adjoining altered formula for Z scores so as to use the normal distribution for prediction in analytics. One of the essential conditions to enable us to do so is a large sample size.

$ Z = \frac{\check{x}-\mu}{ (\frac {\sigma}{\sqrt n})}$

In short, the central limit theorem creates the potential for applying the normal distribution to many problems when the sample size is sufficiently large. My take on this central limit theorem is a little philosophical. I call it The Midas Touch.

Tags: biostatistics, business, central limit theorem, normal distribution, research, social sciences

About author

Swapna Tamhankar is a faculty with IBS Business School, Mumbai. She is a mathematics graduate and holds an MBA (Finance) from Mumbai University. ...more

More ways to connect with us..

Channels[+]

About MarketExpress

Partners

MarketExpress Media & Education

Founded in 2011 by IIT-ians, VJTI-ians and Finance Professionals MarketExpress is an online financial, business news, insights and research portal. MarketExpress platform brings together some of the industry's top experts on finance, business, education with a common goal - to provide quality information, insights & analysis.

Emotional Intelligence: Collaboration is Key for Startup Founders Success

How sunscreen protects your skin

Pewabic Pottery: Still handcrafted in Detroit

Smarter Decisions, Faster: The Future of Real-Time Data Analytics

Trending

Information is key but not equal to expertise possessed or the ability to do things in a professional way

Co-founders ( required ), Equity with no Monthly Pay…

Multimillion dollar idea & cofounder pay package

MarketExpress Media & Education