ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://www.geeksforgeeks.org/maths/central-limit-theorem/ |
| Last Crawled | 2026-04-08 10:10:43 (1 day ago) |
| First Indexed | 2025-07-01 14:41:24 (9 months ago) |
| HTTP Status Code | 200 |
| Meta Title | Central Limit Theorem in Statistics - GeeksforGeeks |
| Meta Description | Your All-in-One Learning Portal: GeeksforGeeks is a comprehensive educational platform that empowers learners across domains-spanning computer science and programming, school education, upskilling, commerce, software tools, competitive exams, and more., Your All-in-One Learning Portal. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions. |
| Meta Canonical | null |
| Boilerpipe Text | Last Updated :
6 Mar, 2026
One of the most basic principles in statistics, the Central Limit Theorem (CLT) describes how the sample mean distribution changes with increasing sample size.
If the sample is sufficiently large (usually n > 30), then the sample means' distribution will be normally distributed regardless of the underlying population distribution, whether it is normal, skewed, or otherwise.
All types of mean distributions tend to converge to a Normal Distribution as the sample size increases.
This is crucial since, even if population distribution is unknown, statisticians are able to draw inferences about the population based on the sample data. Larger samples are more accurate because CLT also proves that the distribution of the sample mean will have the mean as the population mean, and the standard deviation will reduce with increasing sample size. This theorem forms the basis for many. All types of mean distributions tend to converge to a normal distribution as the sample size increases.
The
Central Limit Theorem
in statistics states that as the sample size increases and its variance is finite, then the distribution of the sample mean approaches the normal distribution, irrespective of the shape of the population distribution.
Central Limit Theorem Formula
Let us assume we have a random variable X. Let σ be its standard deviation, and let μ be the mean of the random variable.
Now, as per the Central Limit Theorem, the sample mean
X
‾
\overline{X}
will approximate a normal distribution, which is given as
X
‾
\overline{X}
⁓ N(μ, σ/√n).
The Z-score of the random variable
X
‾
\overline{X}
is given as Z =
x
‾
−
μ
σ
n
\dfrac{\overline x - \mu}{\frac{\sigma}{\sqrt n}}
. Here
x
‾
\overline x
is the mean
X
‾
\overline X
.
The image of the formula is attached below.
Central Limit Theorem Proof
Let the independent random variables be X
1
, X
2
, X
3
, ..., Xn, which are identically distributed and where their mean is zero (μ = 0) and their variance is one (σ
2
= 1).
The Z score is given as, Z =
X
‾
−
μ
σ
n
=
n
(
X
ˉ
n
−
μ
)
σ
\dfrac{\overline X - \mu}{\frac{\sigma}{\sqrt n}}= \frac{\sqrt{n} (\bar{X}_n - \mu)}{\sigma}
where
X
ˉ
n
=
1
n
∑
i
=
1
n
X
i
.
\bar{X}_n = \frac{1}{n} \sum_{i=1}^n X_i. \:
Here, according to the Central Limit Theorem, Z approximates to a normal distribution as the value of n increases.
i.e.,
Z
n
→
d
N
(
0
,
1
)
as
n
→
∞
Z_n \xrightarrow{d} \mathcal{N}(0,1) \quad \text{as} \quad n \to \infty
Let m(t) be the moment generating function of Xi
⇒ M(0) = 1
⇒ M'(1) = E(Xi) = μ = 0
⇒ M''(0) = E(Xi
2
) = 1
The Moment Generating Function for Xi/√n is given as E[e
tXi/√n
]
Since X1, X
2
, X
3
. . . X
n
are independent, the Moment Generating Function for (X
1
+ X
2
+ X
3
+ . . . + X
n
)/√n is given as [M(t/√n)]
n
Let us assume a function
f(t) = log M(t)
⇒ f(0) = log M(0) = 0
⇒ f'(0) = M'(0)/M(0) = μ/1 = μ
⇒ f''(0) = (M(0). M"(0) - M'(0)²/M'(0)² = 1
Now, using L'Hospital's Rule, we will find t/√n as t
2
/2
⇒ [M(t/√n)]
2
= [e
f(t/√n)
]
n
⇒ [e
nf(t/√n)
] = e^(t
2
/2)
Thus the Central Limit Theorem has been proved by getting the Moment Generating Function of a Standard Normal Distribution.
Central Limit Theorem Example
Let's say we have a large sample of observations and each sample is randomly produced and independent of other observations. Calculate the average of the observations, thus having a collection of averages of observations. Now as per the Central Limit Theorem, if the sample size is adequately large, then the probability distribution of these sample averages will approximate to a normal distribution.
Assumptions of the Central Limit Theorem
The Central Limit Theorem is valid for the following conditions:
The drawing of the sample from the population should be random.
The drawing of the samples should be independent of each other.
The sample size should not exceed ten percent of the total population when sampling is done without replacement.
The sample size should be adequately large.
CLT only holds for a population with finite variance.
Steps to Solve Problems on Central Limit Theorem
Problems of the Central Limit Theorem that involve >, <, or "between" can be solved by the following steps:
Step 1:
First identify the > and < associated with sample size, population size, mean, and variance in the problem. Also there can be 'between,; associated with a range of two numbers.
Step 2:
Draw a Graph with Mean as the Center
Step 3:
Find the Z-Score using the formula
Step 4:
Refer to the Z table to find the value of Z obtained in the previous step.
Step 5:
If the problem involves '>', subtract the Z score from 0.5; if the problem involves '<', add 0.5 to the Z score; and if the problem involves 'between,' then perform only steps 3 and 4.
Step 6:
The Z score value is found along
X
‾
\overline X
Step 7:
Convert the decimal value obtained in all three cases to decimal.
Mean of the Sample Mean
According to the Central Limit Theorem:
If you have a population with a mean μ, the mean of the sample means (also called the expected value of the sample mean) will be equal to the population mean:
E
(
X
ˉ
)
E(\bar{X})
= μ
Standard Deviation of the Sample Mean
The standard deviation of the sample mean (often called the
standard error
) describes how much the sample mean is expected to vary from the true population mean. It is calculated using the population standard deviation σ and the sample size n:
σ
Xˉ
=
σ
n
\frac{\sigma}{\sqrt{n}}
σ
p
^
=
p
(
1
−
p
)
n
\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}}
(For categorical data, the standard error for proportions is calculated using the true population proportion p)
Central Limit Theorem Applications in Computer Science
Performance Analysis & Benchmarking
Measuring latency/response times of systems (e.g., web servers, databases).
The
average latency over many requests
converges to a normal distribution.
Enables use of confidence intervals and parametric tests (t-tests) to compare system optimizations.
A/B Testing & Experimentation
Comparing conversion rates between two website versions.
User conversions are Bernoulli trials (0/1), so the
average conversion rate
(proportion) is approximately normal for large samples.
Validates statistical tests (e.g., Z-tests) to determine if differences are significant. Without CLT, comparing proportions would be less straightforward.
Monte Carlo Simulations
Estimating complex values (e.g., π, financial risks, graphics rendering) via random sampling.
The
simulation output
(e.g., mean of samples) becomes normally distributed around the true value.
Provides
error bounds
(e.g., "estimate ± 2 standard errors") and justifies increasing samples to reduce error.
Machine Learning (ML) & Statistics
Used for model evaluation. Accuracy/F1-scores of ML models over test sets converge to normality, enabling comparison via confidence intervals.
Stochastic Gradient Descent (SGD)
:
Batch gradients are averages of random samples → approximately normal noise.
Feature Engineering: Aggregated features (e.g., mean user interactions per day) often become Gaussian-like, simplifying assumptions for models (e.g., linear regression).
Also Check
Central Limit Theorem in Data Science & Machine Learning
Solved Examples
Example 1.
The male population's weight data follows a normal distribution. It has a mean of 70 kg and a standard deviation of 15 kg. What would the mean and standard deviation of a sample of 50 guys be if a researcher looked at their records?
Given: μ = 70 kg, σ = 15 kg, n = 50
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence,
μ
x
‾
\mu _{\overline{x}}
= μ = 70 kg
Now,
σ
x
‾
=
σ
n
\sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
= 15/√50
⇒
σ
x
‾
\sigma _{\overline{x}}
≈ 2.1 kg
Example 2.
A distribution has a mean of 69 and a standard deviation of 420. Find the mean and standard deviation if a sample of 80 is drawn from the distribution.
Given: μ = 69, σ = 420, n = 80
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence,
μ
x
‾
\mu _{\overline{x}}
= μ = 69
Now,
σ
x
‾
=
σ
n
\sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
⇒
σ
x
‾
\sigma _{\overline{x}}
= 420/√80
⇒
σ
x
‾
\sigma _{\overline{x}}
=
46.95
Example 3.
The mean age of people in a colony is 34 years. Suppose the standard deviation is 15 years. The sample size is 50. Find the mean and standard deviation of the sample.
Given: μ = 34, σ = 15, n = 50
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence,
μ
x
‾
\mu _{\overline{x}}
= μ = 34 years
Now,
σ
x
‾
=
σ
n
\sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
⇒
σ
x
‾
\sigma _{\overline{x}}
= 15/√50
⇒
σ
x
‾
\sigma _{\overline{x}}
=
2.12 years
Example 4.
The mean age of cigarette smokers is 35 years. Suppose the standard deviation is 10 years. The sample size is 39. Find the mean and standard deviation of the sample.
Given: μ = 35, σ = 10, n = 39
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence,
μ
x
‾
\mu _{\overline{x}}
= μ = 35 years
Now,
σ
x
‾
=
σ
n
\sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
= 10/√39
⇒
σ
x
‾
\sigma _{\overline{x}}
=
1.601 years
Example 5.
The mean time taken to read a newspaper is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70. Find its mean and standard deviation.
Given: μ = 8.2, σ = 1, n = 70
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence,
μ
x
‾
\mu _{\overline{x}}
= μ = 8.2 minutes
Now,
σ
x
‾
=
σ
n
\sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
= 1/√70
⇒
σ
x
‾
\sigma _{\overline{x}}
=
0.11 minutes
Example 6.
A distribution has a mean of 12 and a standard deviation of 3. Find the mean and standard deviation if a sample of 36 is drawn from the distribution.
Given: μ = 12, σ = 3, n = 36
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence,
μ
x
‾
\mu _{\overline{x}}
= μ = 12
Now,
σ
x
‾
=
σ
n
\sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
= 3/√36
⇒
σ
x
‾
\sigma _{\overline{x}}
=
0.5
Example 7.
You want to estimate the mean income of a population with a margin of error of $5, assuming the population standard deviation is $50, and you want a 95% confidence level. What sample size do you need?
Given: Z= 1.96 (for 95% confidence level), σ = 50, E = 5
As per the Central Limit Theorem, the formula to calculate the sample size is.
Hence,
n
=
(
Z
×
σ
E
)
2
n = \left( \frac{Z \times \sigma}{E}\right)^2
n
=
(
1.96
×
50
5
)
2
=
(
98
5
)
2
=
(
19.6
)
2
n = \left( \frac{1.96 \times 50} {5} \right)^2 = \left( \frac{98}{5} \right)^2 = (19.6)^2
n = 384.16 (Round up to the nearest whole number)
n = 385
The required sample size is
385
.
Example 8.
Given that the population proportion p=0.40p=0.40p=0.40 and the sample size n=100, calculate the standard error for the sample proportion
p
^
\hat{p}
.
Given: n=100, p=40% or .40.
As per the Central Limit Theorem, the formula to calculate standard error for proportions.
σ
p
^
=
p
(
1
−
p
)
n
=
100
0.40
(
1
−
0.40
)
=
100
0.24
\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}} = \sqrt{\frac{100}{0.40(1 - 0.40)}} = \sqrt{\frac{100}{0.24}}
= 0.04899
σ
p
^
=
0.04899
\sigma_{\hat{p}}=0.04899
σ
p
^
≈
0.04899
\sigma_{\hat{p}} \approx 0.04899
Related Articles
Probability Distribution Function
Standard Deviation
Central Tendency
Practice Problem Based on Central Limit Theorem
Question 1.
Given that the population mean is 50 and the population standard deviation is 10, find the Z-score for a sample mean of 52 when the sample size is 25.
Question 2.
If the population has a standard deviation of 15, and you take a sample of 50 from this population, calculate the standard error of the sample mean.
Question 3.
A population has a mean of 100 and a standard deviation of 20. You take a sample of 36. Calculate the 95% confidence interval for the sample mean.
Question 4.
The average height of adult women in a population is 160 cm with a standard deviation of 10 cm. What is the probability that a random sample of 25 women has a mean height greater than 162 cm?
Answer:-
1
2.12
[93.47, 106.53]
0.1587 |
| Markdown | [](https://www.geeksforgeeks.org/)

- Sign In
- [Courses]()
- [Tutorials]()
- [Interview Prep]()
- [Number System and Arithmetic](https://www.geeksforgeeks.org/engineering-mathematics/number-theory/)
- [Algebra](https://www.geeksforgeeks.org/maths/algebra/)
- [Set Theory](https://www.geeksforgeeks.org/maths/set-theory/)
- [Probability](https://www.geeksforgeeks.org/maths/probability-in-maths/)
- [Statistics](https://www.geeksforgeeks.org/maths/statistics/)
- [Geometry](https://www.geeksforgeeks.org/maths/geometry/)
- [Calculus](https://www.geeksforgeeks.org/maths/math-calculus/)
- [Logarithms](https://www.geeksforgeeks.org/maths/logarithms/)
- [Mensuration](https://www.geeksforgeeks.org/maths/mensuration/)
- [Matrices](https://www.geeksforgeeks.org/engineering-mathematics/matrices/)
# Central Limit Theorem in Statistics
Last Updated : 6 Mar, 2026
One of the most basic principles in statistics, the Central Limit Theorem (CLT) describes how the sample mean distribution changes with increasing sample size.
> If the sample is sufficiently large (usually n \> 30), then the sample means' distribution will be normally distributed regardless of the underlying population distribution, whether it is normal, skewed, or otherwise.

All types of mean distributions tend to converge to a Normal Distribution as the sample size increases.
This is crucial since, even if population distribution is unknown, statisticians are able to draw inferences about the population based on the sample data. Larger samples are more accurate because CLT also proves that the distribution of the sample mean will have the mean as the population mean, and the standard deviation will reduce with increasing sample size. This theorem forms the basis for many. All types of mean distributions tend to converge to a normal distribution as the sample size increases.
The ****Central Limit Theorem**** in statistics states that as the sample size increases and its variance is finite, then the distribution of the sample mean approaches the normal distribution, irrespective of the shape of the population distribution.
## Central Limit Theorem Formula
Let us assume we have a random variable X. Let σ be its standard deviation, and let μ be the mean of the random variable.
- Now, as per the Central Limit Theorem, the sample mean
X
‾
\\overline{X}
X
will approximate a normal distribution, which is given as
X
‾
\\overline{X}
X
⁓ N(μ, σ/√n).
- The Z-score of the random variable
X
‾
\\overline{X}
X
is given as Z =
x
‾
−
μ
σ
n
\\dfrac{\\overline x - \\mu}{\\frac{\\sigma}{\\sqrt n}}
n
σ
x−μ
. Here
x
‾
\\overline x
x
is the mean
X
‾
\\overline X
X
.
The image of the formula is attached below.

## Central Limit Theorem Proof
Let the independent random variables be X1, X2, X3, ..., Xn, which are identically distributed and where their mean is zero (μ = 0) and their variance is one (σ2 = 1).
The Z score is given as, Z = X ‾ − μ σ n \= n ( X ˉ n − μ ) σ \\dfrac{\\overline X - \\mu}{\\frac{\\sigma}{\\sqrt n}}= \\frac{\\sqrt{n} (\\bar{X}\_n - \\mu)}{\\sigma} n σ X−μ \= σ n (Xˉn−μ)
where X ˉ n \= 1 n ∑ i \= 1 n X i . \\bar{X}\_n = \\frac{1}{n} \\sum\_{i=1}^n X\_i. \\: Xˉn\=n1∑i\=1nXi.
Here, according to the Central Limit Theorem, Z approximates to a normal distribution as the value of n increases.
i.e., Z n → d N ( 0 , 1 ) as n → ∞ Z\_n \\xrightarrow{d} \\mathcal{N}(0,1) \\quad \\text{as} \\quad n \\to \\infty Zn d N(0,1)asn→∞
Let m(t) be the moment generating function of Xi
⇒ M(0) = 1
⇒ M'(1) = E(Xi) = μ = 0
⇒ M''(0) = E(Xi2) = 1
The Moment Generating Function for Xi/√n is given as E\[etXi/√n\]
Since X1, X2, X3 . . . Xn are independent, the Moment Generating Function for (X1 \+ X2 \+ X3 + . . . + Xn)/√n is given as \[M(t/√n)\]n
Let us assume a function
f(t) = log M(t)
⇒ f(0) = log M(0) = 0
⇒ f'(0) = M'(0)/M(0) = μ/1 = μ
⇒ f''(0) = (M(0). M"(0) - M'(0)²/M'(0)² = 1
Now, using L'Hospital's Rule, we will find t/√n as t2/2
⇒ \[M(t/√n)\]2 = \[ef(t/√n)\]n
⇒ \[enf(t/√n)\] = e^(t2/2)
Thus the Central Limit Theorem has been proved by getting the Moment Generating Function of a Standard Normal Distribution.
****Central Limit Theorem Example****
> Let's say we have a large sample of observations and each sample is randomly produced and independent of other observations. Calculate the average of the observations, thus having a collection of averages of observations. Now as per the Central Limit Theorem, if the sample size is adequately large, then the probability distribution of these sample averages will approximate to a normal distribution.
## Assumptions of the Central Limit Theorem
The Central Limit Theorem is valid for the following conditions:
- The drawing of the sample from the population should be random.
- The drawing of the samples should be independent of each other.
- The sample size should not exceed ten percent of the total population when sampling is done without replacement.
- The sample size should be adequately large.
- CLT only holds for a population with finite variance.
## Steps to Solve Problems on Central Limit Theorem
Problems of the Central Limit Theorem that involve \>, \<, or "between" can be solved by the following steps:
- ****Step 1:**** First identify the \> and \< associated with sample size, population size, mean, and variance in the problem. Also there can be 'between,; associated with a range of two numbers.
- ****Step 2:**** Draw a Graph with Mean as the Center
- ****Step 3:**** Find the Z-Score using the formula
- ****Step 4:**** Refer to the Z table to find the value of Z obtained in the previous step.
- ****Step 5:**** If the problem involves '\>', subtract the Z score from 0.5; if the problem involves '\<', add 0.5 to the Z score; and if the problem involves 'between,' then perform only steps 3 and 4.
- ****Step 6:**** The Z score value is found along
X
‾
\\overline X
X
- ****Step 7:**** Convert the decimal value obtained in all three cases to decimal.
### Mean of the Sample Mean
According to the Central Limit Theorem:
- If you have a population with a mean μ, the mean of the sample means (also called the expected value of the sample mean) will be equal to the population mean:
> E ( X ˉ ) E(\\bar{X}) E(Xˉ) = μ
### Standard Deviation of the Sample Mean
The standard deviation of the sample mean (often called the ****standard error****) describes how much the sample mean is expected to vary from the true population mean. It is calculated using the population standard deviation σ and the sample size n:
> σXˉ = σ n \\frac{\\sigma}{\\sqrt{n}} n σ
>
> σ p ^ \= p ( 1 − p ) n \\sigma\_{\\hat{p}} = \\sqrt{\\frac{p(1 - p)}{n}} σp^\= np(1−p) (For categorical data, the standard error for proportions is calculated using the true population proportion p)
## Central Limit Theorem Applications in Computer Science
### ****Performance Analysis & Benchmarking****
- Measuring latency/response times of systems (e.g., web servers, databases).
- The ****average latency over many requests**** converges to a normal distribution.
- Enables use of confidence intervals and parametric tests (t-tests) to compare system optimizations.
### ****A/B Testing & Experimentation****
- Comparing conversion rates between two website versions.
- User conversions are Bernoulli trials (0/1), so the ****average conversion rate**** (proportion) is approximately normal for large samples.
- Validates statistical tests (e.g., Z-tests) to determine if differences are significant. Without CLT, comparing proportions would be less straightforward.
### ****Monte Carlo Simulations****
- Estimating complex values (e.g., π, financial risks, graphics rendering) via random sampling.
- The ****simulation output**** (e.g., mean of samples) becomes normally distributed around the true value.
- Provides ****error bounds**** (e.g., "estimate ± 2 standard errors") and justifies increasing samples to reduce error.
### ****Machine Learning (ML) & Statistics****
- Used for model evaluation. Accuracy/F1-scores of ML models over test sets converge to normality, enabling comparison via confidence intervals.
- Stochastic Gradient Descent (SGD)****:**** Batch gradients are averages of random samples → approximately normal noise.
- Feature Engineering: Aggregated features (e.g., mean user interactions per day) often become Gaussian-like, simplifying assumptions for models (e.g., linear regression).
### Also Check
> [Central Limit Theorem in Data Science & Machine Learning](https://www.geeksforgeeks.org/machine-learning/central-limit-theorem-in-data-science-and-data-analytics/)
## Solved Examples
****Example 1.**** The male population's weight data follows a normal distribution. It has a mean of 70 kg and a standard deviation of 15 kg. What would the mean and standard deviation of a sample of 50 guys be if a researcher looked at their records?
> Given: μ = 70 kg, σ = 15 kg, n = 50
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} μx = μ = 70 kg
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} σx\= n σ = 15/√50
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx ****≈ 2.1 kg****
****Example 2.**** A distribution has a mean of 69 and a standard deviation of 420. Find the mean and standard deviation if a sample of 80 is drawn from the distribution.
> Given: μ = 69, σ = 420, n = 80
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} μx = μ = 69
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} σx\= n σ
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx\= 420/√80
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx ****\=**** ****46\.95****
****Example 3.**** The mean age of people in a colony is 34 years. Suppose the standard deviation is 15 years. The sample size is 50. Find the mean and standard deviation of the sample.
> Given: μ = 34, σ = 15, n = 50
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} μx = μ = 34 years
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} σx\= n σ
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx\= 15/√50
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx = ****2\.12 years****
****Example 4.**** The mean age of cigarette smokers is 35 years. Suppose the standard deviation is 10 years. The sample size is 39. Find the mean and standard deviation of the sample.
> Given: μ = 35, σ = 10, n = 39
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} μx = μ = 35 years
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} σx\= n σ = 10/√39
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx = ****1\.601 years****
****Example 5.**** The mean time taken to read a newspaper is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70. Find its mean and standard deviation.
> Given: μ = 8.2, σ = 1, n = 70
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} μx = μ = 8.2 minutes
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} σx\= n σ = 1/√70
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx = ****0\.11 minutes****
****Example 6.**** A distribution has a mean of 12 and a standard deviation of 3. Find the mean and standard deviation if a sample of 36 is drawn from the distribution.
> Given: μ = 12, σ = 3, n = 36
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} μx = μ = 12
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} σx\= n σ = 3/√36
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} σx = ****0\.5****
****Example 7.**** You want to estimate the mean income of a population with a margin of error of \$5, assuming the population standard deviation is \$50, and you want a 95% confidence level. What sample size do you need?
> Given: Z= 1.96 (for 95% confidence level), σ = 50, E = 5
>
> As per the Central Limit Theorem, the formula to calculate the sample size is.
>
> Hence, n \= ( Z × σ E ) 2 n = \\left( \\frac{Z \\times \\sigma}{E}\\right)^2 n\=(EZ×σ)2
>
> n \= ( 1\.96 × 50 5 ) 2 \= ( 98 5 ) 2 \= ( 19\.6 ) 2 n = \\left( \\frac{1.96 \\times 50} {5} \\right)^2 = \\left( \\frac{98}{5} \\right)^2 = (19.6)^2 n\=(51\.96×50)2\=(598)2\=(19\.6)2
>
> n = 384.16 (Round up to the nearest whole number)
> n = 385
>
> The required sample size is ****385****.
****Example 8.**** Given that the population proportion p=0.40p=0.40p=0.40 and the sample size n=100, calculate the standard error for the sample proportion p ^ \\hat{p} p^.
> Given: n=100, p=40% or .40.
>
> As per the Central Limit Theorem, the formula to calculate standard error for proportions.
>
> σ p ^ \= p ( 1 − p ) n \= 100 0\.40 ( 1 − 0\.40 ) \= 100 0\.24 \\sigma\_{\\hat{p}} = \\sqrt{\\frac{p(1 - p)}{n}} = \\sqrt{\\frac{100}{0.40(1 - 0.40)}} = \\sqrt{\\frac{100}{0.24}} σp^\= np(1−p) \= 0\.40(1−0\.40)100 \= 0\.24100 = 0.04899
>
> σ p ^ \= 0\.04899 \\sigma\_{\\hat{p}}=0.04899 σp^\=0\.04899
>
> σ p ^ ≈ 0\.04899 \\sigma\_{\\hat{p}} \\approx 0.04899 σp^≈0\.04899
### Related Articles
> - [Probability Distribution Function](https://www.geeksforgeeks.org/maths/probability-distribution-function/)
> - [Standard Deviation](https://www.geeksforgeeks.org/maths/standard-deviation-formula/)
> - [Central Tendency](https://www.geeksforgeeks.org/data-science/central-tendency/)
## Practice Problem Based on Central Limit Theorem
****Question 1.**** Given that the population mean is 50 and the population standard deviation is 10, find the Z-score for a sample mean of 52 when the sample size is 25.
****Question 2.**** If the population has a standard deviation of 15, and you take a sample of 50 from this population, calculate the standard error of the sample mean.
****Question 3.**** A population has a mean of 100 and a standard deviation of 20. You take a sample of 36. Calculate the 95% confidence interval for the sample mean.
****Question 4.**** The average height of adult women in a population is 160 cm with a standard deviation of 10 cm. What is the probability that a random sample of 25 women has a mean height greater than 162 cm?
****Answer:-****
> 1. 1
> 2. 2\.12
> 3. \[93.47, 106.53\]
> 4. 0\.1587

## Central Limit Theorem in Statistics \| Formula, Derivation, Examples & Proof

## Central Limit Theorem (CLT) in Machine Learning
Central Limit Theorem in Statistics \| Formula, Derivation, Examples & Proof
Comment
[K](https://www.geeksforgeeks.org/user/kamaljeet69420/)
[kamaljeet69420](https://www.geeksforgeeks.org/user/kamaljeet69420/)
3
Article Tags:
Article Tags:
[Mathematics](https://www.geeksforgeeks.org/category/school-learning/maths/)
[School Learning](https://www.geeksforgeeks.org/category/school-learning/)
[Class 12](https://www.geeksforgeeks.org/category/school-learning/class-12/)
[Maths-Formulas](https://www.geeksforgeeks.org/tag/maths-formulas/)
[Maths-Class-12](https://www.geeksforgeeks.org/tag/maths-class-12/)
[\+1 More]()
### Explore
[](https://www.geeksforgeeks.org/)

Corporate & Communications Address:
A-143, 7th Floor, Sovereign Corporate Tower, Sector- 136, Noida, Uttar Pradesh (201305)

Registered Address:
K 061, Tower K, Gulshan Vivante Apartment, Sector 137, Noida, Gautam Buddh Nagar, Uttar Pradesh, 201305
[](https://geeksforgeeksapp.page.link/gfg-app)[](https://geeksforgeeksapp.page.link/gfg-app)
- Company
- [About Us](https://www.geeksforgeeks.org/about/)
- [Legal](https://www.geeksforgeeks.org/legal/)
- [Privacy Policy](https://www.geeksforgeeks.org/legal/privacy-policy/)
- [Contact Us](https://www.geeksforgeeks.org/about/contact-us/)
- [Advertise with us](https://www.geeksforgeeks.org/advertise-with-us/)
- [GFG Corporate Solution](https://www.geeksforgeeks.org/gfg-corporate-solution/)
- [Campus Training Program](https://www.geeksforgeeks.org/campus-training-program/)
- Explore
- [POTD](https://www.geeksforgeeks.org/problem-of-the-day)
- [Job-A-Thon](https://practice.geeksforgeeks.org/events/rec/job-a-thon/)
- [Blogs](https://www.geeksforgeeks.org/category/blogs/?type=recent)
- [Nation Skill Up](https://www.geeksforgeeks.org/nation-skill-up/)
- Tutorials
- [Programming Languages](https://www.geeksforgeeks.org/computer-science-fundamentals/programming-language-tutorials/)
- [DSA](https://www.geeksforgeeks.org/dsa/dsa-tutorial-learn-data-structures-and-algorithms/)
- [Web Technology](https://www.geeksforgeeks.org/web-tech/web-technology/)
- [AI, ML & Data Science](https://www.geeksforgeeks.org/machine-learning/ai-ml-and-data-science-tutorial-learn-ai-ml-and-data-science/)
- [DevOps](https://www.geeksforgeeks.org/devops/devops-tutorial/)
- [CS Core Subjects](https://www.geeksforgeeks.org/gate/gate-exam-tutorial/)
- [Interview Preparation](https://www.geeksforgeeks.org/aptitude/interview-corner/)
- [Software and Tools](https://www.geeksforgeeks.org/websites-apps/software-and-tools-a-to-z-list/)
- Courses
- [ML and Data Science](https://www.geeksforgeeks.org/courses/category/machine-learning-data-science)
- [DSA and Placements](https://www.geeksforgeeks.org/courses/category/dsa-placements)
- [Web Development](https://www.geeksforgeeks.org/courses/category/development-testing)
- [Programming Languages](https://www.geeksforgeeks.org/courses/category/programming-languages)
- [DevOps & Cloud](https://www.geeksforgeeks.org/courses/category/cloud-devops)
- [GATE](https://www.geeksforgeeks.org/courses/category/gate)
- [Trending Technologies](https://www.geeksforgeeks.org/courses/category/trending-technologies/)
- Videos
- [DSA](https://www.geeksforgeeks.org/videos/category/sde-sheet/)
- [Python](https://www.geeksforgeeks.org/videos/category/python/)
- [Java](https://www.geeksforgeeks.org/videos/category/java-w6y5f4/)
- [C++](https://www.geeksforgeeks.org/videos/category/c/)
- [Web Development](https://www.geeksforgeeks.org/videos/category/web-development/)
- [Data Science](https://www.geeksforgeeks.org/videos/category/data-science/)
- [CS Subjects](https://www.geeksforgeeks.org/videos/category/cs-subjects/)
- Preparation Corner
- [Interview Corner](https://www.geeksforgeeks.org/interview-prep/interview-corner/)
- [Aptitude](https://www.geeksforgeeks.org/aptitude/aptitude-questions-and-answers/)
- [Puzzles](https://www.geeksforgeeks.org/aptitude/puzzles/)
- [GfG 160](https://www.geeksforgeeks.org/courses/gfg-160-series)
- [System Design](https://www.geeksforgeeks.org/system-design/system-design-tutorial/)
[@GeeksforGeeks, Sanchhaya Education Private Limited](https://www.geeksforgeeks.org/), [All rights reserved](https://www.geeksforgeeks.org/copyright-information/)
![]() |
| Readable Markdown | Last Updated : 6 Mar, 2026
One of the most basic principles in statistics, the Central Limit Theorem (CLT) describes how the sample mean distribution changes with increasing sample size.
> If the sample is sufficiently large (usually n \> 30), then the sample means' distribution will be normally distributed regardless of the underlying population distribution, whether it is normal, skewed, or otherwise.

All types of mean distributions tend to converge to a Normal Distribution as the sample size increases.
This is crucial since, even if population distribution is unknown, statisticians are able to draw inferences about the population based on the sample data. Larger samples are more accurate because CLT also proves that the distribution of the sample mean will have the mean as the population mean, and the standard deviation will reduce with increasing sample size. This theorem forms the basis for many. All types of mean distributions tend to converge to a normal distribution as the sample size increases.
The ****Central Limit Theorem**** in statistics states that as the sample size increases and its variance is finite, then the distribution of the sample mean approaches the normal distribution, irrespective of the shape of the population distribution.
## Central Limit Theorem Formula
Let us assume we have a random variable X. Let σ be its standard deviation, and let μ be the mean of the random variable.
- Now, as per the Central Limit Theorem, the sample mean
X
‾
\\overline{X}
will approximate a normal distribution, which is given as
X
‾
\\overline{X}
⁓ N(μ, σ/√n).
- The Z-score of the random variable
X
‾
\\overline{X}
is given as Z =
x
‾
−
μ
σ
n
\\dfrac{\\overline x - \\mu}{\\frac{\\sigma}{\\sqrt n}}
. Here
x
‾
\\overline x
is the mean
X
‾
\\overline X
.
The image of the formula is attached below.

## Central Limit Theorem Proof
Let the independent random variables be X1, X2, X3, ..., Xn, which are identically distributed and where their mean is zero (μ = 0) and their variance is one (σ2 = 1).
The Z score is given as, Z = X ‾ − μ σ n \= n ( X ˉ n − μ ) σ \\dfrac{\\overline X - \\mu}{\\frac{\\sigma}{\\sqrt n}}= \\frac{\\sqrt{n} (\\bar{X}\_n - \\mu)}{\\sigma}
where X ˉ n \= 1 n ∑ i \= 1 n X i . \\bar{X}\_n = \\frac{1}{n} \\sum\_{i=1}^n X\_i. \\:
Here, according to the Central Limit Theorem, Z approximates to a normal distribution as the value of n increases.
i.e., Z n → d N ( 0 , 1 ) as n → ∞ Z\_n \\xrightarrow{d} \\mathcal{N}(0,1) \\quad \\text{as} \\quad n \\to \\infty
Let m(t) be the moment generating function of Xi
⇒ M(0) = 1
⇒ M'(1) = E(Xi) = μ = 0
⇒ M''(0) = E(Xi2) = 1
The Moment Generating Function for Xi/√n is given as E\[etXi/√n\]
Since X1, X2, X3 . . . Xn are independent, the Moment Generating Function for (X1 \+ X2 \+ X3 + . . . + Xn)/√n is given as \[M(t/√n)\]n
Let us assume a function
f(t) = log M(t)
⇒ f(0) = log M(0) = 0
⇒ f'(0) = M'(0)/M(0) = μ/1 = μ
⇒ f''(0) = (M(0). M"(0) - M'(0)²/M'(0)² = 1
Now, using L'Hospital's Rule, we will find t/√n as t2/2
⇒ \[M(t/√n)\]2 = \[ef(t/√n)\]n
⇒ \[enf(t/√n)\] = e^(t2/2)
Thus the Central Limit Theorem has been proved by getting the Moment Generating Function of a Standard Normal Distribution.
****Central Limit Theorem Example****
> Let's say we have a large sample of observations and each sample is randomly produced and independent of other observations. Calculate the average of the observations, thus having a collection of averages of observations. Now as per the Central Limit Theorem, if the sample size is adequately large, then the probability distribution of these sample averages will approximate to a normal distribution.
## Assumptions of the Central Limit Theorem
The Central Limit Theorem is valid for the following conditions:
- The drawing of the sample from the population should be random.
- The drawing of the samples should be independent of each other.
- The sample size should not exceed ten percent of the total population when sampling is done without replacement.
- The sample size should be adequately large.
- CLT only holds for a population with finite variance.
## Steps to Solve Problems on Central Limit Theorem
Problems of the Central Limit Theorem that involve \>, \<, or "between" can be solved by the following steps:
- ****Step 1:**** First identify the \> and \< associated with sample size, population size, mean, and variance in the problem. Also there can be 'between,; associated with a range of two numbers.
- ****Step 2:**** Draw a Graph with Mean as the Center
- ****Step 3:**** Find the Z-Score using the formula
- ****Step 4:**** Refer to the Z table to find the value of Z obtained in the previous step.
- ****Step 5:**** If the problem involves '\>', subtract the Z score from 0.5; if the problem involves '\<', add 0.5 to the Z score; and if the problem involves 'between,' then perform only steps 3 and 4.
- ****Step 6:**** The Z score value is found along
X
‾
\\overline X
- ****Step 7:**** Convert the decimal value obtained in all three cases to decimal.
### Mean of the Sample Mean
According to the Central Limit Theorem:
- If you have a population with a mean μ, the mean of the sample means (also called the expected value of the sample mean) will be equal to the population mean:
> E ( X ˉ ) E(\\bar{X}) = μ
### Standard Deviation of the Sample Mean
The standard deviation of the sample mean (often called the ****standard error****) describes how much the sample mean is expected to vary from the true population mean. It is calculated using the population standard deviation σ and the sample size n:
> σXˉ = σ n \\frac{\\sigma}{\\sqrt{n}}
>
> σ p ^ \= p ( 1 − p ) n \\sigma\_{\\hat{p}} = \\sqrt{\\frac{p(1 - p)}{n}} (For categorical data, the standard error for proportions is calculated using the true population proportion p)
## Central Limit Theorem Applications in Computer Science
### ****Performance Analysis & Benchmarking****
- Measuring latency/response times of systems (e.g., web servers, databases).
- The ****average latency over many requests**** converges to a normal distribution.
- Enables use of confidence intervals and parametric tests (t-tests) to compare system optimizations.
### ****A/B Testing & Experimentation****
- Comparing conversion rates between two website versions.
- User conversions are Bernoulli trials (0/1), so the ****average conversion rate**** (proportion) is approximately normal for large samples.
- Validates statistical tests (e.g., Z-tests) to determine if differences are significant. Without CLT, comparing proportions would be less straightforward.
### ****Monte Carlo Simulations****
- Estimating complex values (e.g., π, financial risks, graphics rendering) via random sampling.
- The ****simulation output**** (e.g., mean of samples) becomes normally distributed around the true value.
- Provides ****error bounds**** (e.g., "estimate ± 2 standard errors") and justifies increasing samples to reduce error.
### ****Machine Learning (ML) & Statistics****
- Used for model evaluation. Accuracy/F1-scores of ML models over test sets converge to normality, enabling comparison via confidence intervals.
- Stochastic Gradient Descent (SGD)****:**** Batch gradients are averages of random samples → approximately normal noise.
- Feature Engineering: Aggregated features (e.g., mean user interactions per day) often become Gaussian-like, simplifying assumptions for models (e.g., linear regression).
### Also Check
> [Central Limit Theorem in Data Science & Machine Learning](https://www.geeksforgeeks.org/machine-learning/central-limit-theorem-in-data-science-and-data-analytics/)
## Solved Examples
****Example 1.**** The male population's weight data follows a normal distribution. It has a mean of 70 kg and a standard deviation of 15 kg. What would the mean and standard deviation of a sample of 50 guys be if a researcher looked at their records?
> Given: μ = 70 kg, σ = 15 kg, n = 50
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} = μ = 70 kg
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} = 15/√50
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} ****≈ 2.1 kg****
****Example 2.**** A distribution has a mean of 69 and a standard deviation of 420. Find the mean and standard deviation if a sample of 80 is drawn from the distribution.
> Given: μ = 69, σ = 420, n = 80
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} = μ = 69
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}}
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}}\= 420/√80
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} ****\=**** ****46\.95****
****Example 3.**** The mean age of people in a colony is 34 years. Suppose the standard deviation is 15 years. The sample size is 50. Find the mean and standard deviation of the sample.
> Given: μ = 34, σ = 15, n = 50
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} = μ = 34 years
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}}
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}}\= 15/√50
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} = ****2\.12 years****
****Example 4.**** The mean age of cigarette smokers is 35 years. Suppose the standard deviation is 10 years. The sample size is 39. Find the mean and standard deviation of the sample.
> Given: μ = 35, σ = 10, n = 39
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} = μ = 35 years
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} = 10/√39
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} = ****1\.601 years****
****Example 5.**** The mean time taken to read a newspaper is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70. Find its mean and standard deviation.
> Given: μ = 8.2, σ = 1, n = 70
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} = μ = 8.2 minutes
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} = 1/√70
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} = ****0\.11 minutes****
****Example 6.**** A distribution has a mean of 12 and a standard deviation of 3. Find the mean and standard deviation if a sample of 36 is drawn from the distribution.
> Given: μ = 12, σ = 3, n = 36
>
> As per the Central Limit Theorem, the sample mean is equal to the population mean.
>
> Hence, μ x ‾ \\mu \_{\\overline{x}} = μ = 12
>
> Now, σ x ‾ \= σ n \\sigma \_{\\overline{x}}=\\frac{\\sigma }{\\sqrt{n}} = 3/√36
>
> ⇒ σ x ‾ \\sigma \_{\\overline{x}} = ****0\.5****
****Example 7.**** You want to estimate the mean income of a population with a margin of error of \$5, assuming the population standard deviation is \$50, and you want a 95% confidence level. What sample size do you need?
> Given: Z= 1.96 (for 95% confidence level), σ = 50, E = 5
>
> As per the Central Limit Theorem, the formula to calculate the sample size is.
>
> Hence, n \= ( Z × σ E ) 2 n = \\left( \\frac{Z \\times \\sigma}{E}\\right)^2
>
> n \= ( 1\.96 × 50 5 ) 2 \= ( 98 5 ) 2 \= ( 19\.6 ) 2 n = \\left( \\frac{1.96 \\times 50} {5} \\right)^2 = \\left( \\frac{98}{5} \\right)^2 = (19.6)^2
>
> n = 384.16 (Round up to the nearest whole number)
> n = 385
>
> The required sample size is ****385****.
****Example 8.**** Given that the population proportion p=0.40p=0.40p=0.40 and the sample size n=100, calculate the standard error for the sample proportion p ^ \\hat{p}.
> Given: n=100, p=40% or .40.
>
> As per the Central Limit Theorem, the formula to calculate standard error for proportions.
>
> σ p ^ \= p ( 1 − p ) n \= 100 0\.40 ( 1 − 0\.40 ) \= 100 0\.24 \\sigma\_{\\hat{p}} = \\sqrt{\\frac{p(1 - p)}{n}} = \\sqrt{\\frac{100}{0.40(1 - 0.40)}} = \\sqrt{\\frac{100}{0.24}} = 0.04899
>
> σ p ^ \= 0\.04899 \\sigma\_{\\hat{p}}=0.04899
>
> σ p ^ ≈ 0\.04899 \\sigma\_{\\hat{p}} \\approx 0.04899
### Related Articles
> - [Probability Distribution Function](https://www.geeksforgeeks.org/maths/probability-distribution-function/)
> - [Standard Deviation](https://www.geeksforgeeks.org/maths/standard-deviation-formula/)
> - [Central Tendency](https://www.geeksforgeeks.org/data-science/central-tendency/)
## Practice Problem Based on Central Limit Theorem
****Question 1.**** Given that the population mean is 50 and the population standard deviation is 10, find the Z-score for a sample mean of 52 when the sample size is 25.
****Question 2.**** If the population has a standard deviation of 15, and you take a sample of 50 from this population, calculate the standard error of the sample mean.
****Question 3.**** A population has a mean of 100 and a standard deviation of 20. You take a sample of 36. Calculate the 95% confidence interval for the sample mean.
****Question 4.**** The average height of adult women in a population is 160 cm with a standard deviation of 10 cm. What is the probability that a random sample of 25 women has a mean height greater than 162 cm?
****Answer:-****
> 1. 1
> 2. 2\.12
> 3. \[93.47, 106.53\]
> 4. 0\.1587 |
| Shard | 103 (laksa) |
| Root Hash | 12046344915360636903 |
| Unparsed URL | org,geeksforgeeks!www,/maths/central-limit-theorem/ s443 |