
Empirical Rule
The empirical rule, also referred to as the three-sigma rule or 68-95-99.7 rule, is a statistical rule which states that for a normal distribution, almost all observed data will fall within three standard deviations (denoted by σ) of the mean or average (denoted by µ). In particular, the empirical rule predicts that 68% of observations falls within the first standard deviation (µ ± σ), 95% within the first two standard deviations (µ ± 2σ), and 99.7% within the first three standard deviations (µ ± 3σ). The Empirical Rule states that 99.7% of data observed following a normal distribution lies within 3 standard deviations of the mean. Under this rule, 68% of the data falls within one standard deviation, 95% percent within two standard deviations, and 99.7% within three standard deviations from the mean. The empirical rule, also referred to as the three-sigma rule or 68-95-99.7 rule, is a statistical rule which states that for a normal distribution, almost all observed data will fall within three standard deviations (denoted by σ) of the mean or average (denoted by µ). In particular, the empirical rule predicts that 68% of observations falls within the first standard deviation (µ ± σ), 95% within the first two standard deviations (µ ± 2σ), and 99.7% within the first three standard deviations (µ ± 3σ). The Empirical Rule states that 99.7% of data observed following a normal distribution lies within 3 standard deviations of the mean. Under this rule, 68% of the data falls within one standard deviation, 95% percent within two standard deviations, and 99.7% within three standard deviations from the mean. Knowing the distribution's mean is 13.1 years old, the following age ranges occur for each standard deviation: One standard deviation (µ ± σ): (13.1 - 1.5) to (13.1 + 1.5), or 11.6 to 14.6 Two standard deviations (µ ± 2σ): 13.1 - (2 x 1.5) to 13.1 + (2 x 1.5), or 10.1 to 16.1 Three standard deviations (µ ± 3σ): 13.1 - (3 x 1.5) to 13.1 + (3 x 1.5), or, 8.6 to 17.6 This distribution looks as follows: One standard deviation (µ ± σ): 8.6 to 11.4 years Two standard deviations (µ ± 2σ): 7.2 to 12.8 years Three standard deviations ((µ ± 3σ): 5.8 to 14.2 years In statistics, the empirical rule states that 99.7% of data occurs within three standard deviations of the mean within a normal distribution.

What Is the Empirical Rule?
The empirical rule, also referred to as the three-sigma rule or 68-95-99.7 rule, is a statistical rule which states that for a normal distribution, almost all observed data will fall within three standard deviations (denoted by σ) of the mean or average (denoted by µ).
In particular, the empirical rule predicts that 68% of observations falls within the first standard deviation (µ ± σ), 95% within the first two standard deviations (µ ± 2σ), and 99.7% within the first three standard deviations (µ ± 3σ).



Understanding the Empirical Rule
The empirical rule is used often in statistics for forecasting final outcomes. After calculating the standard deviation and before collecting exact data, this rule can be used as a rough estimate of the outcome of the impending data to be collected and analyzed.
This probability distribution can thus be used as an interim heuristic since gathering the appropriate data may be time-consuming or even impossible in some cases. Such considerations come into play when a firm is reviewing its quality control measures or evaluating its risk exposure. For instance, the frequently used risk tool known as value-at-risk (VaR) assumes that the probability of risk events follows a normal distribution.
The empirical rule is also used as a rough way to test a distribution's "normality". If too many data points fall outside the three standard deviation boundaries, this suggests that the distribution is not normal and may be skewed or follow some other distribution.
The empirical rule is also known as the three-sigma rule, as "three-sigma" refers to a statistical distribution of data within three standard deviations from the mean on a normal distribution (bell curve), as indicated by the figure below.
Image by Julie Bang © Investopedia 2019
Examples of the Empirical Rule
Let's assume a population of animals in a zoo is known to be normally distributed. Each animal lives to be 13.1 years old on average (mean), and the standard deviation of the lifespan is 1.5 years. If someone wants to know the probability that an animal will live longer than 14.6 years, they could use the empirical rule. Knowing the distribution's mean is 13.1 years old, the following age ranges occur for each standard deviation:
The person solving this problem needs to calculate the total probability of the animal living 14.6 years or longer. The empirical rule shows that 68% of the distribution lies within one standard deviation, in this case, from 11.6 to 14.6 years. Thus, the remaining 32% of the distribution lies outside this range. One half lies above 14.6 and the other below 11.6. So, the probability of the animal living for more than 14.6 is 16% (calculated as 32% divided by two).
As another example, assume instead that an animal in the zoo lives to an average of 10 years of age, with a standard deviation of 1.4 years. Assume the zookeeper attempts to figure out the probability of an animal living for more than 7.2 years. This distribution looks as follows:
The empirical rule states that 95% of the distribution lies within two standard deviations. Thus, 5% lies outside of two standard deviations; half above 12.8 years and half below 7.2 years. Thus, the probability of living for more than 7.2 years is:
95% + (5% / 2) = 97.5%
What Is the Empirical Rule?
In statistics, the empirical rule states that 99.7% of data occurs within three standard deviations of the mean within a normal distribution. To this end, 68% of the observed data will occur within the first standard deviation, 95% will take place in the second deviation, and 97.5% within the third standard deviation. The empirical rule predicts the probability distribution for a set of outcomes.
How Is the Empirical Rule Used?
The empirical rule is applied to anticipate probable outcomes in a normal distribution. For instance, a statistician would use this to estimate the percentage of cases that fall in each standard deviation. Consider that the standard deviation is 3.1 and the mean equals 10. In this case, the first standard deviation would range between (10+3.2)= 13.2 and (10-3.2)= 6.8. The second deviation would fall between 10 + (2 X 3.2)= 16.4 and 10 - (2 X 3.2)= 3.6, and so forth.
What Are the Benefits of the Empirical Rule?
The empirical rule is beneficial because it serves as a means of forecasting data. This is especially true when it comes to large datasets and those where variables are unknown. In finance specifically, the empirical rule is germane to stock prices, price indices, and log values of forex rates, which all tend to fall across a bell curve or normal distribution.
Related terms:
Bell Curve
A bell curve describes the shape of data conforming to a normal distribution. read more
Business Valuation , Methods, & Examples
Business valuation is the process of estimating the value of a business or company. read more
Monte Carlo Simulation
Monte Carlo simulations are used to model the probability of different outcomes in a process that cannot easily be predicted. read more
Normal Distribution
Normal distribution is a continuous probability distribution wherein values lie in a symmetrical fashion mostly situated around the mean. read more
Probability Distribution
A probability distribution is a statistical function that describes possible values and likelihoods that a random variable can take within a given range. read more
Quality Control Chart
A quality control chart is a graphic that depicts whether sampled products or processes are meeting their intended specifications. read more
Standard Deviation
The standard deviation is a statistic that measures the dispersion of a dataset relative to its mean. It is calculated as the square root of variance by determining the variation between each data point relative to the mean. read more
T Distribution
A T distribution is a type of probability function that is appropriate for estimating population parameters for small sample sizes or unknown variances. read more
Three-Sigma Limits
Three-Sigma Limits is a statistical calculation that refers to data within three standard deviations from a mean. read more
Value at Risk (VaR)
Value at risk (VaR) is a statistic that quantifies the level of financial risk within a firm, portfolio, or position over a specific time frame. read more