Probability Distribution

Probability Distribution

A probability distribution is a statistical function that describes all the possible values and likelihoods that a random variable can take within a given range. Some of them include the normal distribution, chi square distribution, binomial distribution, and Poisson distribution. Alternatively, an investor can get a probability of loss for an amount of loss and time frame using VaR. Misuse and overeliance on VaR has been implicated as one of the major causes of the 2008 financial crisis. As a simple example of a probability distribution, let us look at the number observed when rolling two standard six-sided dice. The binomial distribution, for example, evaluates the probability of an event occurring several times over a given number of trials and given the event's probability in each trial. The normal distribution is fully characterized by its mean and standard deviation, meaning the distribution is not skewed and does exhibit kurtosis.

A probability distribution depicts the expected outcomes of possible values for a given data generating process.

What Is a Probability Distribution?

A probability distribution is a statistical function that describes all the possible values and likelihoods that a random variable can take within a given range. This range will be bounded between the minimum and maximum possible values, but precisely where the possible value is likely to be plotted on the probability distribution depends on a number of factors. These factors include the distribution's mean (average), standard deviation, skewness, and kurtosis.

A probability distribution depicts the expected outcomes of possible values for a given data generating process.
Probability distributions come in many shapes with different characteristics, as defined by the mean, standard deviation, skewness, and kurtosis.
Investors use probability distributions to anticipate returns on assets such as stocks over time and to hedge their risk.

How Probability Distributions Work

Perhaps the most common probability distribution is the normal distribution, or "bell curve," although several distributions exist that are commonly used. Typically, the data generating process of some phenomenon will dictate its probability distribution. This process is called the probability density function.

Probability distributions can also be used to create cumulative distribution functions (CDFs), which adds up the probability of occurrences cumulatively and will always start at zero and end at 100%.

Academics, financial analysts and fund managers alike may determine a particular stock's probability distribution to evaluate the possible expected returns that the stock may yield in the future. The stock's history of returns, which can be measured from any time interval, will likely be composed of only a fraction of the stock's returns, which will subject the analysis to sampling error. By increasing the sample size, this error can be dramatically reduced.

Types of Probability Distributions

There are many different classifications of probability distributions. Some of them include the normal distribution, chi square distribution, binomial distribution, and Poisson distribution. The different probability distributions serve different purposes and represent different data generation processes. The binomial distribution, for example, evaluates the probability of an event occurring several times over a given number of trials and given the event's probability in each trial. and may be generated by keeping track of how many free throws a basketball player makes in a game, where 1 = a basket and 0 = a miss. Another typical example would be to use a fair coin and figuring the probability of that coin coming up heads in 10 straight flips. A binomial distribution is discrete, as opposed to continuous, since only 1 or 0 is a valid response.

The most commonly used distribution is the normal distribution, which is used frequently in finance, investing, science, and engineering. The normal distribution is fully characterized by its mean and standard deviation, meaning the distribution is not skewed and does exhibit kurtosis. This makes the distribution symmetric and it is depicted as a bell-shaped curve when plotted. A normal distribution is defined by a mean (average) of zero and a standard deviation of 1.0, with a skew of zero and kurtosis = 3. In a normal distribution, approximately 68% of the data collected will fall within +/- one standard deviation of the mean; approximately 95% within +/- two standard deviations; and 99.7% within three standard deviations. Unlike the binomial distribution, the normal distribution is continuous, meaning that all possible values are represented (as opposed to just 0 and 1 with nothing in between).

Probability Distributions Used in Investing

Stock returns are often assumed to be normally distributed but in reality, they exhibit kurtosis with large negative and positive returns seeming to occur more than would be predicted by a normal distribution. In fact, because stock prices are bounded by zero but offer a potential unlimited upside, the distribution of stock returns has been described as log-normal. This shows up on a plot of stock returns with the tails of the distribution having greater thickness.

Probability distributions are often used in risk management as well to evaluate the probability and amount of losses that an investment portfolio would incur based on a distribution of historical returns. One popular risk management metric used in investing is value-at-risk (VaR). VaR yields the minimum loss that can occur given a probability and time frame for a portfolio. Alternatively, an investor can get a probability of loss for an amount of loss and time frame using VaR. Misuse and overeliance on VaR has been implicated as one of the major causes of the 2008 financial crisis.

Example of a Probability Distribution

As a simple example of a probability distribution, let us look at the number observed when rolling two standard six-sided dice. Each die has a 1/6 probability of rolling any single number, one through six, but the sum of two dice will form the probability distribution depicted in the image below. Seven is the most common outcome (1+6, 6+1, 5+2, 2+5, 3+4, 4+3). Two and twelve, on the other hand, are far less likely (1+1 and 6+6).

Image

Image by Sabrina Jiang © Investopedia 2020

Related terms:

Bell Curve

A bell curve describes the shape of data conforming to a normal distribution. read more

Binomial Distribution

The binomial distribution is a probability distribution that summarizes the likelihood that a value will take one of two independent values. read more

Business Valuation , Methods, & Examples

Business valuation is the process of estimating the value of a business or company. read more

Chi-Square (χ2) Statistic

A chi-square (χ2) statistic is a test that measures how expectations compare to actual observed data (or model results). read more

Discrete Distribution

A discrete distribution is a statistical distribution that shows the probabilities of outcomes with finite values. read more

Fund Manager

Learn more about fund managers, who oversee a portfolio of mutual or hedge funds and make final decisions about how they are invested. read more

Kurtosis

Kurtosis is a statistical measure used to describe the distribution of observed data around the mean. It is sometimes referred to as the "volatility of volatility."  read more

Log-Normal Distribution

A log-normal distribution is a statistical distribution of logarithmic values from a related normal distribution. read more

Random Variable

A random variable is a variable whose value is unknown, or a function that assigns values to each of an experiment's outcomes. read more

Sampling Error

A sampling error is a statistical error that occurs when a sample does not represent the entire population. See how to avoid sampling errors in data analysis. read more